Alexandrov N N, Mironov A A
Genetics of Microorganisms Institute, Moscow, USSR.
Nucleic Acids Res. 1990 Apr 11;18(7):1847-52. doi: 10.1093/nar/18.7.1847.
An algorithm from the pattern recognition theory 'generalized portrait' was used to find a distinguishing vector (scoring matrix) for E. coli promoters. We have attempted to solve three closely linked problems: (i) the selection of significant features of the signal; (ii) subsequent multiple alignment and (iii) calculation of the vector coordinates. Promoters with known strength have been successfully ranked in the correct order using this vector. We demonstrate the use of this method in predicting the location of promoters. A revised consensus promoter sequence is also presented.
运用模式识别理论“广义画像”中的一种算法来寻找大肠杆菌启动子的区分向量(评分矩阵)。我们试图解决三个紧密相关的问题:(i)信号显著特征的选择;(ii)随后的多序列比对;以及(iii)向量坐标的计算。利用该向量已成功地将已知强度的启动子按正确顺序进行了排序。我们展示了该方法在预测启动子位置方面的应用。还给出了一个修订后的共有启动子序列。