Reczko M, Bohr H
DKFZ-German Cancer Research Center, Heidelberg, Germany.
Nucleic Acids Res. 1994 Sep;22(17):3616-9.
A new method for predicting protein fold-classes and protein domains from sequence data is constructed and used for generating a data base of protein fold-class assignments. Any given sequence of amino acids is assigned a specific prediction of one out of 45 typical protein fold-classes, a prediction of one out of 4 super fold-classes for the content of secondary structures and a profile of fold-class predictions along the sequence. The prediction accuracy for the super fold-classes is around 91% correct and 82% correct for the specific fold-classes. This accuracy is maintained down to a few percent of sequence identity.
构建了一种从序列数据预测蛋白质折叠类和蛋白质结构域的新方法,并用于生成蛋白质折叠类分配的数据库。任何给定的氨基酸序列都被赋予45种典型蛋白质折叠类中一种的特定预测、二级结构含量的4种超级折叠类中一种的预测以及沿序列的折叠类预测概况。超级折叠类的预测准确率约为91%正确,特定折叠类的预测准确率为82%正确。这种准确率在序列同一性低至百分之几时仍能保持。