Levin J M, Pascarella S, Argos P, Garnier J
Unité d'Ingénierie des Protéines, Biotechnologies, INRA, Jouy-en-Josas, France.
Protein Eng. 1993 Nov;6(8):849-54. doi: 10.1093/protein/6.8.849.
The use of multiple sequence alignments for secondary structure predictions is analysed. Seven different protein families, containing only sequences of known structure, were considered to provide a range of alignment and prediction conditions. Using alignments obtained by spatial superposition of main chain atoms in known tertiary protein structures allowed a mean of 8% in secondary structure prediction accuracy, when compared to those obtained from the individual sequences. Substitution of these alignments by those determined directly from an automated sequence alignment algorithm showed variations in the prediction accuracy which correlated with the quality of the multiple alignments and distance of the primary sequence. Secondary structure predictions can be reliably improved using alignments from an automatic alignment procedure with a mean increase of 6.8%, giving an overall prediction accuracy of 68.5%, if there is a minimum of 25% sequence identity between all sequences in a family.
分析了使用多序列比对进行二级结构预测的情况。考虑了七个不同的蛋白质家族,这些家族仅包含已知结构的序列,以提供一系列比对和预测条件。与从单个序列获得的比对相比,使用通过已知三级蛋白质结构中主链原子的空间叠加获得的比对,二级结构预测准确率平均提高了8%。用直接从自动序列比对算法确定的比对替换这些比对后,预测准确率出现了变化,这与多序列比对的质量和一级序列的距离相关。如果一个家族中所有序列之间的序列同一性至少为25%,则使用自动比对程序得到的比对可以可靠地提高二级结构预测,平均提高6.8%,总体预测准确率达到68.5%。