Suppr超能文献

一种从多重比对序列预测蛋白质二级结构的简单快速方法,准确率高于70%。

A simple and fast approach to prediction of protein secondary structure from multiply aligned sequences with accuracy above 70%.

作者信息

Mehta P K, Heringa J, Argos P

机构信息

European Molecular Biology Laboratory, Heidelberg, Germany.

出版信息

Protein Sci. 1995 Dec;4(12):2517-25. doi: 10.1002/pro.5560041208.

Abstract

To improve secondary structure predictions in protein sequences, the information residing in multiple sequence alignments of substituted but structurally related proteins is exploited. A database comprised of 70 protein families and a total of 2,500 sequences, some of which were aligned by tertiary structural superpositions, was used to calculate residue exchange weight matrices within alpha-helical, beta-strand, and coil substructures, respectively. Secondary structure predictions were made based on the observed residue substitutions in local regions of the multiple alignments and the largest possible associated exchange weights in each of the three matrix types. Comparison of the observed and predicted secondary structure on a per-residue basis yielded a mean accuracy of 72.2%. Individual alpha-helix, beta-strand, and coil states were respectively predicted at 66.7, and 75.8% correctness, representing a well-balanced three-state prediction. The accuracy level, verified by cross-validation through jack-knife tests on all protein families, dropped, on average, to only 70.9%, indicating the rigor of the prediction procedure. On the basis of robustness, conceptual clarity, accuracy, and executable efficiency, the method has considerable advantage, especially with its sole reliance on amino acid substitutions within structurally related proteins.

摘要

为了改进蛋白质序列中的二级结构预测,人们利用了存在于结构相关但经过替换的蛋白质的多序列比对中的信息。一个由70个蛋白质家族和总共2500个序列组成的数据库被用于分别计算α螺旋、β链和卷曲子结构内的残基交换权重矩阵,其中一些序列是通过三级结构叠加进行比对的。二级结构预测是基于多序列比对局部区域中观察到的残基替换以及三种矩阵类型中每种类型可能的最大相关交换权重进行的。在逐个残基的基础上对观察到的和预测的二级结构进行比较,得到的平均准确率为72.2%。α螺旋、β链和卷曲的单个状态分别以66.7%和75.8%的正确率被预测,代表了一个平衡良好的三状态预测。通过对所有蛋白质家族进行留一法交叉验证所验证的准确率水平平均仅降至70.9%,这表明了预测过程的严格性。基于稳健性、概念清晰度、准确性和可执行效率,该方法具有相当大的优势,特别是它仅依赖于结构相关蛋白质内的氨基酸替换。

相似文献

引用本文的文献

7
Probing protein fold space with a simplified model.用简化模型探索蛋白质折叠空间
J Mol Biol. 2008 Jan 25;375(4):920-33. doi: 10.1016/j.jmb.2007.10.087. Epub 2007 Nov 9.

本文引用的文献

7
A comprehensive set of sequence analysis programs for the VAX.一套适用于VAX的综合序列分析程序。
Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):387-95. doi: 10.1093/nar/12.1part1.387.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验