• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质二级结构预测准确率高于70%。

Prediction of protein secondary structure at better than 70% accuracy.

作者信息

Rost B, Sander C

机构信息

European Molecular Biology Laboratory, Heidelberg, Germany.

出版信息

J Mol Biol. 1993 Jul 20;232(2):584-99. doi: 10.1006/jmbi.1993.1413.

DOI:10.1006/jmbi.1993.1413
PMID:8345525
Abstract

We have trained a two-layered feed-forward neural network on a non-redundant data base of 130 protein chains to predict the secondary structure of water-soluble proteins. A new key aspect is the use of evolutionary information in the form of multiple sequence alignments that are used as input in place of single sequences. The inclusion of protein family information in this form increases the prediction accuracy by six to eight percentage points. A combination of three levels of networks results in an overall three-state accuracy of 70.8% for globular proteins (sustained performance). If four membrane protein chains are included in the evaluation, the overall accuracy drops to 70.2%. The prediction is well balanced between alpha-helix, beta-strand and loop: 65% of the observed strand residues are predicted correctly. The accuracy in predicting the content of three secondary structure types is comparable to that of circular dichroism spectroscopy. The performance accuracy is verified by a sevenfold cross-validation test, and an additional test on 26 recently solved proteins. Of particular practical importance is the definition of a position-specific reliability index. For half of the residues predicted with a high level of reliability the overall accuracy increases to better than 82%. A further strength of the method is the more realistic prediction of segment length. The protein family prediction method is available for testing by academic researchers via an electronic mail server.

摘要

我们在一个由130条蛋白质链组成的非冗余数据库上训练了一个两层前馈神经网络,以预测水溶性蛋白质的二级结构。一个新的关键方面是使用多序列比对形式的进化信息,将其用作输入来替代单序列。以这种形式纳入蛋白质家族信息可将预测准确率提高6至8个百分点。三个层次网络的组合使得球状蛋白质的整体三态准确率达到70.8%(持续性能)。如果在评估中纳入四条膜蛋白链,整体准确率降至70.2%。该预测在α螺旋、β链和环之间实现了良好的平衡:65%的观察到的链残基被正确预测。预测三种二级结构类型含量的准确率与圆二色光谱法相当。通过七重交叉验证测试以及对26个最近解析的蛋白质进行的额外测试,验证了性能准确率。特别具有实际重要性的是位置特异性可靠性指数的定义。对于一半预测可靠性高的残基,整体准确率提高到82%以上。该方法的另一个优势是对片段长度的预测更符合实际情况。学术研究人员可通过电子邮件服务器对蛋白质家族预测方法进行测试。

相似文献

1
Prediction of protein secondary structure at better than 70% accuracy.蛋白质二级结构预测准确率高于70%。
J Mol Biol. 1993 Jul 20;232(2):584-99. doi: 10.1006/jmbi.1993.1413.
2
Combining evolutionary information and neural networks to predict protein secondary structure.结合进化信息与神经网络预测蛋白质二级结构。
Proteins. 1994 May;19(1):55-72. doi: 10.1002/prot.340190108.
3
A neural network method for prediction of beta-turn types in proteins using evolutionary information.一种利用进化信息预测蛋白质中β-转角类型的神经网络方法。
Bioinformatics. 2004 Nov 1;20(16):2751-8. doi: 10.1093/bioinformatics/bth322. Epub 2004 May 14.
4
Protein secondary structure prediction using local alignments.利用局部比对进行蛋白质二级结构预测。
J Mol Biol. 1997 Apr 25;268(1):31-6. doi: 10.1006/jmbi.1997.0958.
5
Role of evolutionary information in prediction of aromatic-backbone NH interactions in proteins.进化信息在预测蛋白质中芳香族主链NH相互作用中的作用。
FEBS Lett. 2004 Apr 23;564(1-2):47-57. doi: 10.1016/S0014-5793(04)00305-9.
6
Transmembrane helices predicted at 95% accuracy.预测的跨膜螺旋准确率达95%。
Protein Sci. 1995 Mar;4(3):521-33. doi: 10.1002/pro.5560040318.
7
Highly accurate and consistent method for prediction of helix and strand content from primary protein sequences.一种从蛋白质一级序列预测螺旋和链含量的高度准确且一致的方法。
Artif Intell Med. 2005 Sep-Oct;35(1-2):19-35. doi: 10.1016/j.artmed.2005.02.006.
8
Real value prediction of solvent accessibility in proteins using multiple sequence alignment and secondary structure.利用多序列比对和二级结构对蛋白质中溶剂可及性进行实际值预测。
Proteins. 2005 Nov 1;61(2):318-24. doi: 10.1002/prot.20630.
9
Prediction of alpha-turns in proteins using PSI-BLAST profiles and secondary structure information.利用PSI-BLAST序列谱和二级结构信息预测蛋白质中的α-转角
Proteins. 2004 Apr 1;55(1):83-90. doi: 10.1002/prot.10569.
10
Combining prediction of secondary structure and solvent accessibility in proteins.蛋白质二级结构预测与溶剂可及性预测相结合。
Proteins. 2005 May 15;59(3):467-75. doi: 10.1002/prot.20441.

引用本文的文献

1
PLMFit: benchmarking transfer learning with protein language models for protein engineering.PLMFit:使用蛋白质语言模型进行蛋白质工程的迁移学习基准测试
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf381.
2
Chameleonic Nature of Aβ: Implications for Alzheimer's and Other Amyloid Diseases.Aβ的变色龙特性:对阿尔茨海默病及其他淀粉样疾病的影响
Bioessays. 2025 Sep;47(9):e70039. doi: 10.1002/bies.70039. Epub 2025 Jul 11.
3
AFToolkit: a framework for molecular modeling of proteins with AlphaFold-derived representations.AFToolkit:一个用于基于AlphaFold衍生表示进行蛋白质分子建模的框架。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf324.
4
Enhancing Functional Protein Design Using Heuristic Optimization and Deep Learning for Anti-Inflammatory and Gene Therapy Applications.利用启发式优化和深度学习增强功能性蛋白质设计以用于抗炎和基因治疗应用
Proteins. 2025 Jul;93(7):1238-1256. doi: 10.1002/prot.26810. Epub 2025 Feb 22.
5
Bilingual language model for protein sequence and structure.用于蛋白质序列和结构的双语语言模型。
NAR Genom Bioinform. 2024 Nov 15;6(4):lqae150. doi: 10.1093/nargab/lqae150. eCollection 2024 Dec.
6
Importance of Secondary Structure Data in Large Scale Protein Modeling Using Low-Resolution SURPASS Method.使用低分辨率 SURPASS 方法进行大规模蛋白质建模中二级结构数据的重要性。
Methods Mol Biol. 2025;2867:55-78. doi: 10.1007/978-1-0716-4196-5_4.
7
Improving Protein Secondary Structure Prediction by Deep Language Models and Transformer Networks.深度学习语言模型和变换网络在蛋白质二级结构预测中的改进。
Methods Mol Biol. 2025;2867:43-53. doi: 10.1007/978-1-0716-4196-5_3.
8
The Iconic α-Helix: From Pauling to the Present.标志性的α-螺旋:从鲍林到现在。
Methods Mol Biol. 2025;2867:1-17. doi: 10.1007/978-1-0716-4196-5_1.
9
Assessing the role of evolutionary information for enhancing protein language model embeddings.评估进化信息在增强蛋白质语言模型嵌入中的作用。
Sci Rep. 2024 Sep 5;14(1):20692. doi: 10.1038/s41598-024-71783-8.
10
Identification of the protease inhibitory domain of Trichinella spiralis novel cystatin (TsCstN).鉴定旋毛虫新型半胱氨酸蛋白酶抑制剂(TsCstN)的蛋白酶抑制结构域。
Parasites Hosts Dis. 2024 Aug;62(3):330-341. doi: 10.3347/PHD.24026. Epub 2024 Aug 26.