• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质二级结构预测的多序列方法评估与改进

Evaluation and improvement of multiple sequence methods for protein secondary structure prediction.

作者信息

Cuff J A, Barton G J

机构信息

Laboratory of Molecular Biophysics, Oxford, United Kingdom.

出版信息

Proteins. 1999 Mar 1;34(4):508-19. doi: 10.1002/(sici)1097-0134(19990301)34:4<508::aid-prot10>3.0.co;2-4.

DOI:10.1002/(sici)1097-0134(19990301)34:4<508::aid-prot10>3.0.co;2-4
PMID:10081963
Abstract

A new dataset of 396 protein domains is developed and used to evaluate the performance of the protein secondary structure prediction algorithms DSC, PHD, NNSSP, and PREDATOR. The maximum theoretical Q3 accuracy for combination of these methods is shown to be 78%. A simple consensus prediction on the 396 domains, with automatically generated multiple sequence alignments gives an average Q3 prediction accuracy of 72.9%. This is a 1% improvement over PHD, which was the best single method evaluated. Segment Overlap Accuracy (SOV) is 75.4% for the consensus method on the 396-protein set. The secondary structure definition method DSSP defines 8 states, but these are reduced by most authors to 3 for prediction. Application of the different published 8- to 3-state reduction methods shows variation of over 3% on apparent prediction accuracy. This suggests that care should be taken to compare methods by the same reduction method. Two new sequence datasets (CB513 and CB251) are derived which are suitable for cross-validation of secondary structure prediction methods without artifacts due to internal homology. A fully automatic World Wide Web service that predicts protein secondary structure by a combination of methods is available via http://barton.ebi.ac.uk/.

摘要

开发了一个包含396个蛋白质结构域的新数据集,并用于评估蛋白质二级结构预测算法DSC、PHD、NNSSP和PREDATOR的性能。这些方法组合的最大理论Q3准确率显示为78%。对396个结构域进行简单的一致性预测,并自动生成多序列比对,得到的平均Q3预测准确率为72.9%。这比评估的最佳单一方法PHD提高了1%。对于396个蛋白质集的一致性方法,片段重叠准确率(SOV)为75.4%。二级结构定义方法DSSP定义了8种状态,但大多数作者将其简化为3种用于预测。应用不同的已发表的8到3状态简化方法,表观预测准确率的变化超过3%。这表明在通过相同的简化方法比较方法时应谨慎。推导了两个新的序列数据集(CB513和CB251),它们适用于蛋白质二级结构预测方法的交叉验证,且不存在由于内部同源性导致的假象。通过http://barton.ebi.ac.uk/ 可获得一个通过方法组合预测蛋白质二级结构的全自动万维网服务。

相似文献

1
Evaluation and improvement of multiple sequence methods for protein secondary structure prediction.蛋白质二级结构预测的多序列方法评估与改进
Proteins. 1999 Mar 1;34(4):508-19. doi: 10.1002/(sici)1097-0134(19990301)34:4<508::aid-prot10>3.0.co;2-4.
2
Multiple linear regression for protein secondary structure prediction.用于蛋白质二级结构预测的多元线性回归
Proteins. 2001 May 15;43(3):256-9. doi: 10.1002/prot.1036.
3
Protein secondary structure prediction using local alignments.利用局部比对进行蛋白质二级结构预测。
J Mol Biol. 1997 Apr 25;268(1):31-6. doi: 10.1006/jmbi.1997.0958.
4
Application of multiple sequence alignment profiles to improve protein secondary structure prediction.应用多序列比对轮廓来改进蛋白质二级结构预测。
Proteins. 2000 Aug 15;40(3):502-11. doi: 10.1002/1097-0134(20000815)40:3<502::aid-prot170>3.0.co;2-q.
5
Context-based features enhance protein secondary structure prediction accuracy.基于上下文的特征提高了蛋白质二级结构预测的准确性。
J Chem Inf Model. 2014 Mar 24;54(3):992-1002. doi: 10.1021/ci400647u. Epub 2014 Mar 12.
6
JPred: a consensus secondary structure prediction server.JPred:一个一致性二级结构预测服务器。
Bioinformatics. 1998;14(10):892-3. doi: 10.1093/bioinformatics/14.10.892.
7
ProtEST: protein multiple sequence alignments from expressed sequence tags.ProtEST:来自表达序列标签的蛋白质多序列比对
Bioinformatics. 2000 Feb;16(2):111-6. doi: 10.1093/bioinformatics/16.2.111.
8
OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.OXBench:一种用于评估蛋白质多序列比对准确性的基准。
BMC Bioinformatics. 2003 Oct 10;4:47. doi: 10.1186/1471-2105-4-47.
9
Evaluation of methods for predicting the topology of beta-barrel outer membrane proteins and a consensus prediction method.β-桶状外膜蛋白拓扑结构预测方法的评估及一种共识预测方法
BMC Bioinformatics. 2005 Jan 12;6:7. doi: 10.1186/1471-2105-6-7.
10
Prediction of beta-turns in proteins from multiple alignment using neural network.利用神经网络从多序列比对预测蛋白质中的β-转角。
Protein Sci. 2003 Mar;12(3):627-34. doi: 10.1110/ps.0228903.

引用本文的文献

1
Combining knowledge distillation and neural networks to predict protein secondary structure.结合知识蒸馏与神经网络预测蛋白质二级结构。
Sci Rep. 2025 Aug 31;15(1):32031. doi: 10.1038/s41598-025-17513-0.
2
In silico prediction of variant effects: promises and limitations for precision plant breeding.变异效应的计算机模拟预测:精准植物育种的前景与局限
Theor Appl Genet. 2025 Jul 28;138(8):193. doi: 10.1007/s00122-025-04973-1.
3
DeepPredict: a state-of-the-art web server for protein secondary structure and relative solvent accessibility prediction.
DeepPredict:用于蛋白质二级结构和相对溶剂可及性预测的先进网络服务器。
Front Bioinform. 2025 Jun 6;5:1607402. doi: 10.3389/fbinf.2025.1607402. eCollection 2025.
4
Advancements in one-dimensional protein structure prediction using machine learning and deep learning.利用机器学习和深度学习进行一维蛋白质结构预测的进展。
Comput Struct Biotechnol J. 2025 Apr 3;27:1416-1430. doi: 10.1016/j.csbj.2025.04.005. eCollection 2025.
5
Post-processing enhances protein secondary structure prediction with second order deep learning and embeddings.后处理通过二阶深度学习和嵌入增强蛋白质二级结构预测。
Comput Struct Biotechnol J. 2025 Jan 2;27:243-251. doi: 10.1016/j.csbj.2024.12.022. eCollection 2025.
6
MHTAPred-SS: A Highly Targeted Autoencoder-Driven Deep Multi-Task Learning Framework for Accurate Protein Secondary Structure Prediction.MHTAPred-SS:一种用于准确蛋白质二级结构预测的高度靶向的自动编码器驱动的深度多任务学习框架。
Int J Mol Sci. 2024 Dec 15;25(24):13444. doi: 10.3390/ijms252413444.
7
Prediction of Protein Secondary Structures Based on Substructural Descriptors of Molecular Fragments.基于分子片段亚结构描述符的蛋白质二级结构预测
Int J Mol Sci. 2024 Nov 21;25(23):12525. doi: 10.3390/ijms252312525.
8
S-PLM: Structure-Aware Protein Language Model via Contrastive Learning Between Sequence and Structure.S-PLM:通过序列与结构之间的对比学习实现的结构感知蛋白质语言模型
Adv Sci (Weinh). 2025 Feb;12(5):e2404212. doi: 10.1002/advs.202404212. Epub 2024 Dec 12.
9
ILMCNet: A Deep Neural Network Model That Uses PLM to Process Features and Employs CRF to Predict Protein Secondary Structure.ILMCNet:一种利用 PLM 处理特征并采用 CRF 预测蛋白质二级结构的深度神经网络模型。
Genes (Basel). 2024 Oct 21;15(10):1350. doi: 10.3390/genes15101350.
10
Impact of Multi-Factor Features on Protein Secondary Structure Prediction.多因素特征对蛋白质二级结构预测的影响。
Biomolecules. 2024 Sep 13;14(9):1155. doi: 10.3390/biom14091155.