• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质折叠识别的基准共识模型质量评估

Benchmarking consensus model quality assessment for protein fold recognition.

作者信息

McGuffin Liam J

机构信息

The School of Biological Sciences, University of Reading, Whiteknights, Reading RG6 6AS, UK.

出版信息

BMC Bioinformatics. 2007 Sep 18;8:345. doi: 10.1186/1471-2105-8-345.

DOI:10.1186/1471-2105-8-345
PMID:17877795
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2048972/
Abstract

BACKGROUND

Selecting the highest quality 3D model of a protein structure from a number of alternatives remains an important challenge in the field of structural bioinformatics. Many Model Quality Assessment Programs (MQAPs) have been developed which adopt various strategies in order to tackle this problem, ranging from the so called "true" MQAPs capable of producing a single energy score based on a single model, to methods which rely on structural comparisons of multiple models or additional information from meta-servers. However, it is clear that no current method can separate the highest accuracy models from the lowest consistently. In this paper, a number of the top performing MQAP methods are benchmarked in the context of the potential value that they add to protein fold recognition. Two novel methods are also described: ModSSEA, which based on the alignment of predicted secondary structure elements and ModFOLD which combines several true MQAP methods using an artificial neural network.

RESULTS

The ModSSEA method is found to be an effective model quality assessment program for ranking multiple models from many servers, however further accuracy can be gained by using the consensus approach of ModFOLD. The ModFOLD method is shown to significantly outperform the true MQAPs tested and is competitive with methods which make use of clustering or additional information from multiple servers. Several of the true MQAPs are also shown to add value to most individual fold recognition servers by improving model selection, when applied as a post filter in order to re-rank models.

CONCLUSION

MQAPs should be benchmarked appropriately for the practical context in which they are intended to be used. Clustering based methods are the top performing MQAPs where many models are available from many servers; however, they often do not add value to individual fold recognition servers when limited models are available. Conversely, the true MQAP methods tested can often be used as effective post filters for re-ranking few models from individual fold recognition servers and further improvements can be achieved using a consensus of these methods.

摘要

背景

从多个备选模型中选择蛋白质结构的最高质量3D模型,仍然是结构生物信息学领域的一项重大挑战。已经开发了许多模型质量评估程序(MQAP),它们采用各种策略来解决这个问题,从能够基于单个模型产生单一能量分数的所谓“真正的”MQAP,到依赖于多个模型的结构比较或来自元服务器的附加信息的方法。然而,很明显,目前没有一种方法能够始终如一地将最高准确性的模型与最低准确性的模型区分开来。在本文中,一些表现最佳的MQAP方法在它们为蛋白质折叠识别增加的潜在价值的背景下进行了基准测试。还描述了两种新方法:基于预测二级结构元件比对的ModSSEA和使用人工神经网络结合多种真正MQAP方法的ModFOLD。

结果

发现ModSSEA方法是一种有效的模型质量评估程序,可用于对来自多个服务器的多个模型进行排名,但是通过使用ModFOLD的共识方法可以获得更高的准确性。结果表明,ModFOLD方法明显优于所测试的真正MQAP,并且与使用聚类或来自多个服务器的附加信息的方法具有竞争力。当作为后过滤器应用以重新对模型进行排名时,一些真正的MQAP还通过改进模型选择为大多数单个折叠识别服务器增加了价值。

结论

MQAP应针对其预期使用的实际背景进行适当的基准测试。基于聚类的方法是表现最佳的MQAP,当有来自多个服务器的许多模型时;然而,当可用模型有限时,它们通常不会为单个折叠识别服务器增加价值。相反,所测试的真正MQAP方法通常可以用作有效的后过滤器,用于对来自单个折叠识别服务器的少数模型进行重新排名,并且通过这些方法的共识可以实现进一步的改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c089/2048972/5005644970aa/1471-2105-8-345-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c089/2048972/9cd0ab4781ab/1471-2105-8-345-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c089/2048972/be1dd48d0217/1471-2105-8-345-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c089/2048972/58c1c17fb31b/1471-2105-8-345-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c089/2048972/5005644970aa/1471-2105-8-345-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c089/2048972/9cd0ab4781ab/1471-2105-8-345-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c089/2048972/be1dd48d0217/1471-2105-8-345-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c089/2048972/58c1c17fb31b/1471-2105-8-345-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c089/2048972/5005644970aa/1471-2105-8-345-4.jpg

相似文献

1
Benchmarking consensus model quality assessment for protein fold recognition.蛋白质折叠识别的基准共识模型质量评估
BMC Bioinformatics. 2007 Sep 18;8:345. doi: 10.1186/1471-2105-8-345.
2
Rapid model quality assessment for protein structure predictions using the comparison of multiple models without structural alignments.使用不进行结构比对的多模型比较进行蛋白质结构预测的快速模型质量评估。
Bioinformatics. 2010 Jan 15;26(2):182-8. doi: 10.1093/bioinformatics/btp629. Epub 2009 Nov 6.
3
Improving sequence-based fold recognition by using 3D model quality assessment.通过使用三维模型质量评估来改进基于序列的折叠识别。
Bioinformatics. 2005 Sep 1;21(17):3509-15. doi: 10.1093/bioinformatics/bti540. Epub 2005 Jun 14.
4
3D-SHOTGUN: a novel, cooperative, fold-recognition meta-predictor.3D-霰弹枪:一种新型的、协作式的折叠识别元预测器。
Proteins. 2003 May 15;51(3):434-41. doi: 10.1002/prot.10357.
5
Prediction of global and local model quality in CASP8 using the ModFOLD server.利用 ModFOLD 服务器预测 CASP8 中的全局和局部模型质量。
Proteins. 2009;77 Suppl 9:185-90. doi: 10.1002/prot.22491.
6
A "FRankenstein's monster" approach to comparative modeling: merging the finest fragments of Fold-Recognition models and iterative model refinement aided by 3D structure evaluation.一种用于比较建模的“科学怪人”方法:融合折叠识别模型的最佳片段,并借助三维结构评估进行迭代模型优化。
Proteins. 2003;53 Suppl 6:369-79. doi: 10.1002/prot.10545.
7
Exploring the extremes of sequence/structure space with ensemble fold recognition in the program Phyre.在Phyre程序中使用集成折叠识别方法探索序列/结构空间的极限。
Proteins. 2008 Feb 15;70(3):611-25. doi: 10.1002/prot.21688.
8
Pcons5: combining consensus, structural evaluation and fold recognition scores.Pcons5:结合共识、结构评估和折叠识别分数。
Bioinformatics. 2005 Dec 1;21(23):4248-54. doi: 10.1093/bioinformatics/bti702. Epub 2005 Oct 4.
9
Prediction of global and local model quality in CASP7 using Pcons and ProQ.使用Pcons和ProQ预测CASP7中的全局和局部模型质量。
Proteins. 2007;69 Suppl 8:184-93. doi: 10.1002/prot.21774.
10
Assessment of progress over the CASP experiments.对蛋白质结构预测关键评估(CASP)实验进展的评估。
Proteins. 2003;53 Suppl 6:585-95. doi: 10.1002/prot.10530.

引用本文的文献

1
Recent advances and challenges in protein complex model accuracy estimation.蛋白质复合物模型准确性评估的最新进展与挑战
Comput Struct Biotechnol J. 2024 Apr 21;23:1824-1832. doi: 10.1016/j.csbj.2024.04.049. eCollection 2024 Dec.
2
A gated graph transformer for protein complex structure quality assessment and its performance in CASP15.门控图转换器用于蛋白质复合物结构质量评估及其在 CASP15 中的性能。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i308-i317. doi: 10.1093/bioinformatics/btad203.
3
Usage of a dataset of NMR resolved protein structures to test aggregation versus solubility prediction algorithms.

本文引用的文献

1
High throughput profile-profile based fold recognition for the entire human proteome.针对整个人类蛋白质组的基于高通量图谱-图谱的折叠识别。
BMC Bioinformatics. 2006 Jun 7;7:288. doi: 10.1186/1471-2105-7-288.
2
A composite score for predicting errors in protein structure models.用于预测蛋白质结构模型错误的综合评分。
Protein Sci. 2006 Jul;15(7):1653-66. doi: 10.1110/ps.062095806. Epub 2006 Jun 2.
3
Servers for protein structure prediction.蛋白质结构预测服务器。
使用核磁共振解析的蛋白质结构数据集来测试聚集与溶解度预测算法。
Protein Sci. 2017 Sep;26(9):1864-1869. doi: 10.1002/pro.3225. Epub 2017 Jul 15.
4
ModFOLD6: an accurate web server for the global and local quality estimation of 3D protein models.ModFOLD6:一个用于全局和局部 3D 蛋白质模型质量评估的准确的网络服务器。
Nucleic Acids Res. 2017 Jul 3;45(W1):W416-W421. doi: 10.1093/nar/gkx332.
5
Prediction of Local Quality of Protein Structure Models Considering Spatial Neighbors in Graphical Models.基于图形模型中空间邻域考虑预测局部蛋白质结构模型的质量。
Sci Rep. 2017 Jan 11;7:40629. doi: 10.1038/srep40629.
6
The structure of the catalytic domain of a plant cellulose synthase and its assembly into dimers.植物纤维素合酶催化结构域的结构及其组装成二聚体的过程。
Plant Cell. 2014 Jul;26(7):2996-3009. doi: 10.1105/tpc.114.126862. Epub 2014 Jul 10.
7
SMOQ: a tool for predicting the absolute residue-specific quality of a single protein model with support vector machines.SMOQ:一种使用支持向量机预测单个蛋白质模型绝对残基特异性质量的工具。
BMC Bioinformatics. 2014 Apr 28;15:120. doi: 10.1186/1471-2105-15-120.
8
Molecular replacement then and now.分子置换的过去与现在。
Acta Crystallogr D Biol Crystallogr. 2013 Nov;69(Pt 11):2266-75. doi: 10.1107/S0907444913011426. Epub 2013 Oct 18.
9
QA-RecombineIt: a server for quality assessment and recombination of protein models.QA-RecombineIt:用于蛋白质模型质量评估和重组的服务器。
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W389-97. doi: 10.1093/nar/gkt408. Epub 2013 May 21.
10
The ModFOLD4 server for the quality assessment of 3D protein models.ModFOLD4 服务器用于评估 3D 蛋白质模型的质量。
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W368-72. doi: 10.1093/nar/gkt294. Epub 2013 Apr 25.
Curr Opin Struct Biol. 2006 Apr;16(2):178-82. doi: 10.1016/j.sbi.2006.03.004. Epub 2006 Mar 20.
4
The victor/FRST function for model quality estimation.用于模型质量评估的胜利者/FRST函数。
J Comput Biol. 2005 Dec;12(10):1316-27. doi: 10.1089/cmb.2005.12.1316.
5
Prediction of novel and analogous folds using fragment assembly and fold recognition.使用片段组装和折叠识别预测新型和类似折叠结构
Proteins. 2005;61 Suppl 7:143-151. doi: 10.1002/prot.20731.
6
Improving sequence-based fold recognition by using 3D model quality assessment.通过使用三维模型质量评估来改进基于序列的折叠识别。
Bioinformatics. 2005 Sep 1;21(17):3509-15. doi: 10.1093/bioinformatics/bti540. Epub 2005 Jun 14.
7
TM-align: a protein structure alignment algorithm based on the TM-score.TM-align:一种基于TM分数的蛋白质结构比对算法。
Nucleic Acids Res. 2005 Apr 22;33(7):2302-9. doi: 10.1093/nar/gki524. Print 2005.
8
Scoring function for automated assessment of protein structure template quality.用于自动评估蛋白质结构模板质量的评分函数。
Proteins. 2004 Dec 1;57(4):702-10. doi: 10.1002/prot.20264.
9
Automatic consensus-based fold recognition using Pcons, ProQ, and Pmodeller.使用Pcons、ProQ和Pmodeller基于共识的自动折叠识别。
Proteins. 2003;53 Suppl 6:534-41. doi: 10.1002/prot.10536.
10
3D-Jury: a simple approach to improve protein structure predictions.3D评审:一种改进蛋白质结构预测的简单方法。
Bioinformatics. 2003 May 22;19(8):1015-8. doi: 10.1093/bioinformatics/btg124.