Suppr超能文献

评估 CASP13 中蛋白质模型结构准确性估计:深度学习时代的挑战。

Assessment of protein model structure accuracy estimation in CASP13: Challenges in the era of deep learning.

机构信息

Department of Chemistry, Seoul National University, Seoul, Republic of Korea.

Genome Center, University of California, Davis, California.

出版信息

Proteins. 2019 Dec;87(12):1351-1360. doi: 10.1002/prot.25804. Epub 2019 Aug 30.

Abstract

Scoring model structure is an essential component of protein structure prediction that can affect the prediction accuracy tremendously. Users of protein structure prediction results also need to score models to select the best models for their application studies. In Critical Assessment of techniques for protein Structure Prediction (CASP), model accuracy estimation methods have been tested in a blind fashion by providing models submitted by the tertiary structure prediction servers for scoring. In CASP13, model accuracy estimation results were evaluated in terms of both global and local structure accuracy. Global structure accuracy estimation was evaluated by the quality of the models selected by the global structure scores and by the absolute estimates of the global scores. Residue-wise, local structure accuracy estimations were evaluated by three different measures. A new measure introduced in CASP13 evaluates the ability to predict inaccurately modeled regions that may be improved by refinement. An intensive comparative analysis on CASP13 and the previous CASPs revealed that the tertiary structure models generated by the CASP13 servers show very distinct features. Higher consensus toward models of higher global accuracy appeared even for free modeling targets, and many models of high global accuracy were not well optimized at the atomic level. This is related to the new technology in CASP13, deep learning for tertiary contact prediction. The tertiary model structures generated by deep learning pose a new challenge for EMA (estimation of model accuracy) method developers. Model accuracy estimation itself is also an area where deep learning can potentially have an impact, although current EMA methods have not fully explored that direction.

摘要

评分模型结构是蛋白质结构预测的重要组成部分,它可以极大地影响预测的准确性。蛋白质结构预测结果的使用者也需要对模型进行评分,以选择最适合其应用研究的模型。在蛋白质结构预测技术的关键评估 (Critical Assessment of techniques for protein Structure Prediction, CASP) 中,通过提供由三级结构预测服务器提交的模型进行评分,以盲法测试模型准确性估计方法。在 CASP13 中,从全局和局部结构准确性两个方面评估模型准确性估计结果。全局结构准确性估计通过全局结构得分选择的模型的质量和全局得分的绝对估计来评估。在残基水平上,通过三种不同的方法评估局部结构准确性估计。CASP13 中引入的一种新方法评估了预测不准确建模区域的能力,这些区域可以通过细化来改进。对 CASP13 和之前的 CASP 的深入比较分析表明,CASP13 服务器生成的三级结构模型具有非常明显的特征。即使对于免费建模目标,更高的全局准确性模型的共识度也更高,许多全局准确性较高的模型在原子水平上并没有得到很好的优化。这与 CASP13 中的新技术,即三级接触预测的深度学习有关。深度学习生成的三级模型结构给 EMA(模型准确性估计)方法开发者带来了新的挑战。模型准确性估计本身也是深度学习可能产生影响的一个领域,尽管目前的 EMA 方法尚未充分探索这一方向。

相似文献

1
Assessment of protein model structure accuracy estimation in CASP13: Challenges in the era of deep learning.
Proteins. 2019 Dec;87(12):1351-1360. doi: 10.1002/prot.25804. Epub 2019 Aug 30.
2
Estimation of model accuracy in CASP13.
Proteins. 2019 Dec;87(12):1361-1377. doi: 10.1002/prot.25767. Epub 2019 Jul 16.
3
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.
Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.
4
A further leap of improvement in tertiary structure prediction in CASP13 prompts new routes for future assessments.
Proteins. 2019 Dec;87(12):1100-1112. doi: 10.1002/prot.25787. Epub 2019 Aug 7.
5
Analysis of distance-based protein structure prediction by deep learning in CASP13.
Proteins. 2019 Dec;87(12):1069-1081. doi: 10.1002/prot.25810. Epub 2019 Sep 13.
6
Deep-learning contact-map guided protein structure prediction in CASP13.
Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14.
8
Assessment of protein model structure accuracy estimation in CASP14: Old and new challenges.
Proteins. 2021 Dec;89(12):1940-1948. doi: 10.1002/prot.26192. Epub 2021 Aug 5.
9
Critical assessment of methods of protein structure prediction (CASP)-Round XIII.
Proteins. 2019 Dec;87(12):1011-1020. doi: 10.1002/prot.25823. Epub 2019 Oct 23.

引用本文的文献

1
Recent advances and challenges in protein complex model accuracy estimation.
Comput Struct Biotechnol J. 2024 Apr 21;23:1824-1832. doi: 10.1016/j.csbj.2024.04.049. eCollection 2024 Dec.
3
Challenges in bridging the gap between protein structure prediction and functional interpretation.
Proteins. 2025 Jan;93(1):400-410. doi: 10.1002/prot.26614. Epub 2023 Oct 18.
4
iQDeep: an integrated web server for protein scoring using multiscale deep learning models.
J Mol Biol. 2023 Jul 15;435(14):168057. doi: 10.1016/j.jmb.2023.168057. Epub 2023 Mar 23.
5
New prediction categories in CASP15.
Proteins. 2023 Dec;91(12):1550-1557. doi: 10.1002/prot.26515. Epub 2023 Jun 12.
6
Protein Model Quality Estimation Using Molecular Dynamics Simulation.
ACS Omega. 2022 Jul 5;7(28):24274-24281. doi: 10.1021/acsomega.2c01475. eCollection 2022 Jul 19.
8
Fast and effective protein model refinement using deep graph neural networks.
Nat Comput Sci. 2021 Jul;1(7):462-469. doi: 10.1038/s43588-021-00098-9. Epub 2021 Jul 15.
9
Ins and outs of AlphaFold2 transmembrane protein structure predictions.
Cell Mol Life Sci. 2022 Jan 15;79(1):73. doi: 10.1007/s00018-021-04112-1.
10
Critical assessment of methods of protein structure prediction (CASP)-Round XIV.
Proteins. 2021 Dec;89(12):1607-1617. doi: 10.1002/prot.26237. Epub 2021 Oct 7.

本文引用的文献

1
CASP13 target classification into tertiary structure prediction categories.
Proteins. 2019 Dec;87(12):1021-1036. doi: 10.1002/prot.25775. Epub 2019 Jul 24.
2
Estimation of model accuracy in CASP13.
Proteins. 2019 Dec;87(12):1361-1377. doi: 10.1002/prot.25767. Epub 2019 Jul 16.
3
Comparative analysis of methods for evaluation of protein models against native structures.
Bioinformatics. 2019 Mar 15;35(6):937-944. doi: 10.1093/bioinformatics/bty760.
5
Assessment of model accuracy estimations in CASP12.
Proteins. 2018 Mar;86 Suppl 1(Suppl 1):345-360. doi: 10.1002/prot.25371. Epub 2017 Sep 8.
6
Protein structure determination using metagenome sequence data.
Science. 2017 Jan 20;355(6322):294-298. doi: 10.1126/science.aah4043.
7
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
8
ProQ3: Improved model quality assessments using Rosetta energy terms.
Sci Rep. 2016 Oct 4;6:33509. doi: 10.1038/srep33509.
10
CCMpred--fast and precise prediction of protein residue-residue contacts from correlated mutations.
Bioinformatics. 2014 Nov 1;30(21):3128-30. doi: 10.1093/bioinformatics/btu500. Epub 2014 Jul 26.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验