评估 CASP13 中蛋白质模型结构准确性估计：深度学习时代的挑战。

Assessment of protein model structure accuracy estimation in CASP13: Challenges in the era of deep learning.

机构信息

Department of Chemistry, Seoul National University, Seoul, Republic of Korea.

Genome Center, University of California, Davis, California.

出版信息

Proteins. 2019 Dec;87(12):1351-1360. doi: 10.1002/prot.25804. Epub 2019 Aug 30.

DOI:10.1002/prot.25804

PMID:31436360

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6851486/

Abstract

Scoring model structure is an essential component of protein structure prediction that can affect the prediction accuracy tremendously. Users of protein structure prediction results also need to score models to select the best models for their application studies. In Critical Assessment of techniques for protein Structure Prediction (CASP), model accuracy estimation methods have been tested in a blind fashion by providing models submitted by the tertiary structure prediction servers for scoring. In CASP13, model accuracy estimation results were evaluated in terms of both global and local structure accuracy. Global structure accuracy estimation was evaluated by the quality of the models selected by the global structure scores and by the absolute estimates of the global scores. Residue-wise, local structure accuracy estimations were evaluated by three different measures. A new measure introduced in CASP13 evaluates the ability to predict inaccurately modeled regions that may be improved by refinement. An intensive comparative analysis on CASP13 and the previous CASPs revealed that the tertiary structure models generated by the CASP13 servers show very distinct features. Higher consensus toward models of higher global accuracy appeared even for free modeling targets, and many models of high global accuracy were not well optimized at the atomic level. This is related to the new technology in CASP13, deep learning for tertiary contact prediction. The tertiary model structures generated by deep learning pose a new challenge for EMA (estimation of model accuracy) method developers. Model accuracy estimation itself is also an area where deep learning can potentially have an impact, although current EMA methods have not fully explored that direction.

摘要

评分模型结构是蛋白质结构预测的重要组成部分，它可以极大地影响预测的准确性。蛋白质结构预测结果的使用者也需要对模型进行评分，以选择最适合其应用研究的模型。在蛋白质结构预测技术的关键评估 (Critical Assessment of techniques for protein Structure Prediction, CASP) 中，通过提供由三级结构预测服务器提交的模型进行评分，以盲法测试模型准确性估计方法。在 CASP13 中，从全局和局部结构准确性两个方面评估模型准确性估计结果。全局结构准确性估计通过全局结构得分选择的模型的质量和全局得分的绝对估计来评估。在残基水平上，通过三种不同的方法评估局部结构准确性估计。CASP13 中引入的一种新方法评估了预测不准确建模区域的能力，这些区域可以通过细化来改进。对 CASP13 和之前的 CASP 的深入比较分析表明，CASP13 服务器生成的三级结构模型具有非常明显的特征。即使对于免费建模目标，更高的全局准确性模型的共识度也更高，许多全局准确性较高的模型在原子水平上并没有得到很好的优化。这与 CASP13 中的新技术，即三级接触预测的深度学习有关。深度学习生成的三级模型结构给 EMA（模型准确性估计）方法开发者带来了新的挑战。模型准确性估计本身也是深度学习可能产生影响的一个领域，尽管目前的 EMA 方法尚未充分探索这一方向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a15c/6851486/2d061b112564/nihms-1537912-f0001.jpg

相似文献

Assessment of protein model structure accuracy estimation in CASP13: Challenges in the era of deep learning.评估 CASP13 中蛋白质模型结构准确性估计：深度学习时代的挑战。

Proteins. 2019 Dec;87(12):1351-1360. doi: 10.1002/prot.25804. Epub 2019 Aug 30.

Estimation of model accuracy in CASP13.CASP13 模型精度估计。

Proteins. 2019 Dec;87(12):1361-1377. doi: 10.1002/prot.25767. Epub 2019 Jul 16.

Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。

Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.

A further leap of improvement in tertiary structure prediction in CASP13 prompts new routes for future assessments.在 CASP13 中，三级结构预测的进一步改进促使未来评估有了新的途径。

Proteins. 2019 Dec;87(12):1100-1112. doi: 10.1002/prot.25787. Epub 2019 Aug 7.

Analysis of distance-based protein structure prediction by deep learning in CASP13.基于深度学习的 CASP13 蛋白质结构预测距离分析。

Proteins. 2019 Dec;87(12):1069-1081. doi: 10.1002/prot.25810. Epub 2019 Sep 13.

Deep-learning contact-map guided protein structure prediction in CASP13.深度学习接触图指导的 CASP13 蛋白质结构预测。

Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14.

Small angle X-ray scattering-assisted protein structure prediction in CASP13 and emergence of solution structure differences.小角 X 射线散射辅助的蛋白质结构预测在 CASP13 中的应用和溶液结构差异的出现。

Proteins. 2019 Dec;87(12):1298-1314. doi: 10.1002/prot.25827. Epub 2019 Oct 16.

Assessment of protein model structure accuracy estimation in CASP14: Old and new challenges.评估 CASP14 中蛋白质模型结构准确性估计：新老挑战。

Proteins. 2021 Dec;89(12):1940-1948. doi: 10.1002/prot.26192. Epub 2021 Aug 5.

Critical assessment of methods of protein structure prediction (CASP)-Round XIII.蛋白质结构预测方法的关键评估（CASP）-第十三轮。

Proteins. 2019 Dec;87(12):1011-1020. doi: 10.1002/prot.25823. Epub 2019 Oct 23.

Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14.基于深度学习和残差距离预测的蛋白质模型准确性估计在 CASP14 中的应用。

Sci Rep. 2021 May 25;11(1):10943. doi: 10.1038/s41598-021-90303-6.

引用本文的文献

Recent advances and challenges in protein complex model accuracy estimation.蛋白质复合物模型准确性评估的最新进展与挑战

Comput Struct Biotechnol J. 2024 Apr 21;23:1824-1832. doi: 10.1016/j.csbj.2024.04.049. eCollection 2024 Dec.

Assessing protein model quality based on deep graph coupled networks using protein language model.基于蛋白质语言模型的深度图耦合网络评估蛋白质模型质量。

Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad420.

Challenges in bridging the gap between protein structure prediction and functional interpretation.弥合蛋白质结构预测与功能解释之间差距所面临的挑战。

Proteins. 2025 Jan;93(1):400-410. doi: 10.1002/prot.26614. Epub 2023 Oct 18.

iQDeep: an integrated web server for protein scoring using multiscale deep learning models.iQDeep：一个使用多尺度深度学习模型的蛋白质评分的集成网络服务器。

J Mol Biol. 2023 Jul 15;435(14):168057. doi: 10.1016/j.jmb.2023.168057. Epub 2023 Mar 23.

New prediction categories in CASP15.CASP15 中的新预测类别。

Proteins. 2023 Dec;91(12):1550-1557. doi: 10.1002/prot.26515. Epub 2023 Jun 12.

Protein Model Quality Estimation Using Molecular Dynamics Simulation.使用分子动力学模拟进行蛋白质模型质量评估。

ACS Omega. 2022 Jul 5;7(28):24274-24281. doi: 10.1021/acsomega.2c01475. eCollection 2022 Jul 19.

Predicting residue-specific qualities of individual protein models using residual neural networks and graph neural networks.使用残差神经网络和图神经网络预测个体蛋白质模型的残基特异性性质。

Proteins. 2022 Dec;90(12):2091-2102. doi: 10.1002/prot.26400. Epub 2022 Jul 30.

Fast and effective protein model refinement using deep graph neural networks.使用深度图神经网络进行快速有效的蛋白质模型优化。

Nat Comput Sci. 2021 Jul;1(7):462-469. doi: 10.1038/s43588-021-00098-9. Epub 2021 Jul 15.

Ins and outs of AlphaFold2 transmembrane protein structure predictions.AlphaFold2 跨膜蛋白结构预测的来龙去脉。

Cell Mol Life Sci. 2022 Jan 15;79(1):73. doi: 10.1007/s00018-021-04112-1.

Critical assessment of methods of protein structure prediction (CASP)-Round XIV.蛋白质结构预测方法的关键性评估（CASP）-第十四轮。

Proteins. 2021 Dec;89(12):1607-1617. doi: 10.1002/prot.26237. Epub 2021 Oct 7.

本文引用的文献

CASP13 target classification into tertiary structure prediction categories.CASP13 目标分类到三级结构预测类别。

Proteins. 2019 Dec;87(12):1021-1036. doi: 10.1002/prot.25775. Epub 2019 Jul 24.

Estimation of model accuracy in CASP13.CASP13 模型精度估计。

Proteins. 2019 Dec;87(12):1361-1377. doi: 10.1002/prot.25767. Epub 2019 Jul 16.

Comparative analysis of methods for evaluation of protein models against native structures.评估蛋白质模型与天然结构一致性的方法比较分析。

Bioinformatics. 2019 Mar 15;35(6):937-944. doi: 10.1093/bioinformatics/bty760.

An analysis and evaluation of the WeFold collaborative for protein structure prediction and its pipelines in CASP11 and CASP12.对 WeFold 协作进行蛋白质结构预测及其在 CASP11 和 CASP12 中的管道的分析和评估。

Sci Rep. 2018 Jul 2;8(1):9939. doi: 10.1038/s41598-018-26812-8.

Assessment of model accuracy estimations in CASP12.在蛋白质结构预测技术关键评估（CASP）12中对模型准确性估计的评估。

Proteins. 2018 Mar;86 Suppl 1(Suppl 1):345-360. doi: 10.1002/prot.25371. Epub 2017 Sep 8.

Protein structure determination using metagenome sequence data.利用宏基因组序列数据进行蛋白质结构测定。

Science. 2017 Jan 20;355(6322):294-298. doi: 10.1126/science.aah4043.

Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测

PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.

ProQ3: Improved model quality assessments using Rosetta energy terms.ProQ3：使用 Rosetta 能量项改进模型质量评估。

Sci Rep. 2016 Oct 4;6:33509. doi: 10.1038/srep33509.

Methods of model accuracy estimation can help selecting the best models from decoy sets: Assessment of model accuracy estimations in CASP11.模型准确性估计方法有助于从诱饵集中选择最佳模型：CASP11中模型准确性估计的评估。

Proteins. 2016 Sep;84 Suppl 1(Suppl 1):349-69. doi: 10.1002/prot.24919. Epub 2015 Sep 28.

CCMpred--fast and precise prediction of protein residue-residue contacts from correlated mutations.CCMpred--快速准确地预测蛋白质残基-残基接触的相关突变。

Bioinformatics. 2014 Nov 1;30(21):3128-30. doi: 10.1093/bioinformatics/btu500. Epub 2014 Jul 26.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

评估 CASP13 中蛋白质模型结构准确性估计：深度学习时代的挑战。

Assessment of protein model structure accuracy estimation in CASP13: Challenges in the era of deep learning.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献