评估的评估：对CASP10中模型质量评估的评价

Assessment of the assessment: evaluation of the model quality estimates in CASP10.

作者信息

Kryshtafovych Andriy, Barbato Alessandro, Fidelis Krzysztof, Monastyrskyy Bohdan, Schwede Torsten, Tramontano Anna

机构信息

Genome Center, University of California, Davis, 95616 California, USA.

出版信息

Proteins. 2014 Feb;82 Suppl 2(0 2):112-26. doi: 10.1002/prot.24347. Epub 2013 Aug 31.

DOI:10.1002/prot.24347

PMID:23780644

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4406045/

Abstract

The article presents an assessment of the ability of the thirty-seven model quality assessment (MQA) methods participating in CASP10 to provide an a priori estimation of the quality of structural models, and of the 67 tertiary structure prediction groups to provide confidence estimates for their predicted coordinates. The assessment of MQA predictors is based on the methods used in previous CASPs, such as correlation between the predicted and observed quality of the models (both at the global and local levels), accuracy of methods in distinguishing between good and bad models as well as good and bad regions within them, and ability to identify the best models in the decoy sets. Several numerical evaluations were used in our analysis for the first time, such as comparison of global and local quality predictors with reference (baseline) predictors and a ROC analysis of the predictors' ability to differentiate between the well and poorly modeled regions. For the evaluation of the reliability of self-assessment of the coordinate errors, we used the correlation between the predicted and observed deviations of the coordinates and a ROC analysis of correctly identified errors in the models. A modified two-stage procedure for testing MQA methods in CASP10 whereby a small number of models spanning the whole range of model accuracy was released first followed by the release of a larger number of models of more uniform quality, allowed a more thorough analysis of abilities and inabilities of different types of methods. Clustering methods were shown to have an advantage over the single- and quasi-single- model methods on the larger datasets. At the same time, the evaluation revealed that the size of the dataset has smaller influence on the global quality assessment scores (for both clustering and nonclustering methods), than its diversity. Narrowing the quality range of the assessed models caused significant decrease in accuracy of ranking for global quality predictors but essentially did not change the results for local predictors. Self-assessment error estimates submitted by the majority of groups were poor overall, with two research groups showing significantly better results than the remaining ones.

摘要

本文对参与蛋白质结构预测关键评估第10轮（CASP10）的37种模型质量评估（MQA）方法给出结构模型质量先验估计的能力，以及67个三级结构预测团队对其预测坐标给出可信度估计的能力进行了评估。对MQA预测方法的评估基于以往CASP中使用的方法，如模型预测质量与观测质量之间的相关性（包括全局和局部层面）、区分好坏模型以及模型内部好坏区域的方法准确性，以及在诱饵集中识别最佳模型的能力。我们的分析首次使用了几种数值评估方法，如将全局和局部质量预测器与参考（基线）预测器进行比较，以及对预测器区分建模良好和较差区域能力的ROC分析。为评估坐标误差自我评估的可靠性，我们使用了坐标预测偏差与观测偏差之间的相关性以及对模型中正确识别误差的ROC分析。在CASP10中测试MQA方法的一种改进的两阶段程序，即先发布少量涵盖整个模型准确性范围的模型，随后发布大量质量更均匀的模型，使得能够更全面地分析不同类型方法的能力和不足。在更大的数据集上，聚类方法显示出优于单模型和准单模型方法的优势。同时，评估表明，数据集的大小对全局质量评估分数（对于聚类和非聚类方法）的影响小于其多样性。缩小评估模型的质量范围会导致全局质量预测器排名准确性显著下降，但基本不会改变局部预测器的结果。大多数团队提交的自我评估误差估计总体较差，有两个研究团队的结果明显优于其他团队。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecc0/4406045/579f3277cb8c/nihms678601f1.jpg

相似文献

Assessment of the assessment: evaluation of the model quality estimates in CASP10.评估的评估：对CASP10中模型质量评估的评价

Proteins. 2014 Feb;82 Suppl 2(0 2):112-26. doi: 10.1002/prot.24347. Epub 2013 Aug 31.

Methods of model accuracy estimation can help selecting the best models from decoy sets: Assessment of model accuracy estimations in CASP11.模型准确性估计方法有助于从诱饵集中选择最佳模型：CASP11中模型准确性估计的评估。

Proteins. 2016 Sep;84 Suppl 1(Suppl 1):349-69. doi: 10.1002/prot.24919. Epub 2015 Sep 28.

Assessment of model accuracy estimations in CASP12.在蛋白质结构预测技术关键评估（CASP）12中对模型准确性估计的评估。

Proteins. 2018 Mar;86 Suppl 1(Suppl 1):345-360. doi: 10.1002/prot.25371. Epub 2017 Sep 8.

Assessment of protein disorder region predictions in CASP10.CASP10中蛋白质无序区域预测的评估

Proteins. 2014 Feb;82 Suppl 2(0 2):127-37. doi: 10.1002/prot.24391. Epub 2013 Nov 22.

Evaluation of model quality predictions in CASP9.CASP9 模型质量预测评估。

Proteins. 2011;79 Suppl 10(Suppl 10):91-106. doi: 10.1002/prot.23180. Epub 2011 Oct 14.

Evaluation of residue-residue contact prediction in CASP10.蛋白质结构预测关键评估第10轮（CASP10）中残基-残基接触预测的评估

Proteins. 2014 Feb;82 Suppl 2(0 2):138-53. doi: 10.1002/prot.24340. Epub 2013 Aug 31.

Evaluation of CASP8 model quality predictions.CASP8 模型质量预测评估。

Proteins. 2009;77 Suppl 9:157-66. doi: 10.1002/prot.22534.

Assessment of protein model structure accuracy estimation in CASP13: Challenges in the era of deep learning.评估 CASP13 中蛋白质模型结构准确性估计：深度学习时代的挑战。

Proteins. 2019 Dec;87(12):1351-1360. doi: 10.1002/prot.25804. Epub 2019 Aug 30.

CASP prediction center infrastructure and evaluation measures in CASP10 and CASP ROLL.CASP10和CASP ROLL中的CASP预测中心基础设施及评估措施。

Proteins. 2014 Feb;82 Suppl 2(0 2):7-13. doi: 10.1002/prot.24399. Epub 2013 Oct 18.

Designing and evaluating the MULTICOM protein local and global model quality prediction methods in the CASP10 experiment.在蛋白质结构预测关键评估第10轮（CASP10）实验中设计并评估多组件蛋白质局部和全局模型质量预测方法。

BMC Struct Biol. 2014 Apr 15;14:13. doi: 10.1186/1472-6807-14-13.

引用本文的文献

Recent advances and challenges in protein complex model accuracy estimation.蛋白质复合物模型准确性评估的最新进展与挑战

Comput Struct Biotechnol J. 2024 Apr 21;23:1824-1832. doi: 10.1016/j.csbj.2024.04.049. eCollection 2024 Dec.

PSSP-MFFNet: A Multifeature Fusion Network for Protein Secondary Structure Prediction.PSSP-MFFNet：一种用于蛋白质二级结构预测的多特征融合网络。

ACS Omega. 2024 Jan 25;9(5):5985-5994. doi: 10.1021/acsomega.3c10230. eCollection 2024 Feb 6.

Challenges in bridging the gap between protein structure prediction and functional interpretation.弥合蛋白质结构预测与功能解释之间差距所面临的挑战。

Proteins. 2025 Jan;93(1):400-410. doi: 10.1002/prot.26614. Epub 2023 Oct 18.

iQDeep: an integrated web server for protein scoring using multiscale deep learning models.iQDeep：一个使用多尺度深度学习模型的蛋白质评分的集成网络服务器。

J Mol Biol. 2023 Jul 15;435(14):168057. doi: 10.1016/j.jmb.2023.168057. Epub 2023 Mar 23.

New prediction categories in CASP15.CASP15 中的新预测类别。

Proteins. 2023 Dec;91(12):1550-1557. doi: 10.1002/prot.26515. Epub 2023 Jun 12.

Estimation of model accuracy by a unique set of features and tree-based regressor.通过一组独特的特征和基于树的回归器来估计模型的准确性。

Sci Rep. 2022 Aug 18;12(1):14074. doi: 10.1038/s41598-022-17097-z.

Prediction of protein secondary structure based on an improved channel attention and multiscale convolution module.基于改进的通道注意力和多尺度卷积模块的蛋白质二级结构预测

Front Bioeng Biotechnol. 2022 Jul 22;10:901018. doi: 10.3389/fbioe.2022.901018. eCollection 2022.

Modeling SARS-CoV-2 proteins in the CASP-commons experiment.在 CASP-commons 实验中模拟 SARS-CoV-2 蛋白。

Proteins. 2021 Dec;89(12):1987-1996. doi: 10.1002/prot.26231. Epub 2021 Oct 5.

Assessment of protein model structure accuracy estimation in CASP14: Old and new challenges.评估 CASP14 中蛋白质模型结构准确性估计：新老挑战。

Proteins. 2021 Dec;89(12):1940-1948. doi: 10.1002/prot.26192. Epub 2021 Aug 5.

Decoy selection for protein structure prediction via extreme gradient boosting and ranking.通过极端梯度提升和排序选择蛋白质结构预测的诱饵。

BMC Bioinformatics. 2020 Dec 9;21(Suppl 1):189. doi: 10.1186/s12859-020-3523-9.

本文引用的文献

Definition and classification of evaluation units for CASP10.CASP10评估单元的定义与分类。

Proteins. 2014 Feb;82 Suppl 2(0 2):14-25. doi: 10.1002/prot.24434. Epub 2013 Nov 22.

Improved model quality assessment using ProQ2.使用 ProQ2 提高模型质量评估。

BMC Bioinformatics. 2012 Sep 10;13:224. doi: 10.1186/1471-2105-13-224.

GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction.GOAP：一种广义的、基于取向的、全原子蛋白质结构预测统计势能。

Biophys J. 2011 Oct 19;101(8):2043-52. doi: 10.1016/j.bpj.2011.09.012.

Assessment of template based protein structure predictions in CASP9.评估基于模板的蛋白质结构预测在 CASP9 中的表现。

Proteins. 2011;79 Suppl 10:37-58. doi: 10.1002/prot.23177. Epub 2011 Oct 15.

Critical assessment of methods of protein structure prediction (CASP)--round IX.蛋白质结构预测方法的关键评估（CASP）——第九轮。

Proteins. 2011;79 Suppl 10(0 10):1-5. doi: 10.1002/prot.23200. Epub 2011 Oct 14.

MUFOLD-WQA: A new selective consensus method for quality assessment in protein structure prediction.MUFOLD-WQA：一种新的蛋白质结构预测中用于质量评估的选择性共识方法。

Proteins. 2011;79 Suppl 10(Suppl 10):185-95. doi: 10.1002/prot.23185. Epub 2011 Oct 14.

CASP9 results compared to those of previous CASP experiments.与之前的 CASP 实验相比，CASP9 的结果。

Proteins. 2011;79 Suppl 10(0 10):196-207. doi: 10.1002/prot.23182. Epub 2011 Oct 14.

Evaluation of model quality predictions in CASP9.CASP9 模型质量预测评估。

Proteins. 2011;79 Suppl 10(Suppl 10):91-106. doi: 10.1002/prot.23180. Epub 2011 Oct 14.

The IntFOLD server: an integrated web resource for protein fold recognition, 3D model quality assessment, intrinsic disorder prediction, domain prediction and ligand binding site prediction.IntFOLD 服务器：一个集成的蛋白质折叠识别、3D 模型质量评估、固有无序预测、结构域预测和配体结合位点预测的网络资源。

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W171-6. doi: 10.1093/nar/gkr184. Epub 2011 Mar 31.

Toward the estimation of the absolute quality of individual protein structure models.朝着估计个体蛋白质结构模型的绝对质量的方向努力。

Bioinformatics. 2011 Feb 1;27(3):343-50. doi: 10.1093/bioinformatics/btq662. Epub 2010 Dec 5.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

评估的评估：对CASP10中模型质量评估的评价

Assessment of the assessment: evaluation of the model quality estimates in CASP10.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献