The other 90% of the protein: assessment beyond the Calphas for CASP8 template-based and high-accuracy models.

Affiliation

Department of Biochemistry, Duke University Medical Center, Durham, North Carolina 27710, USA.

Publication information

Proteins. 2009;77 Suppl 9(Suppl 9):29-49. doi: 10.1002/prot.22551.

DOI:10.1002/prot.22551

PMID:19731372

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2877634/

Abstract

For template-based modeling in the CASP8 Critical Assessment of Techniques for Protein Structure Prediction, this work develops and applies six new full-model metrics. They are designed to complement and add value to the traditional template-based assessment by the global distance test (GDT) and related scores (based on multiple superpositions of Calpha atoms between target structure and predictions labeled "Model 1"). The new metrics evaluate each predictor group on each target, using all atoms of their best model with above-average GDT. Two metrics evaluate how "protein-like" the predicted model is: the MolProbity score used for validating experimental structures, and a mainchain reality score using all-atom steric clashes, bond length and angle outliers, and backbone dihedrals. Four other new metrics evaluate match of model to target for mainchain and sidechain hydrogen bonds, sidechain end positioning, and sidechain rotamers. Group-average Z-score across the six full-model measures is averaged with group-average GDT Z-score to produce the overall ranking for full-model, high-accuracy performance. Separate assessments are reported for specific aspects of predictor-group performance, such as robustness of approximately correct template or fold identification, and self-scoring ability at identifying the best of their models. Fold identification is distinct from but correlated with group-average GDT Z-score if target difficulty is taken into account, whereas self-scoring is done best by servers and is uncorrelated with GDT performance. Outstanding individual models on specific targets are identified and discussed. Predictor groups excelled at different aspects, highlighting the diversity of current methodologies. However, good full-model scores correlate robustly with high Calpha accuracy.
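
The ranking step described in the abstract (per-metric Z-scores averaged by group, then combined with the group-average GDT Z-score) can be illustrated with a minimal sketch. This is not the assessors' code: the function names, the input layout, the use of the population standard deviation, and the absence of any outlier or minimum-target filtering are all assumptions made for illustration. Scores are assumed to be oriented so that higher is better; a lower-is-better measure such as the MolProbity score would need its sign flipped first.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Z-score each group's raw value against all groups on one target."""
    mu = mean(values.values())
    sigma = pstdev(values.values())
    if sigma == 0:
        # All groups scored the same: no group stands out on this target.
        return {group: 0.0 for group in values}
    return {group: (v - mu) / sigma for group, v in values.items()}


def group_average_z(per_target):
    """Map {target: {group: raw score}} to {group: mean Z over its targets}."""
    collected = {}
    for scores in per_target.values():
        for group, z in z_scores(scores).items():
            collected.setdefault(group, []).append(z)
    return {group: mean(zs) for group, zs in collected.items()}


def overall_ranking(full_model, gdt):
    """Rank groups by the mean of (average full-model Z) and (GDT Z).

    full_model: {metric: {target: {group: score}}}  (hypothetical layout)
    gdt:        {target: {group: score}}
    """
    metric_z = [group_average_z(t) for t in full_model.values()]
    gdt_z = group_average_z(gdt)
    combined = {}
    for group, gz in gdt_z.items():
        fm = [m[group] for m in metric_z if group in m]
        if fm:  # skip groups with no full-model scores at all
            combined[group] = (mean(fm) + gz) / 2
    return sorted(combined, key=combined.get, reverse=True)


# Made-up numbers for three hypothetical groups on two hypothetical targets.
example_gdt = {
    "T1": {"A": 90, "B": 70, "C": 50},
    "T2": {"A": 80, "B": 60, "C": 40},
}
example_full_model = {
    "metric_1": {
        "T1": {"A": 3, "B": 2, "C": 1},
        "T2": {"A": 3, "B": 2, "C": 1},
    },
}
print(overall_ranking(example_full_model, example_gdt))
```

Averaging the full-model measures first and only then combining with GDT gives the Cα-based score the same weight as all full-model measures together, which matches the wording of the abstract; weighting each measure equally with GDT would be a different design.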

