QMEANclust：通过结合复合评分函数与结构密度信息来估计蛋白质模型质量。

QMEANclust: estimation of protein model quality by combining a composite scoring function with structural density information.

作者信息

Benkert Pascal, Schwede Torsten, Tosatto Silvio Ce

机构信息

Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50/70, 4056 Basel, Switzerland.

出版信息

BMC Struct Biol. 2009 May 20;9:35. doi: 10.1186/1472-6807-9-35.

DOI:10.1186/1472-6807-9-35

PMID:19457232

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2709111/

Abstract

BACKGROUND

The selection of the most accurate protein model from a set of alternatives is a crucial step in protein structure prediction both in template-based and ab initio approaches. Scoring functions have been developed which can either return a quality estimate for a single model or derive a score from the information contained in the ensemble of models for a given sequence. Local structural features occurring more frequently in the ensemble have a greater probability of being correct. Within the context of the CASP experiment, these so called consensus methods have been shown to perform considerably better in selecting good candidate models, but tend to fail if the best models are far from the dominant structural cluster. In this paper we show that model selection can be improved if both approaches are combined by pre-filtering the models used during the calculation of the structural consensus.

RESULTS

Our recently published QMEAN composite scoring function has been improved by including an all-atom interaction potential term. The preliminary model ranking based on the new QMEAN score is used to select a subset of reliable models against which the structural consensus score is calculated. This scoring function called QMEANclust achieves a correlation coefficient of predicted quality score and GDT_TS of 0.9 averaged over the 98 CASP7 targets and perform significantly better in selecting good models from the ensemble of server models than any other groups participating in the quality estimation category of CASP7. Both scoring functions are also benchmarked on the MOULDER test set consisting of 20 target proteins each with 300 alternatives models generated by MODELLER. QMEAN outperforms all other tested scoring functions operating on individual models, while the consensus method QMEANclust only works properly on decoy sets containing a certain fraction of near-native conformations. We also present a local version of QMEAN for the per-residue estimation of model quality (QMEANlocal) and compare it to a new local consensus-based approach.

CONCLUSION

Improved model selection is obtained by using a composite scoring function operating on single models in order to enrich higher quality models which are subsequently used to calculate the structural consensus. The performance of consensus-based methods such as QMEANclust highly depends on the composition and quality of the model ensemble to be analysed. Therefore, performance estimates for consensus methods based on large meta-datasets (e.g. CASP) might overrate their applicability in more realistic modelling situations with smaller sets of models based on individual methods.

摘要

背景

从一组备选模型中选择最准确的蛋白质模型是基于模板和从头预测方法的蛋白质结构预测中的关键步骤。已经开发出评分函数，其既可以返回单个模型的质量估计值，也可以从给定序列的模型集合中包含的信息得出一个分数。在集合中更频繁出现的局部结构特征更有可能是正确的。在蛋白质结构预测关键评估（CASP）实验的背景下，这些所谓的一致性方法已被证明在选择良好候选模型方面表现得相当出色，但如果最佳模型远离主要结构簇，则往往会失败。在本文中，我们表明，如果通过在计算结构一致性期间对使用的模型进行预过滤来结合这两种方法，可以改进模型选择。

结果

我们最近发表的QMEAN复合评分函数通过纳入全原子相互作用势项得到了改进。基于新的QMEAN分数的初步模型排名用于选择一组可靠的模型，针对这些模型计算结构一致性分数。这个名为QMEANclust的评分函数在98个CASP7目标上平均预测质量分数与全局距离测试总分（GDT_TS）的相关系数达到0.9，并且在从服务器模型集合中选择良好模型方面比参与CASP7质量评估类别的任何其他团队表现都显著更好。这两个评分函数也在由20个目标蛋白质组成的MOULDER测试集上进行了基准测试，每个目标蛋白质有由MODELLER生成的300个备选模型。QMEAN优于所有其他对单个模型进行操作的测试评分函数，而一致性方法QMEANclust仅在包含一定比例近天然构象的诱饵集上能正常工作。我们还提出了一种用于逐个残基估计模型质量的局部版本的QMEAN（QMEANlocal），并将其与一种新的基于局部一致性的方法进行比较。

结论

通过使用对单个模型进行操作的复合评分函数来富集更高质量的模型，随后使用这些模型来计算结构一致性，从而实现了改进的模型选择。像QMEANclust这样基于一致性的方法的性能高度依赖于要分析的模型集合成分和质量。因此，基于大型元数据集（例如CASP）对一致性方法的性能估计可能会高估它们在基于单个方法的较小模型集的更实际建模情况下的适用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a28d/2709111/35ae257a8825/1472-6807-9-35-1.jpg

相似文献

QMEANclust: estimation of protein model quality by combining a composite scoring function with structural density information.QMEANclust：通过结合复合评分函数与结构密度信息来估计蛋白质模型质量。

BMC Struct Biol. 2009 May 20;9:35. doi: 10.1186/1472-6807-9-35.

Global and local model quality estimation at CASP8 using the scoring functions QMEAN and QMEANclust.使用评分函数 QMEAN 和 QMEANclust 在 CASP8 中进行全局和局部模型质量评估。

Proteins. 2009;77 Suppl 9:173-80. doi: 10.1002/prot.22532.

QMEAN server for protein model quality estimation.用于蛋白质模型质量评估的QMEAN服务器。

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W510-4. doi: 10.1093/nar/gkp322. Epub 2009 May 8.

SELECTpro: effective protein model selection using a structure-based energy function resistant to BLUNDERs.SELECTpro：使用基于结构的抗BLUNDERs能量函数进行有效的蛋白质模型选择。

BMC Struct Biol. 2008 Dec 3;8:52. doi: 10.1186/1472-6807-8-52.

QMEAN: A comprehensive scoring function for model quality assessment.QMEAN：一种用于模型质量评估的综合评分函数。

Proteins. 2008 Apr;71(1):261-77. doi: 10.1002/prot.21715.

Protein structural model selection by combining consensus and single scoring methods.通过组合共识和单评分方法选择蛋白质结构模型。

PLoS One. 2013 Sep 2;8(9):e74006. doi: 10.1371/journal.pone.0074006. eCollection 2013.

Improved protein structure selection using decoy-dependent discriminatory functions.使用诱饵依赖型判别函数改进蛋白质结构选择

BMC Struct Biol. 2004 Jun 18;4:8. doi: 10.1186/1472-6807-4-8.

Ranking predicted protein structures with support vector regression.使用支持向量回归对预测的蛋白质结构进行排名。

Proteins. 2008 May 15;71(3):1175-82. doi: 10.1002/prot.21809.

Improving a consensus approach for protein structure selection by removing redundancy.通过消除冗余来改进蛋白质结构选择的共识方法。

IEEE/ACM Trans Comput Biol Bioinform. 2011 Nov-Dec;8(6):1708-15. doi: 10.1109/TCBB.2011.75.

Improving predicted protein loop structure ranking using a Pareto-optimality consensus method.使用帕累托最优共识方法改进预测的蛋白质环结构排名。

BMC Struct Biol. 2010 Jul 20;10:22. doi: 10.1186/1472-6807-10-22.

引用本文的文献

Characterization of the SARS-CoV-2 coronavirus X4-like accessory protein.严重急性呼吸综合征冠状病毒2（SARS-CoV-2）X4样辅助蛋白的特性分析

Egypt J Med Hum Genet. 2021;22(1):48. doi: 10.1186/s43042-021-00160-1. Epub 2021 May 8.

Isolation, characterization, and cloning of thermostable pullulanase from ADM-11.从ADM-11中分离、鉴定和克隆耐热支链淀粉酶

Saudi J Biol Sci. 2024 Feb;31(2):103901. doi: 10.1016/j.sjbs.2023.103901. Epub 2023 Dec 10.

Illuminating the "Twilight Zone": Advances in Difficult Protein Modeling.阐明“混沌地带”：困难蛋白建模的进展。

Methods Mol Biol. 2023;2627:25-40. doi: 10.1007/978-1-0716-2974-1_2.

In Silico Methods for Identification of Potential Active Sites of Therapeutic Targets.基于计算机的方法鉴定治疗靶标潜在活性部位

Molecules. 2022 Oct 20;27(20):7103. doi: 10.3390/molecules27207103.

Improvement of the catalytic performance of glycerol kinase from Bacillus subtilis by chromosomal site-directed mutagenesis.通过染色体定点突变提高枯草芽孢杆菌甘油激酶的催化性能。

Biotechnol Lett. 2022 Sep;44(9):1051-1061. doi: 10.1007/s10529-022-03281-8. Epub 2022 Aug 3.

ABCA7, a Genetic Risk Factor Associated with Alzheimer's Disease Risk in African Americans.载脂蛋白 A7，与非裔美国人阿尔茨海默病风险相关的遗传风险因素。

J Alzheimers Dis. 2022;86(1):5-19. doi: 10.3233/JAD-215306.

Recombinant laccase rPOXA 1B real-time, accelerated and molecular dynamics stability study.重组漆酶 rPOXA1B 的实时、加速和分子动力学稳定性研究。

BMC Biotechnol. 2021 Jun 4;21(1):37. doi: 10.1186/s12896-021-00698-3.

Efficient rational modification of non-ribosomal peptides by adenylation domain substitution.通过腺苷化结构域替换对非核糖体肽进行高效合理修饰。

Nat Commun. 2020 Sep 11;11(1):4554. doi: 10.1038/s41467-020-18365-0.

Discovery of sulfone-resistant dihydropteroate synthase (DHPS) as a target enzyme for kaempferol, a natural flavanoid.发现抗砜二氢蝶酸合酶（DHPS）是天然类黄酮山奈酚的靶标酶。

Heliyon. 2020 Feb 12;6(2):e03378. doi: 10.1016/j.heliyon.2020.e03378. eCollection 2020 Feb.

The Novel Serine/Threonine Protein Kinase LmjF.22.0810 from may be Involved in the Resistance to Drugs such as Paromomycin.可能与对诸如巴龙霉素等药物的抗性有关的新型丝氨酸/苏氨酸蛋白激酶 LmjF.22.0810 来自。

Biomolecules. 2019 Nov 11;9(11):723. doi: 10.3390/biom9110723.

本文引用的文献

QMEAN server for protein model quality estimation.用于蛋白质模型质量评估的QMEAN服务器。

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W510-4. doi: 10.1093/nar/gkp322. Epub 2009 May 8.

How well can the accuracy of comparative protein structure models be predicted?比较蛋白质结构模型的准确性能被预测到什么程度？

Protein Sci. 2008 Nov;17(11):1881-93. doi: 10.1110/ps.036061.108. Epub 2008 Oct 1.

Threading without optimizing weighting factors for scoring function.在不优化评分函数加权因子的情况下进行线程处理。

Proteins. 2008 Nov 15;73(3):581-96. doi: 10.1002/prot.22082.

A multi-template combination algorithm for protein comparative modeling.一种用于蛋白质比较建模的多模板组合算法。

BMC Struct Biol. 2008 Mar 17;8:18. doi: 10.1186/1472-6807-8-18.

Estimating quality of template-based protein models by alignment stability.通过比对稳定性评估基于模板的蛋白质模型的质量。

Proteins. 2008 May 15;71(3):1255-74. doi: 10.1002/prot.21819.

Protein model quality assessment prediction by combining fragment comparisons and a consensus C(alpha) contact potential.通过结合片段比较和一致性Cα接触势进行蛋白质模型质量评估预测

Proteins. 2008 May 15;71(3):1211-8. doi: 10.1002/prot.21813.

Ranking predicted protein structures with support vector regression.使用支持向量回归对预测的蛋白质结构进行排名。

Proteins. 2008 May 15;71(3):1175-82. doi: 10.1002/prot.21809.

High-resolution structure prediction and the crystallographic phase problem.高分辨率结构预测与晶体学相位问题。

Nature. 2007 Nov 8;450(7167):259-64. doi: 10.1038/nature06249. Epub 2007 Oct 14.

QMEAN: A comprehensive scoring function for model quality assessment.QMEAN：一种用于模型质量评估的综合评分函数。

Proteins. 2008 Apr;71(1):261-77. doi: 10.1002/prot.21715.

Critical assessment of methods of protein structure prediction-Round VII.蛋白质结构预测方法的批判性评估——第七轮。

Proteins. 2007;69 Suppl 8(S8):3-9. doi: 10.1002/prot.21767.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

QMEANclust：通过结合复合评分函数与结构密度信息来估计蛋白质模型质量。

QMEANclust: estimation of protein model quality by combining a composite scoring function with structural density information.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献