能否识别出正确的蛋白质模型？

Can correct protein models be identified?

作者信息

Wallner Björn, Elofsson Arne

机构信息

Stockholm Bioinformatics Center, SCFAB, Stockholm University, SE-106 91 Stockholm, Sweden.

出版信息

Protein Sci. 2003 May;12(5):1073-86. doi: 10.1110/ps.0236803.

DOI:10.1110/ps.0236803

PMID:12717029

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2323877/

Abstract

The ability to separate correct models of protein structures from less correct models is of the greatest importance for protein structure prediction methods. Several studies have examined the ability of different types of energy function to detect the native, or native-like, protein structure from a large set of decoys. In contrast to earlier studies, we examine here the ability to detect models that only show limited structural similarity to the native structure. These correct models are defined by the existence of a fragment that shows significant similarity between this model and the native structure. It has been shown that the existence of such fragments is useful for comparing the performance between different fold recognition methods and that this performance correlates well with performance in fold recognition. We have developed ProQ, a neural-network-based method to predict the quality of a protein model that extracts structural features, such as frequency of atom-atom contacts, and predicts the quality of a model, as measured either by LGscore or MaxSub. We show that ProQ performs at least as well as other measures when identifying the native structure and is better at the detection of correct models. This performance is maintained over several different test sets. ProQ can also be combined with the Pcons fold recognition predictor (Pmodeller) to increase its performance, with the main advantage being the elimination of a few high-scoring incorrect models. Pmodeller was successful in CASP5 and results from the latest LiveBench, LiveBench-6, indicating that Pmodeller has a higher specificity than Pcons alone.

摘要

对于蛋白质结构预测方法而言，将正确的蛋白质结构模型与不太正确的模型区分开来的能力至关重要。已有多项研究考察了不同类型能量函数从大量诱饵结构中检测天然或类天然蛋白质结构的能力。与早期研究不同，我们在此考察检测那些仅与天然结构呈现有限结构相似性的模型的能力。这些正确模型由一个片段的存在来定义，该片段在此模型与天然结构之间呈现出显著相似性。研究表明，此类片段的存在对于比较不同折叠识别方法之间的性能很有用，并且这种性能与折叠识别中的性能密切相关。我们开发了ProQ，这是一种基于神经网络的方法，用于预测蛋白质模型的质量，它提取诸如原子 - 原子接触频率等结构特征，并根据LGscore或MaxSub来预测模型的质量。我们表明，在识别天然结构时，ProQ的表现至少与其他方法一样好，并且在检测正确模型方面更出色。这种性能在几个不同的测试集上都得以保持。ProQ还可以与Pcons折叠识别预测器（Pmodeller）相结合以提高其性能，主要优势在于消除了一些高分的错误模型。Pmodeller在CASP5以及最新的LiveBench（LiveBench - 6）中取得了成功，这表明Pmodeller比单独的Pcons具有更高的特异性。

相似文献

Can correct protein models be identified?

Protein Sci. 2003 May;12(5):1073-86. doi: 10.1110/ps.0236803.

Automatic consensus-based fold recognition using Pcons, ProQ, and Pmodeller.

Proteins. 2003;53 Suppl 6:534-41. doi: 10.1002/prot.10536.

Combining evolutionary and structural information for local protein structure prediction.

Proteins. 2004 Sep 1;56(4):782-94. doi: 10.1002/prot.20158.

Assessment of global and local model quality in CASP8 using Pcons and ProQ.

Proteins. 2009;77 Suppl 9:167-72. doi: 10.1002/prot.22476.

Protein secondary structure prediction with dihedral angles.

Proteins. 2005 May 15;59(3):476-81. doi: 10.1002/prot.20435.

Prediction of global and local model quality in CASP7 using Pcons and ProQ.

Proteins. 2007;69 Suppl 8:184-93. doi: 10.1002/prot.21774.

Pcons: a neural-network-based consensus predictor that improves fold recognition.

Protein Sci. 2001 Nov;10(11):2354-62. doi: 10.1110/ps.08501.

Pcons5: combining consensus, structural evaluation and fold recognition scores.

Bioinformatics. 2005 Dec 1;21(23):4248-54. doi: 10.1093/bioinformatics/bti702. Epub 2005 Oct 4.

Network properties of decoys and CASP predicted models: a comparison with native protein structures.

Mol Biosyst. 2013 Jul;9(7):1774-88. doi: 10.1039/c3mb70157c. Epub 2013 May 22.

Protein structure evaluation using an all-atom energy based empirical scoring function.

J Biomol Struct Dyn. 2006 Feb;23(4):385-406. doi: 10.1080/07391102.2006.10531234.

引用本文的文献

Estimating protein complex model accuracy using graph transformers and pairwise similarity graphs.

Bioinform Adv. 2025 Jul 29;5(1):vbaf180. doi: 10.1093/bioadv/vbaf180. eCollection 2025.

Identification of novel drug targets and small molecule discovery for MRSA infections.

Front Bioinform. 2025 Apr 15;5:1562596. doi: 10.3389/fbinf.2025.1562596. eCollection 2025.

The Prominent Role of Serines 302/307 in the Activity and Stability of Human Caspase9: Appraisal of the S302D and S307D Variants.

Biochem Genet. 2025 Mar 9. doi: 10.1007/s10528-025-11076-5.

Estimating Protein Complex Model Accuracy Using Graph Transformers and Pairwise Similarity Graphs.

bioRxiv. 2025 Feb 23:2025.02.04.636562. doi: 10.1101/2025.02.04.636562.

Exploring the impact of the stargazin V143L mutation on the dynamics of the AMPA receptor: stargazin complex.

Front Cell Neurosci. 2025 Jan 17;18:1505846. doi: 10.3389/fncel.2024.1505846. eCollection 2024.

Repurposing FDA-approved drugs targeting FZD10 in nasopharyngeal carcinoma: insights from molecular dynamics simulations and experimental validation.

Sci Rep. 2024 Dec 28;14(1):31461. doi: 10.1038/s41598-024-82967-7.

Integrated virtual screening and MD simulation study to discover potential inhibitors of mycobacterial electron transfer flavoprotein oxidoreductase.

PLoS One. 2024 Nov 15;19(11):e0312860. doi: 10.1371/journal.pone.0312860. eCollection 2024.

Targeting Polyprotein to Design Potential Multiepitope Vaccine against (OHFV) by Evaluating Allergenicity, Antigenicity, and Toxicity Using Immunoinformatic Approaches.

Biology (Basel). 2024 Sep 20;13(9):738. doi: 10.3390/biology13090738.

Exploring pathogenic SNPs and estrogen receptor alpha interactions in breast cancer: An approach.

Heliyon. 2024 Aug 31;10(17):e37297. doi: 10.1016/j.heliyon.2024.e37297. eCollection 2024 Sep 15.

Insights into structural vaccinology harnessed for universal coronavirus vaccine development.

Clin Exp Vaccine Res. 2024 Jul;13(3):202-217. doi: 10.7774/cevr.2024.13.3.202. Epub 2024 Jul 31.

本文引用的文献

Information-theoretic dissection of pairwise contact potentials.

Proteins. 2002 Oct 1;49(1):7-14. doi: 10.1002/prot.10198.

Design of an optimal Chebyshev-expanded discrimination function for globular proteins.

Protein Sci. 2002 Aug;11(8):2010-21. doi: 10.1110/ps.0200702.

Distinguishing native conformations of proteins from decoys with an effective free energy estimator based on the OPLS all-atom force field and the Surface Generalized Born solvent model.

Proteins. 2002 Aug 1;48(2):404-22. doi: 10.1002/prot.10171.

In search for more accurate alignments in the twilight zone.

Protein Sci. 2002 Jul;11(7):1702-13. doi: 10.1110/ps.4820102.

Increasing the precision of comparative models with YASARA NOVA--a self-parameterizing force field.

Proteins. 2002 May 15;47(3):393-402. doi: 10.1002/prot.10104.

Identifying native-like protein structures using physics-based potentials.

J Comput Chem. 2002 Jan 15;23(1):147-60. doi: 10.1002/jcc.10018.

LiveBench-2: large-scale automated evaluation of protein structure prediction servers.

Proteins. 2001;Suppl 5:184-91. doi: 10.1002/prot.10039.

Assessment of the CASP4 fold recognition category.

Proteins. 2001;Suppl 5:55-67. doi: 10.1002/prot.10006.

Critical assessment of methods of protein structure prediction (CASP): round IV.

Proteins. 2001;Suppl 5:2-7.

Statistical potentials for fold assessment.

Protein Sci. 2002 Feb;11(2):430-48. doi: 10.1002/pro.110430.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

能否识别出正确的蛋白质模型？

Can correct protein models be identified?

作者信息

Wallner Björn, Elofsson Arne

机构信息

Stockholm Bioinformatics Center, SCFAB, Stockholm University, SE-106 91 Stockholm, Sweden.

出版信息

Protein Sci. 2003 May;12(5):1073-86. doi: 10.1110/ps.0236803.

DOI:10.1110/ps.0236803

PMID:12717029

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2323877/

Abstract

摘要

能否识别出正确的蛋白质模型？

Can correct protein models be identified?

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

能否识别出正确的蛋白质模型？

Can correct protein models be identified?

作者信息

机构信息

出版信息