Comparing co-evolution methods and their application to template-free protein structure prediction.

Affiliations

Department of Statistics, University of Oxford, Oxford OX1 3LB, UK.

Department of Informatics, UCB Pharma, Slough SL1 3WE, UK.

Publication information

Bioinformatics. 2017 Feb 1;33(3):373-381. doi: 10.1093/bioinformatics/btw618.

DOI:10.1093/bioinformatics/btw618

PMID:28171606

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5860252/

Abstract

MOTIVATION

Co-evolution methods have been used as contact predictors to identify pairs of residues that share spatial proximity. Such contact predictors have been compared in terms of the precision of their predictions, but there is no study that compares their usefulness to model generation.

RESULTS

We compared eight different co-evolution methods for a set of ∼3500 proteins and found that metaPSICOV stage 2 produces, on average, the most precise predictions. Precision of all the methods is dependent on SCOP class, with most methods predicting contacts in all α and membrane proteins poorly. The contact predictions were then used to assist in de novo model generation. We found that it was not the method with the highest average precision, but rather metaPSICOV stage 1 predictions that consistently led to the best models being produced. Our modelling results show a correlation between the proportion of predicted long range contacts that are satisfied on a model and its quality. We used this proportion to effectively classify models as correct/incorrect; discarding decoys classified as incorrect led to an enrichment in the proportion of good decoys in our final ensemble by a factor of seven. For 17 out of the 18 cases where correct answers were generated, the best models were not discarded by this approach. We were also able to identify eight cases where no correct decoy had been generated.
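The model-filtering idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the 8 Å distance cutoff, the sequence-separation threshold of 24 residues for "long range", the classification threshold of 0.5, and the function names `satisfied_fraction` and `filter_decoys` are all assumptions chosen for illustration.

```python
import math

def satisfied_fraction(coords, contacts, cutoff=8.0, min_sep=24):
    """Fraction of predicted long-range contacts satisfied by one model.

    coords   : list of (x, y, z) tuples, one representative atom per residue
    contacts : iterable of (i, j) zero-based residue index pairs
    cutoff   : distance in angstroms below which a contact counts as satisfied (assumed)
    min_sep  : minimum sequence separation for a long-range contact (assumed)
    """
    long_range = [(i, j) for i, j in contacts if abs(i - j) >= min_sep]
    if not long_range:
        return 0.0  # no long-range predictions to evaluate
    hits = sum(
        1 for i, j in long_range
        if math.dist(coords[i], coords[j]) <= cutoff
    )
    return hits / len(long_range)

def filter_decoys(decoys, contacts, threshold=0.5, **kwargs):
    """Keep the names of decoys whose satisfied fraction reaches the threshold.

    decoys    : dict mapping decoy name -> coordinate list
    threshold : hypothetical cut-off separating 'correct' from 'incorrect'
    """
    return [
        name for name, coords in decoys.items()
        if satisfied_fraction(coords, contacts, **kwargs) >= threshold
    ]
```

In practice the threshold would have to be calibrated against models of known quality; the value used here is only a placeholder.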

AVAILABILITY AND IMPLEMENTATION

Data is available for download from: http://opig.stats.ox.ac.uk/resources.

CONTACT

saulo.deoliveira@dtc.ox.ac.uk

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/07f1/5860252/586736c1e723/btw618f1.jpg
