Predicting residue contacts using pragmatic correlated mutations method: reducing the false positives.

Authors

Kundrotas Petras J, Alexov Emil G

Affiliation

Computational Biophysics and Bioinformatics, Department of Physics, Clemson University, Clemson, SC 29634, USA.

Publication

BMC Bioinformatics. 2006 Nov 16;7:503. doi: 10.1186/1471-2105-7-503.

DOI:10.1186/1471-2105-7-503

PMID:17109752

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1654194/

Abstract

BACKGROUND

Predicting residue contacts from the primary amino acid sequence alone is an important task: it can guide 3D structure modeling and help verify the quality of predicted 3D structures. The correlated mutations (CM) method is the most promising approach and has been used to predict amino acid pairs that are distant in the primary sequence but form contacts in the native 3D structure of homologous proteins.
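To make the idea of correlated mutations concrete, the following is a minimal sketch, not the authors' implementation. It scores two alignment columns by the Pearson correlation of their pairwise residue similarities across all sequence pairs. The identity-based similarity, the function names, and the toy alignment are all assumptions introduced for illustration; the abstract does not specify the scoring details.

```python
from itertools import combinations
from math import sqrt


def column_similarities(column):
    """Similarity of the residues at one column for every pair of sequences.

    Assumption: identity scoring (1.0 if the two residues match, else 0.0).
    A substitution matrix could be swapped in here.
    """
    return [1.0 if a == b else 0.0 for a, b in combinations(column, 2)]


def pearson(xs, ys):
    """Pearson correlation of two equal-length lists; 0.0 if either is constant."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a fully conserved column carries no covariation signal
    return cov / sqrt(var_x * var_y)


def correlated_mutation_score(alignment, i, j):
    """Correlation between the substitution patterns at columns i and j."""
    col_i = [seq[i] for seq in alignment]
    col_j = [seq[j] for seq in alignment]
    return pearson(column_similarities(col_i), column_similarities(col_j))


# Toy, hypothetical alignment of four homologous sequences.
alignment = ["ACDK", "ACDK", "GCEK", "GCER"]
score_covarying = correlated_mutation_score(alignment, 0, 2)
score_conserved = correlated_mutation_score(alignment, 0, 1)
```

In this toy alignment, columns 0 and 2 change together and receive a high score, while column 1 is fully conserved and scores zero against any other column. High-scoring pairs that are far apart in sequence would be the candidate contacts.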

RESULTS

Here we report a new implementation of the CM method with an added set of selection rules (filters). The parameters of the algorithm were optimized against fifteen high-resolution crystal structures, with an optimization criterion that maximized the confidence of the predictions. The optimization resulted in a true positive ratio (TPR) of 0.08 for the CM method without filters and 0.14 with filters. The protocol was further benchmarked against 65 high-resolution structures that were not included in the optimization set. The benchmarking resulted in a TPR of 0.07 for the CM method without filters and 0.09 with filters.
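The reported TPR values can be compared with a short calculation. This is a sketch under a stated assumption: TPR is taken here to be the fraction of predicted contacts that are also contacts in the native structure, which the abstract does not define explicitly. The function names and the toy contact sets are hypothetical.

```python
def true_positive_ratio(predicted, native):
    """Fraction of predicted contacts found among the native contacts.

    Assumption: contacts are (i, j) residue-index pairs with i < j.
    """
    predicted = set(predicted)
    if not predicted:
        return 0.0
    return len(predicted & set(native)) / len(predicted)


def relative_improvement(before, after):
    """Relative change from one TPR to another, as a fraction."""
    return (after - before) / before


# Toy, hypothetical contact sets: one of four predictions is correct.
predicted = {(3, 40), (5, 62), (10, 55), (12, 80)}
native = {(5, 62), (7, 30), (12, 81)}
toy_tpr = true_positive_ratio(predicted, native)

# TPR values quoted in the text above.
optimization_gain = relative_improvement(0.08, 0.14)
benchmark_gain = relative_improvement(0.07, 0.09)
```

With the quoted numbers, the relative gain from adding filters is about 0.75 on the optimization set and about 0.29 on the benchmark set.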

CONCLUSION

Thus, the inclusion of selection rules resulted in an overall improvement of 30%. In addition, the pair-wise comparison of TPR for each protein without and with filters resulted in an average improvement of 1.7. The methodology was implemented as a web server, http://www.ces.clemson.edu/compbio/recon, that is freely available to the public. The purpose of this implementation is to provide 3D structure predictors with a tool that can help rank alternative models according to which satisfies the largest number of predicted contacts, and that can also provide a confidence score for contacts in cases where the structure is known.
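Ranking alternative models by satisfied contacts can be sketched as follows. This is an illustration only, not the web server's code: the 8 Å distance cutoff, the use of one coordinate per residue, and all names and coordinates below are assumptions.

```python
from math import dist


def satisfied_contacts(coords, predicted_contacts, cutoff=8.0):
    """Count predicted contacts whose residues lie within `cutoff` of each other.

    `coords` holds one (x, y, z) point per residue, e.g. a C-alpha position.
    The 8.0 cutoff (in angstroms) is an assumed, commonly used threshold.
    """
    return sum(
        1 for i, j in predicted_contacts if dist(coords[i], coords[j]) <= cutoff
    )


def rank_models(models, predicted_contacts, cutoff=8.0):
    """Return (name, satisfied count) pairs, best-supported model first."""
    scored = [
        (name, satisfied_contacts(coords, predicted_contacts, cutoff))
        for name, coords in models.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Toy, hypothetical models of a four-residue chain.
models = {
    "model_a": [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (5.0, 3.0, 0.0)],
    "model_b": [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (20.0, 0.0, 0.0)],
}
predicted_contacts = [(0, 3), (1, 3)]
ranking = rank_models(models, predicted_contacts)
```

Here `model_a` folds residue 3 back toward residues 0 and 1 and satisfies both predicted contacts, so it ranks ahead of the extended `model_b`, which satisfies none.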
