DNcon：一种使用深度网络进行蛋白质残基残基接触预测的方法的研究和基准测试。

A study and benchmark of DNcon: a method for protein residue-residue contact prediction using deep networks.

出版信息

BMC Bioinformatics. 2013;14 Suppl 14(Suppl 14):S12. doi: 10.1186/1471-2105-14-S14-S12. Epub 2013 Oct 9.

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3850995/

Abstract

BACKGROUND

In recent years, the use and importance of predicted protein residue-residue contacts has grown considerably with demonstrated applications such as drug design, protein tertiary structure prediction and model quality assessment. Nevertheless, reported accuracies in the range of 25-35% stubbornly remain the norm for sequence based, long range contact predictions on hard targets. This is in spite of a prolonged effort on behalf of the community to improve the performance of residue-residue contact prediction. A thorough study of the quality of current residue-residue contact predictions and the evaluation metrics used as well as an analysis of current methods is needed to stimulate further advancement in contact prediction and its application. Such a study will better explain the quality and nature of residue-residue contact predictions generated by current methods and as a result lead to better use of this contact information.

RESULTS

We evaluated several sequence based residue-residue contact predictors that participated in the tenth Critical Assessment of protein Structure Prediction (CASP) experiment. The evaluation was performed using standard assessment techniques such as those used by the official CASP assessors as well as two novel evaluation metrics (i.e., cluster accuracy and cluster count). An in-depth analysis revealed that while most residue-residue contact predictions generated are not accurate at the residue level, there is quite a strong contact signal present when allowing for less than residue level precision. Our residue-residue contact predictor, DNcon, performed particularly well achieving an accuracy of 66% for the top L/10 long range contacts when evaluated in a neighbourhood of size 2. The coverage of residue-residue contact areas was also greater with DNcon when compared to other methods. We also provide an analysis of DNcon with respect to its underlying architecture and features used for classification.

CONCLUSIONS

Our novel evaluation metrics demonstrate that current residue-residue contact predictions do contain a strong contact signal and are of better quality than standard evaluation metrics indicate. Our method, DNcon, is a robust, state-of-the-art residue-residue sequence based contact predictor and excelled under a number of evaluation schemes. It is available as a web service at http://iris.rnet.missouri.edu/dncon/.

摘要

背景

近年来，预测蛋白质残基残基接触的使用和重要性显著增加，其应用包括药物设计、蛋白质三级结构预测和模型质量评估等。然而，针对硬目标的基于序列的长程接触预测，报告的准确率仍徘徊在 25-35%。尽管社区长期以来一直致力于提高残基残基接触预测的性能，但情况仍然如此。需要对当前残基残基接触预测的质量和使用的评估指标以及当前方法进行全面研究，以激发接触预测及其应用的进一步发展。这样的研究将更好地解释当前方法生成的残基残基接触预测的质量和性质，并因此更好地利用这种接触信息。

结果

我们评估了参加第十届蛋白质结构预测关键评估（CASP）实验的几种基于序列的残基残基接触预测器。评估使用了标准评估技术，如官方 CASP 评估员使用的技术以及两种新的评估指标（即聚类精度和聚类计数）。深入分析表明，虽然大多数残基残基接触预测在残基水平上不准确，但当允许精度低于残基水平时，存在相当强的接触信号。我们的残基残基接触预测器 DNcon 在评估大小为 2 的邻域时，对于前 L/10 个长程接触，其准确率达到 66%，表现尤为出色。与其他方法相比，DNcon 还覆盖了更多的残基残基接触区域。我们还提供了对 DNcon 的分析，包括其底层架构和用于分类的特征。

结论

我们的新评估指标表明，当前的残基残基接触预测确实包含强烈的接触信号，并且比标准评估指标所表明的质量更好。我们的方法 DNcon 是一种强大的、最先进的基于残基序列的残基残基接触预测器，在许多评估方案下表现出色。它可以作为一个网络服务在 http://iris.rnet.missouri.edu/dncon/ 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/912e/3850995/2f6641e95dc2/1471-2105-14-S14-S12-1.jpg

相似文献

A study and benchmark of DNcon: a method for protein residue-residue contact prediction using deep networks.

BMC Bioinformatics. 2013;14 Suppl 14(Suppl 14):S12. doi: 10.1186/1471-2105-14-S14-S12. Epub 2013 Oct 9.

Predicting protein residue-residue contacts using deep networks and boosting.

Bioinformatics. 2012 Dec 1;28(23):3066-72. doi: 10.1093/bioinformatics/bts598. Epub 2012 Oct 9.

Protein Residue Contacts and Prediction Methods.

Methods Mol Biol. 2016;1415:463-76. doi: 10.1007/978-1-4939-3572-7_24.

Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.

PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.

DNCON2: improved protein contact prediction using two-level deep convolutional neural networks.

Bioinformatics. 2018 May 1;34(9):1466-1472. doi: 10.1093/bioinformatics/btx781.

Evaluation of residue-residue contact prediction in CASP10.

Proteins. 2014 Feb;82 Suppl 2(0 2):138-53. doi: 10.1002/prot.24340. Epub 2013 Aug 31.

Predicting protein residue-residue contacts using random forests and deep networks.

BMC Bioinformatics. 2019 Mar 14;20(Suppl 2):100. doi: 10.1186/s12859-019-2627-6.

Assessment of domain boundary predictions and the prediction of intramolecular contacts in CASP8.

Proteins. 2009;77 Suppl 9:196-209. doi: 10.1002/prot.22554.

Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning.

Proteins. 2018 Mar;86 Suppl 1(Suppl 1):84-96. doi: 10.1002/prot.25405. Epub 2017 Oct 31.

A further leap of improvement in tertiary structure prediction in CASP13 prompts new routes for future assessments.

Proteins. 2019 Dec;87(12):1100-1112. doi: 10.1002/prot.25787. Epub 2019 Aug 7.

引用本文的文献

Illuminating the "Twilight Zone": Advances in Difficult Protein Modeling.

Methods Mol Biol. 2023;2627:25-40. doi: 10.1007/978-1-0716-2974-1_2.

A tale of solving two computational challenges in protein science: neoantigen prediction and protein structure prediction.

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab493.

COMTOP: Protein Residue-Residue Contact Prediction through Mixed Integer Linear Optimization.

Membranes (Basel). 2021 Jun 30;11(7):503. doi: 10.3390/membranes11070503.

Recent Applications of Deep Learning Methods on Evolution- and Contact-Based Protein Structure Prediction.

Int J Mol Sci. 2021 Jun 2;22(11):6032. doi: 10.3390/ijms22116032.

Deep Learning-Based Advances in Protein Structure Prediction.

Int J Mol Sci. 2021 May 24;22(11):5553. doi: 10.3390/ijms22115553.

DeepECA: an end-to-end learning framework for protein contact prediction from a multiple sequence alignment.

BMC Bioinformatics. 2020 Jan 9;21(1):10. doi: 10.1186/s12859-019-3190-x.

Assessing the accuracy of contact predictions in CASP13.

Proteins. 2019 Dec;87(12):1058-1068. doi: 10.1002/prot.25819. Epub 2019 Oct 24.

Predicting protein residue-residue contacts using random forests and deep networks.

BMC Bioinformatics. 2019 Mar 14;20(Suppl 2):100. doi: 10.1186/s12859-019-2627-6.

High precision in protein contact prediction using fully convolutional neural networks and minimal sequence features.

Bioinformatics. 2018 Oct 1;34(19):3308-3315. doi: 10.1093/bioinformatics/bty341.

DNCON2: improved protein contact prediction using two-level deep convolutional neural networks.

Bioinformatics. 2018 May 1;34(9):1466-1472. doi: 10.1093/bioinformatics/btx781.

本文引用的文献

Predicting protein residue-residue contacts using deep networks and boosting.

Bioinformatics. 2012 Dec 1;28(23):3066-72. doi: 10.1093/bioinformatics/bts598. Epub 2012 Oct 9.

Deep architectures for protein contact map prediction.

Bioinformatics. 2012 Oct 1;28(19):2449-57. doi: 10.1093/bioinformatics/bts475. Epub 2012 Jul 30.

Protein 3D structure computed from evolutionary sequence variation.

PLoS One. 2011;6(12):e28766. doi: 10.1371/journal.pone.0028766. Epub 2011 Dec 7.

PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments.

Bioinformatics. 2012 Jan 15;28(2):184-90. doi: 10.1093/bioinformatics/btr638. Epub 2011 Nov 17.

Predicting residue-residue contacts using random forest models.

Bioinformatics. 2011 Dec 15;27(24):3379-84. doi: 10.1093/bioinformatics/btr579. Epub 2011 Oct 20.

A conformation ensemble approach to protein residue-residue contact.

BMC Struct Biol. 2011 Oct 12;11:38. doi: 10.1186/1472-6807-11-38.

Evaluation of residue-residue contact predictions in CASP9.

Proteins. 2011;79 Suppl 10(Suppl 10):119-25. doi: 10.1002/prot.23160. Epub 2011 Sep 17.

Improving protein structure prediction using multiple sequence-based contact predictions.

Structure. 2011 Aug 10;19(8):1182-91. doi: 10.1016/j.str.2011.05.004.

Optimal contact definition for reconstruction of contact maps.

BMC Bioinformatics. 2010 May 27;11:283. doi: 10.1186/1471-2105-11-283.

Predicted residue-residue contacts can help the scoring of 3D models.

Proteins. 2010 Jun;78(8):1980-91. doi: 10.1002/prot.22714.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

DNcon：一种使用深度网络进行蛋白质残基残基接触预测的方法的研究和基准测试。

A study and benchmark of DNcon: a method for protein residue-residue contact prediction using deep networks.

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献