RRCRank: a fusion method using rank strategy for residue-residue contact prediction.

Author information

Jing Xiaoyang, Dong Qiwen, Lu Ruqian

Affiliations

School of Computer Science, Fudan University, Shanghai, 200433, People's Republic of China.

School of Data Science and Engineering, East China Normal University, Shanghai, 200062, People's Republic of China.

Publication information

BMC Bioinformatics. 2017 Sep 2;18(1):390. doi: 10.1186/s12859-017-1811-9.

DOI:10.1186/s12859-017-1811-9

PMID:28865433

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5581475/

Abstract

BACKGROUND

In structural biology, residue-residue contacts play a crucial role in protein structure prediction. Predicted contacts can effectively constrain the conformational search space, which is significant for de novo protein structure prediction. Over the last few decades, researchers have developed various methods to predict residue-residue contacts, and fusion methods in particular have achieved significant performance in recent years. This work proposes a novel fusion method based on a rank strategy. Unlike traditional regression or classification strategies, it treats contact prediction as a ranking task. First, two kinds of features are extracted, one from correlated mutation methods and one from ensemble machine-learning classifiers; the method then uses a learning-to-rank algorithm to predict the contact probability of each residue pair.
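The ranking formulation described above can be illustrated with a small sketch. It is not the learning-to-rank algorithm used by RRCRank, which the abstract does not name; it swaps in a plain linear scorer trained with a pairwise logistic (RankNet-style) loss. The two-column feature layout (a correlated-mutation score and a classifier probability), the toy values, and all function names are assumptions made for illustration only.

```python
import math
import random


def train_pairwise_ranker(groups, n_features, epochs=200, lr=0.1, seed=0):
    """Fit a linear scorer with a pairwise logistic loss.

    groups: one list per protein; each item is (features, label),
            where label is 1 for a contact and 0 otherwise.
    Comparisons are only formed inside a protein, as in query-grouped ranking.
    """
    rng = random.Random(seed)
    w = [0.0] * n_features
    for _ in range(epochs):
        for pairs in groups:
            pos = [x for x, y in pairs if y == 1]
            neg = [x for x, y in pairs if y == 0]
            if not pos or not neg:
                continue
            for xp in pos:
                # One sampled (contact, non-contact) comparison per contact.
                xn = rng.choice(neg)
                diff = [a - b for a, b in zip(xp, xn)]
                margin = sum(wi * di for wi, di in zip(w, diff))
                # d/d(margin) of log(1 + exp(-margin)); margin is clamped
                # so that math.exp cannot overflow.
                g = -1.0 / (1.0 + math.exp(min(margin, 50.0)))
                w = [wi - lr * g * di for wi, di in zip(w, diff)]
    return w


def score(w, x):
    """Linear ranking score for one residue pair."""
    return sum(wi * xi for wi, xi in zip(w, x))


def rank_pairs(w, candidates):
    """Sort (residue_pair, features) items from highest to lowest score."""
    return sorted(candidates, key=lambda c: score(w, c[1]), reverse=True)


# Hypothetical features: [correlated-mutation score, classifier probability].
toy_groups = [
    [([0.9, 0.8], 1), ([0.7, 0.6], 1), ([0.2, 0.3], 0), ([0.1, 0.4], 0)],
    [([0.8, 0.9], 1), ([0.3, 0.2], 0), ([0.4, 0.1], 0)],
]
weights = train_pairwise_ranker(toy_groups, n_features=2)

candidates = [
    ((3, 40), [0.85, 0.70]),
    ((5, 12), [0.15, 0.20]),
    ((10, 30), [0.60, 0.65]),
]
ranked = rank_pairs(weights, candidates)
print([pair for pair, _ in ranked])
```

The point of the pairwise loss is that only the relative order of residue pairs within a protein matters, which matches how contact predictions are usually consumed: as a ranked list from which the top-scoring pairs are taken.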

RESULTS

First, we perform two benchmark tests of the proposed fusion method (RRCRank), on the CASP11 and CASP12 datasets respectively. The results show that RRCRank outperforms other well-developed methods, especially for medium- and short-range contacts. Second, to verify the superiority of the ranking strategy, we predict contacts with traditional regression and classification strategies using the same features. Compared with these two strategies, the ranking strategy performs better for all three contact types, in particular for long-range contacts. Third, RRCRank is compared with several state-of-the-art methods from CASP11 and CASP12. The results show that RRCRank achieves comparable prediction precision and is better than three methods on most assessment metrics.
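The abstract does not define the contact ranges or the precision metric, so the sketch below assumes the convention commonly used in CASP contact assessment: sequence separation of 6 to 11 for short range, 12 to 23 for medium range, and 24 or more for long range, with precision computed over the top L/k scored pairs in each range (L being the sequence length). The function names and toy data are hypothetical.

```python
def range_of(i, j):
    """Classify a residue pair by sequence separation (assumed CASP convention)."""
    sep = abs(i - j)
    if 6 <= sep <= 11:
        return "short"
    if 12 <= sep <= 23:
        return "medium"
    if sep >= 24:
        return "long"
    return None  # pairs closer than 6 residues are not evaluated


def top_precision(scored_pairs, true_contacts, seq_len, contact_range, k=5):
    """Precision of the top L/k predictions within one contact range.

    scored_pairs: dict mapping (i, j) to a predicted score.
    true_contacts: set of (i, j) pairs that are contacts in the native structure.
    The denominator is the number of predictions actually evaluated, which can
    be smaller than L/k when few pairs fall in the range.
    """
    in_range = [(p, s) for p, s in scored_pairs.items()
                if range_of(*p) == contact_range]
    in_range.sort(key=lambda item: item[1], reverse=True)
    top = in_range[:max(1, seq_len // k)]
    if not top:
        return 0.0
    hits = sum(1 for p, _ in top if p in true_contacts)
    return hits / len(top)


# Toy example: a 40-residue protein evaluated at top L/10 (4 pairs).
scores = {
    (1, 30): 0.9, (2, 35): 0.8, (3, 40): 0.7, (5, 38): 0.6, (4, 29): 0.2,
    (10, 17): 0.95, (12, 20): 0.5,
}
truth = {(1, 30), (3, 40), (5, 38), (4, 29), (10, 17)}

long_prec = top_precision(scores, truth, seq_len=40, contact_range="long", k=10)
short_prec = top_precision(scores, truth, seq_len=40, contact_range="short", k=10)
print(long_prec, short_prec)
```

Because the metric looks only at the highest-scoring pairs in each range, it rewards a method for ordering pairs correctly rather than for producing well-calibrated probabilities, which is one motivation for treating contact prediction as a ranking task.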

CONCLUSIONS

A learning-to-rank algorithm is used to develop a novel rank-based method for protein residue-residue contact prediction. Extensive assessment shows that the method achieves state-of-the-art performance.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0754/5581475/72cb43f3c4fe/12859_2017_1811_Fig1_HTML.jpg
