• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过深度残差卷积网络从一组共进化矩阵中推导高精度蛋白质接触图。

Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.

作者信息

Li Yang, Zhang Chengxin, Bell Eric W, Zheng Wei, Zhou Xiaogen, Yu Dong-Jun, Zhang Yang

机构信息

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China.

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America.

出版信息

PLoS Comput Biol. 2021 Mar 26;17(3):e1008865. doi: 10.1371/journal.pcbi.1008865. eCollection 2021 Mar.

DOI:10.1371/journal.pcbi.1008865
PMID:33770072
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8026059/
Abstract

The topology of protein folds can be specified by the inter-residue contact-maps and accurate contact-map prediction can help ab initio structure folding. We developed TripletRes to deduce protein contact-maps from discretized distance profiles by end-to-end training of deep residual neural-networks. Compared to previous approaches, the major advantage of TripletRes is in its ability to learn and directly fuse a triplet of coevolutionary matrices extracted from the whole-genome and metagenome databases and therefore minimize the information loss during the course of contact model training. TripletRes was tested on a large set of 245 non-homologous proteins from CASP 11&12 and CAMEO experiments and outperformed other top methods from CASP12 by at least 58.4% for the CASP 11&12 targets and 44.4% for the CAMEO targets in the top-L long-range contact precision. On the 31 FM targets from the latest CASP13 challenge, TripletRes achieved the highest precision (71.6%) for the top-L/5 long-range contact predictions. It was also shown that a simple re-training of the TripletRes model with more proteins can lead to further improvement with precisions comparable to state-of-the-art methods developed after CASP13. These results demonstrate a novel efficient approach to extend the power of deep convolutional networks for high-accuracy medium- and long-range protein contact-map predictions starting from primary sequences, which are critical for constructing 3D structure of proteins that lack homologous templates in the PDB library.

摘要

蛋白质折叠的拓扑结构可以通过残基间接触图来确定,准确的接触图预测有助于从头进行结构折叠。我们开发了TripletRes,通过深度残差神经网络的端到端训练从离散化的距离分布中推断蛋白质接触图。与先前的方法相比,TripletRes的主要优势在于其能够学习并直接融合从全基因组和宏基因组数据库中提取的三联共进化矩阵,从而在接触模型训练过程中最小化信息损失。TripletRes在来自CASP 11&12和CAMEO实验的一大组245个非同源蛋白质上进行了测试,在top-L长程接触精度方面,对于CASP 11&12的目标,其性能比CASP12中的其他顶级方法至少高出58.4%,对于CAMEO目标则高出44.4%。在最新的CASP13挑战的31个FM目标上,TripletRes在top-L/5长程接触预测中达到了最高精度(71.6%)。研究还表明,使用更多蛋白质对TripletRes模型进行简单的重新训练可以进一步提高精度,与CASP13之后开发的最先进方法相当。这些结果展示了一种新颖有效的方法,可扩展深度卷积网络的能力,用于从一级序列开始进行高精度的中长程蛋白质接触图预测,这对于构建PDB库中缺乏同源模板的蛋白质的三维结构至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8055/8026059/a19ed8762e81/pcbi.1008865.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8055/8026059/75516792b8b7/pcbi.1008865.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8055/8026059/20e6572fb172/pcbi.1008865.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8055/8026059/14b2879955eb/pcbi.1008865.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8055/8026059/a19ed8762e81/pcbi.1008865.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8055/8026059/75516792b8b7/pcbi.1008865.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8055/8026059/20e6572fb172/pcbi.1008865.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8055/8026059/14b2879955eb/pcbi.1008865.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8055/8026059/a19ed8762e81/pcbi.1008865.g004.jpg

相似文献

1
Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.通过深度残差卷积网络从一组共进化矩阵中推导高精度蛋白质接触图。
PLoS Comput Biol. 2021 Mar 26;17(3):e1008865. doi: 10.1371/journal.pcbi.1008865. eCollection 2021 Mar.
2
Ensembling multiple raw coevolutionary features with deep residual neural networks for contact-map prediction in CASP13.基于深度残差神经网络的原始共进化特征集成方法在 CASP13 中用于接触图预测。
Proteins. 2019 Dec;87(12):1082-1091. doi: 10.1002/prot.25798. Epub 2019 Aug 22.
3
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
4
Deep-learning contact-map guided protein structure prediction in CASP13.深度学习接触图指导的 CASP13 蛋白质结构预测。
Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14.
5
ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks.ResPRE:通过结合精度矩阵和深度残差神经网络进行高精度蛋白质接触预测。
Bioinformatics. 2019 Nov 1;35(22):4647-4655. doi: 10.1093/bioinformatics/btz291.
6
Analysis of distance-based protein structure prediction by deep learning in CASP13.基于深度学习的 CASP13 蛋白质结构预测距离分析。
Proteins. 2019 Dec;87(12):1069-1081. doi: 10.1002/prot.25810. Epub 2019 Sep 13.
7
Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks.基于二维递归神经网络的多类别距离图的从头预测和基于模板的预测。
BMC Struct Biol. 2009 Jan 30;9:5. doi: 10.1186/1472-6807-9-5.
8
Detecting distant-homology protein structures by aligning deep neural-network based contact maps.通过对齐基于深度神经网络的接触图来检测远程同源蛋白结构。
PLoS Comput Biol. 2019 Oct 17;15(10):e1007411. doi: 10.1371/journal.pcbi.1007411. eCollection 2019 Oct.
9
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。
Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.
10
A two-stage approach for improved prediction of residue contact maps.一种用于改进残基接触图预测的两阶段方法。
BMC Bioinformatics. 2006 Mar 30;7:180. doi: 10.1186/1471-2105-7-180.

引用本文的文献

1
Deep-learning-based single-domain and multidomain protein structure prediction with D-I-TASSER.基于深度学习的单域和多域蛋白质结构预测与D-I-TASSER
Nat Biotechnol. 2025 May 23. doi: 10.1038/s41587-025-02654-4.
2
Machine Learning Techniques to Infer Protein Structure and Function from Sequences: A Comprehensive Review.基于序列推断蛋白质结构和功能的机器学习技术:全面综述。
Methods Mol Biol. 2025;2867:79-104. doi: 10.1007/978-1-0716-4196-5_5.
3
PCP-GC-LM: single-sequence-based protein contact prediction using dual graph convolutional neural network and convolutional neural network.

本文引用的文献

1
Improved protein structure prediction using predicted interresidue orientations.利用预测的残基间取向改进蛋白质结构预测。
Proc Natl Acad Sci U S A. 2020 Jan 21;117(3):1496-1503. doi: 10.1073/pnas.1914677117. Epub 2020 Jan 2.
2
DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins.DeepMSA:构建深度多重序列比对以改进远距离同源蛋白质的接触预测和折叠识别。
Bioinformatics. 2020 Apr 1;36(7):2105-2112. doi: 10.1093/bioinformatics/btz863.
3
Protein structure prediction using multiple deep neural networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13).
PCP-GC-LM:基于双图卷积神经网络和卷积神经网络的单序列蛋白质接触预测。
BMC Bioinformatics. 2024 Sep 2;25(1):287. doi: 10.1186/s12859-024-05914-3.
4
Importance of Inter-residue Contacts for Understanding Protein Folding and Unfolding Rates, Remote Homology, and Drug Design.残基间接触对于理解蛋白质折叠与解折叠速率、远程同源性及药物设计的重要性。
Mol Biotechnol. 2025 Mar;67(3):862-884. doi: 10.1007/s12033-024-01119-4. Epub 2024 Mar 18.
5
Recent Progress of Protein Tertiary Structure Prediction.蛋白质三级结构预测的最新进展。
Molecules. 2024 Feb 13;29(4):832. doi: 10.3390/molecules29040832.
6
Structure prediction of linear and cyclic peptides using CABS-flex.使用 CABS-flex 进行线性和环状肽的结构预测。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae003.
7
Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15.在 CASP15 中,通过深度学习、线程比对和多 MSAs 策略,实现高质量蛋白质单体和复合物结构预测。
Proteins. 2023 Dec;91(12):1684-1703. doi: 10.1002/prot.26585. Epub 2023 Aug 31.
8
Refinement of AlphaFold2 models against experimental and hybrid cryo-EM density maps.针对实验性和混合冷冻电镜密度图对AlphaFold2模型进行优化。
QRB Discov. 2022;3. doi: 10.1017/qrd.2022.13. Epub 2022 Sep 20.
9
Microbiome-based enrichment pattern mining has enabled a deeper understanding of the biome-species-function relationship.基于微生物组的富集模式挖掘使人们能够更深入地了解生物群落-物种-功能关系。
Commun Biol. 2023 Apr 10;6(1):391. doi: 10.1038/s42003-023-04753-x.
10
Prediction of the tetramer protein complex interaction based on CNN and SVM.基于卷积神经网络(CNN)和支持向量机(SVM)的四聚体蛋白复合物相互作用预测
Front Genet. 2023 Jan 26;14:1076904. doi: 10.3389/fgene.2023.1076904. eCollection 2023.
使用多个深度神经网络进行蛋白质结构预测在第十三届蛋白质结构预测关键评估 (CASP13) 中。
Proteins. 2019 Dec;87(12):1141-1148. doi: 10.1002/prot.25834.
4
Assessing the accuracy of contact predictions in CASP13.评估 CASP13 中接触预测的准确性。
Proteins. 2019 Dec;87(12):1058-1068. doi: 10.1002/prot.25819. Epub 2019 Oct 24.
5
Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints.深度学习利用迭代预测的结构约束来扩展从头开始的蛋白质建模对基因组的覆盖范围。
Nat Commun. 2019 Sep 4;10(1):3977. doi: 10.1038/s41467-019-11994-0.
6
Ensembling multiple raw coevolutionary features with deep residual neural networks for contact-map prediction in CASP13.基于深度残差神经网络的原始共进化特征集成方法在 CASP13 中用于接触图预测。
Proteins. 2019 Dec;87(12):1082-1091. doi: 10.1002/prot.25798. Epub 2019 Aug 22.
7
Distance-based protein folding powered by deep learning.基于深度学习的距离相关蛋白质折叠。
Proc Natl Acad Sci U S A. 2019 Aug 20;116(34):16856-16865. doi: 10.1073/pnas.1821309116. Epub 2019 Aug 9.
8
Deep-learning contact-map guided protein structure prediction in CASP13.深度学习接触图指导的 CASP13 蛋白质结构预测。
Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14.
9
ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks.ResPRE:通过结合精度矩阵和深度残差神经网络进行高精度蛋白质接触预测。
Bioinformatics. 2019 Nov 1;35(22):4647-4655. doi: 10.1093/bioinformatics/btz291.
10
Clustering huge protein sequence sets in linear time.线性时间内的大规模蛋白质序列集聚类。
Nat Commun. 2018 Jun 29;9(1):2542. doi: 10.1038/s41467-018-04964-5.