• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分析影响基于深度学习的残基间接触预测的几个关键因素。

Analysis of several key factors influencing deep learning-based inter-residue contact prediction.

机构信息

Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA.

Department of Mathematics and Computer Science, University of Missouri, St. Louis, MO 63121, USA.

出版信息

Bioinformatics. 2020 Feb 15;36(4):1091-1098. doi: 10.1093/bioinformatics/btz679.

DOI:10.1093/bioinformatics/btz679
PMID:31504181
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7703788/
Abstract

MOTIVATION

Deep learning has become the dominant technology for protein contact prediction. However, the factors that affect the performance of deep learning in contact prediction have not been systematically investigated.

RESULTS

We analyzed the results of our three deep learning-based contact prediction methods (MULTICOM-CLUSTER, MULTICOM-CONSTRUCT and MULTICOM-NOVEL) in the CASP13 experiment and identified several key factors [i.e. deep learning technique, multiple sequence alignment (MSA), distance distribution prediction and domain-based contact integration] that influenced the contact prediction accuracy. We compared our convolutional neural network (CNN)-based contact prediction methods with three coevolution-based methods on 75 CASP13 targets consisting of 108 domains. We demonstrated that the CNN-based multi-distance approach was able to leverage global coevolutionary coupling patterns comprised of multiple correlated contacts for more accurate contact prediction than the local coevolution-based methods, leading to a substantial increase of precision by 19.2 percentage points. We also tested different alignment methods and domain-based contact prediction with the deep learning contact predictors. The comparison of the three methods showed deeper sequence alignments and the integration of domain-based contact prediction with the full-length contact prediction improved the performance of contact prediction. Moreover, we demonstrated that the domain-based contact prediction based on a novel ab initio approach of parsing domains from MSAs alone without using known protein structures was a simple, fast approach to improve contact prediction. Finally, we showed that predicting the distribution of inter-residue distances in multiple distance intervals could capture more structural information and improve binary contact prediction.

AVAILABILITY AND IMPLEMENTATION

https://github.com/multicom-toolbox/DNCON2/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

深度学习已成为蛋白质接触预测的主导技术。然而,影响深度学习在接触预测中性能的因素尚未得到系统研究。

结果

我们分析了我们在 CASP13 实验中三种基于深度学习的接触预测方法(MULTICOM-CLUSTER、MULTICOM-CONSTRUCT 和 MULTICOM-NOVEL)的结果,确定了几个关键因素[即深度学习技术、多重序列比对(MSA)、距离分布预测和基于域的接触整合],这些因素影响接触预测精度。我们将基于卷积神经网络(CNN)的接触预测方法与三种基于共进化的方法在由 108 个域组成的 75 个 CASP13 目标上进行了比较。我们证明,基于 CNN 的多距离方法能够利用由多个相关接触组成的全局共进化耦合模式进行更准确的接触预测,比基于局部共进化的方法提高了 19.2 个百分点的精度。我们还测试了不同的对齐方法和基于域的接触预测与深度学习接触预测器。三种方法的比较表明,更深层次的序列对齐以及基于域的接触预测与全长接触预测的整合提高了接触预测的性能。此外,我们证明了基于从头开始解析 MSAs 中域的新方法的基于域的接触预测,而无需使用已知的蛋白质结构,是一种简单、快速的方法来提高接触预测。最后,我们表明,预测多个距离间隔内的残基间距离分布可以捕获更多的结构信息,并提高二进制接触预测。

可用性和实现

https://github.com/multicom-toolbox/DNCON2/。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/8137f25d38e1/btz679f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/e617b9bae737/btz679f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/2fbe21c11774/btz679f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/15ccccd738bf/btz679f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/0a17db6fa365/btz679f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/8137f25d38e1/btz679f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/e617b9bae737/btz679f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/2fbe21c11774/btz679f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/15ccccd738bf/btz679f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/0a17db6fa365/btz679f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbb5/7703788/8137f25d38e1/btz679f5.jpg

相似文献

1
Analysis of several key factors influencing deep learning-based inter-residue contact prediction.分析影响基于深度学习的残基间接触预测的几个关键因素。
Bioinformatics. 2020 Feb 15;36(4):1091-1098. doi: 10.1093/bioinformatics/btz679.
2
Improving deep learning-based protein distance prediction in CASP14.在蛋白质结构预测关键评估第14轮(CASP14)中改进基于深度学习的蛋白质距离预测
Bioinformatics. 2021 Oct 11;37(19):3190-3196. doi: 10.1093/bioinformatics/btab355.
3
Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14.通过深度学习和距离预测改进 CASP14 中的蛋白质三级结构预测。
Proteins. 2022 Jan;90(1):58-72. doi: 10.1002/prot.26186. Epub 2021 Jul 27.
4
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。
Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.
5
DNCON2: improved protein contact prediction using two-level deep convolutional neural networks.DNCON2:使用两级深度卷积神经网络改进蛋白质接触预测。
Bioinformatics. 2018 May 1;34(9):1466-1472. doi: 10.1093/bioinformatics/btx781.
6
MULTICOM2 open-source protein structure prediction system powered by deep learning and distance prediction.基于深度学习和距离预测的 MULTICOM2 开源蛋白质结构预测系统。
Sci Rep. 2021 Jun 23;11(1):13155. doi: 10.1038/s41598-021-92395-6.
7
Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning.通过整合深度多序列比对、协同进化和机器学习进行蛋白质接触预测。
Proteins. 2018 Mar;86 Suppl 1(Suppl 1):84-96. doi: 10.1002/prot.25405. Epub 2017 Oct 31.
8
The MULTICOM Protein Structure Prediction Server Empowered by Deep Learning and Contact Distance Prediction.基于深度学习和接触距离预测的 MULTICOM 蛋白质结构预测服务器。
Methods Mol Biol. 2020;2165:13-26. doi: 10.1007/978-1-0716-0708-4_2.
9
Deep-learning contact-map guided protein structure prediction in CASP13.深度学习接触图指导的 CASP13 蛋白质结构预测。
Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14.
10
DeepDist: real-value inter-residue distance prediction with deep residual convolutional network.DeepDist:基于深度残差卷积网络的真实残差距离预测。
BMC Bioinformatics. 2021 Jan 25;22(1):30. doi: 10.1186/s12859-021-03960-9.

引用本文的文献

1
Predicting the structural impact of human alternative splicing.预测人类可变剪接的结构影响。
Genome Biol. 2025 Sep 17;26(1):283. doi: 10.1186/s13059-025-03744-x.
2
Modeling protein conformational ensembles by guiding AlphaFold2 with Double Electron Electron Resonance (DEER) distance distributions.通过双电子电子共振(DEER)距离分布引导AlphaFold2对蛋白质构象集合进行建模。
Nat Commun. 2025 Aug 2;16(1):7107. doi: 10.1038/s41467-025-62582-4.
3
Improving Protein Secondary Structure Prediction by Deep Language Models and Transformer Networks.

本文引用的文献

1
Analysis of distance-based protein structure prediction by deep learning in CASP13.基于深度学习的 CASP13 蛋白质结构预测距离分析。
Proteins. 2019 Dec;87(12):1069-1081. doi: 10.1002/prot.25810. Epub 2019 Sep 13.
2
Prediction of interresidue contacts with DeepMetaPSICOV in CASP13.在 CASP13 中使用 DeepMetaPSICOV 预测残基间接触。
Proteins. 2019 Dec;87(12):1092-1099. doi: 10.1002/prot.25779. Epub 2019 Jul 27.
3
ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks.
深度学习语言模型和变换网络在蛋白质二级结构预测中的改进。
Methods Mol Biol. 2025;2867:43-53. doi: 10.1007/978-1-0716-4196-5_3.
4
The success rate of processed predicted models in molecular replacement: implications for experimental phasing in the AlphaFold era.处理后预测模型在分子置换中的成功率:对 AlphaFold 时代实验相位的影响。
Acta Crystallogr D Struct Biol. 2024 Nov 1;80(Pt 11):766-779. doi: 10.1107/S2059798324009380. Epub 2024 Oct 3.
5
SymProFold: Structural prediction of symmetrical biological assemblies.SymProFold:对称生物组装结构预测。
Nat Commun. 2024 Sep 18;15(1):8152. doi: 10.1038/s41467-024-52138-3.
6
AlphaFold predictions of fold-switched conformations are driven by structure memorization.AlphaFold 对构象转换构象的预测是由结构记忆驱动的。
Nat Commun. 2024 Aug 24;15(1):7296. doi: 10.1038/s41467-024-51801-z.
7
How much metagenome data is needed for protein structure prediction: The advantages of targeted approach from the ecological and evolutionary perspectives.蛋白质结构预测需要多少宏基因组数据:从生态和进化角度看靶向方法的优势
Imeta. 2022 Mar 6;1(1):e9. doi: 10.1002/imt2.9. eCollection 2022 Mar.
8
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
9
Predicting the Structural Impact of Human Alternative Splicing.预测人类可变剪接的结构影响
bioRxiv. 2023 Dec 24:2023.12.21.572928. doi: 10.1101/2023.12.21.572928.
10
AlphaFold2 has more to learn about protein energy landscapes.关于蛋白质能量景观,AlphaFold2还有更多需要学习的地方。
bioRxiv. 2023 Dec 13:2023.12.12.571380. doi: 10.1101/2023.12.12.571380.
ResPRE:通过结合精度矩阵和深度残差神经网络进行高精度蛋白质接触预测。
Bioinformatics. 2019 Nov 1;35(22):4647-4655. doi: 10.1093/bioinformatics/btz291.
4
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。
Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.
5
DESTINI: A deep-learning approach to contact-driven protein structure prediction.DESTINI:一种基于深度学习的接触驱动蛋白质结构预测方法。
Sci Rep. 2019 Mar 5;9(1):3514. doi: 10.1038/s41598-019-40314-1.
6
Clustering huge protein sequence sets in linear time.线性时间内的大规模蛋白质序列集聚类。
Nat Commun. 2018 Jun 29;9(1):2542. doi: 10.1038/s41467-018-04964-5.
7
Accurate prediction of protein contact maps by coupling residual two-dimensional bidirectional long short-term memory with convolutional neural networks.通过将残差二维双向长短期记忆与卷积神经网络相结合,准确预测蛋白质接触图。
Bioinformatics. 2018 Dec 1;34(23):4039-4045. doi: 10.1093/bioinformatics/bty481.
8
High precision in protein contact prediction using fully convolutional neural networks and minimal sequence features.利用全卷积神经网络和最小序列特征进行高精度蛋白质接触预测。
Bioinformatics. 2018 Oct 1;34(19):3308-3315. doi: 10.1093/bioinformatics/bty341.
9
CONFOLD2: improved contact-driven ab initio protein structure modeling.CONFOLD2:改进的接触驱动从头蛋白质结构建模。
BMC Bioinformatics. 2018 Jan 25;19(1):22. doi: 10.1186/s12859-018-2032-6.
10
DNCON2: improved protein contact prediction using two-level deep convolutional neural networks.DNCON2:使用两级深度卷积神经网络改进蛋白质接触预测。
Bioinformatics. 2018 May 1;34(9):1466-1472. doi: 10.1093/bioinformatics/btx781.