• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在蛋白质结构预测关键评估第14轮(CASP14)中改进基于深度学习的蛋白质距离预测

Improving deep learning-based protein distance prediction in CASP14.

作者信息

Guo Zhiye, Wu Tianqi, Liu Jian, Hou Jie, Cheng Jianlin

机构信息

Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA.

Department of Computer Science, Saint Louis University, St. Louis, MO 63103, USA.

出版信息

Bioinformatics. 2021 Oct 11;37(19):3190-3196. doi: 10.1093/bioinformatics/btab355.

DOI:10.1093/bioinformatics/btab355
PMID:33961009
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8504632/
Abstract

MOTIVATION

Accurate prediction of residue-residue distances is important for protein structure prediction. We developed several protein distance predictors based on a deep learning distance prediction method and blindly tested them in the 14th Critical Assessment of Protein Structure Prediction (CASP14). The prediction method uses deep residual neural networks with the channel-wise attention mechanism to classify the distance between every two residues into multiple distance intervals. The input features for the deep learning method include co-evolutionary features as well as other sequence-based features derived from multiple sequence alignments (MSAs). Three alignment methods are used with multiple protein sequence/profile databases to generate MSAs for input feature generation. Based on different configurations and training strategies of the deep learning method, five MULTICOM distance predictors were created to participate in the CASP14 experiment.

RESULTS

Benchmarked on 37 hard CASP14 domains, the best performing MULTICOM predictor is ranked 5th out of 30 automated CASP14 distance prediction servers in terms of precision of top L/5 long-range contact predictions [i.e. classifying distances between two residues into two categories: in contact (<8 Angstrom) and not in contact otherwise] and performs better than the best CASP13 distance prediction method. The best performing MULTICOM predictor is also ranked 6th among automated server predictors in classifying inter-residue distances into 10 distance intervals defined by CASP14 according to the precision of distance classification. The results show that the quality and depth of MSAs depend on alignment methods and sequence databases and have a significant impact on the accuracy of distance prediction. Using larger training datasets and multiple complementary features improves prediction accuracy. However, the number of effective sequences in MSAs is only a weak indicator of the quality of MSAs and the accuracy of predicted distance maps. In contrast, there is a strong correlation between the accuracy of contact/distance predictions and the average probability of the predicted contacts, which can therefore be more effectively used to estimate the confidence of distance predictions and select predicted distance maps.

AVAILABILITY AND IMPLEMENTATION

The software package, source code and data of DeepDist2 are freely available at https://github.com/multicom-toolbox/deepdist and https://zenodo.org/record/4712084#.YIIM13VKhQM.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

准确预测残基间距离对于蛋白质结构预测至关重要。我们基于深度学习距离预测方法开发了几种蛋白质距离预测器,并在第14届蛋白质结构预测关键评估(CASP14)中对它们进行了盲测。该预测方法使用带有通道注意力机制的深度残差神经网络,将每两个残基之间的距离分类到多个距离区间。深度学习方法的输入特征包括共进化特征以及从多序列比对(MSA)中衍生的其他基于序列的特征。使用三种比对方法与多个蛋白质序列/谱数据库来生成用于输入特征生成的MSA。基于深度学习方法的不同配置和训练策略,创建了五个MULTICOM距离预测器来参与CASP14实验。

结果

以37个CASP14硬结构域为基准,表现最佳的MULTICOM预测器在顶级L/5远程接触预测的精度方面(即将两个残基之间的距离分为两类:接触(<8埃)和不接触),在30个自动化CASP14距离预测服务器中排名第5,并且比最佳的CASP13距离预测方法表现更好。在根据距离分类精度将残基间距离分类为CASP14定义的10个距离区间方面,表现最佳的MULTICOM预测器在自动化服务器预测器中也排名第6。结果表明,MSA的质量和深度取决于比对方法和序列数据库,并且对距离预测的准确性有重大影响。使用更大的训练数据集和多个互补特征可提高预测准确性。然而,MSA中有效序列的数量只是MSA质量和预测距离图准确性的一个弱指标。相比之下,接触/距离预测的准确性与预测接触的平均概率之间存在很强的相关性,因此可以更有效地用于估计距离预测的置信度并选择预测距离图。

可用性和实现

DeepDist2的软件包、源代码和数据可在https://github.com/multicom-toolbox/deepdist和https://zenodo.org/record/4712084#.YIIM13VKhQM上免费获取。

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df27/8504632/3d9dd82cacfa/btab355f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df27/8504632/ddf73c92d4f3/btab355f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df27/8504632/69ce35c075aa/btab355f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df27/8504632/67aa28fc4687/btab355f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df27/8504632/3d9dd82cacfa/btab355f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df27/8504632/ddf73c92d4f3/btab355f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df27/8504632/69ce35c075aa/btab355f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df27/8504632/67aa28fc4687/btab355f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df27/8504632/3d9dd82cacfa/btab355f4.jpg

相似文献

1
Improving deep learning-based protein distance prediction in CASP14.在蛋白质结构预测关键评估第14轮(CASP14)中改进基于深度学习的蛋白质距离预测
Bioinformatics. 2021 Oct 11;37(19):3190-3196. doi: 10.1093/bioinformatics/btab355.
2
MULTICOM2 open-source protein structure prediction system powered by deep learning and distance prediction.基于深度学习和距离预测的 MULTICOM2 开源蛋白质结构预测系统。
Sci Rep. 2021 Jun 23;11(1):13155. doi: 10.1038/s41598-021-92395-6.
3
Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14.通过深度学习和距离预测改进 CASP14 中的蛋白质三级结构预测。
Proteins. 2022 Jan;90(1):58-72. doi: 10.1002/prot.26186. Epub 2021 Jul 27.
4
Analysis of several key factors influencing deep learning-based inter-residue contact prediction.分析影响基于深度学习的残基间接触预测的几个关键因素。
Bioinformatics. 2020 Feb 15;36(4):1091-1098. doi: 10.1093/bioinformatics/btz679.
5
Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14.基于深度学习和残差距离预测的蛋白质模型准确性估计在 CASP14 中的应用。
Sci Rep. 2021 May 25;11(1):10943. doi: 10.1038/s41598-021-90303-6.
6
The MULTICOM Protein Structure Prediction Server Empowered by Deep Learning and Contact Distance Prediction.基于深度学习和接触距离预测的 MULTICOM 蛋白质结构预测服务器。
Methods Mol Biol. 2020;2165:13-26. doi: 10.1007/978-1-0716-0708-4_2.
7
A deep dilated convolutional residual network for predicting interchain contacts of protein homodimers.一种用于预测蛋白质同源二聚体链间接触的深度扩张卷积残差网络。
Bioinformatics. 2022 Mar 28;38(7):1904-1910. doi: 10.1093/bioinformatics/btac063.
8
DeepDist: real-value inter-residue distance prediction with deep residual convolutional network.DeepDist:基于深度残差卷积网络的真实残差距离预测。
BMC Bioinformatics. 2021 Jan 25;22(1):30. doi: 10.1186/s12859-021-03960-9.
9
Improving AlphaFold2-based protein tertiary structure prediction with MULTICOM in CASP15.在蛋白质结构预测关键评估第15轮(CASP15)中使用MULTICOM改进基于AlphaFold2的蛋白质三级结构预测
Commun Chem. 2023 Sep 7;6(1):188. doi: 10.1038/s42004-023-00991-6.
10
ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks.ResPRE:通过结合精度矩阵和深度残差神经网络进行高精度蛋白质接触预测。
Bioinformatics. 2019 Nov 1;35(22):4647-4655. doi: 10.1093/bioinformatics/btz291.

引用本文的文献

1
Freeprotmap: waiting-free prediction method for protein distance map.Freeprotmap:一种无等待的蛋白质距离图预测方法。
BMC Bioinformatics. 2024 May 4;25(1):176. doi: 10.1186/s12859-024-05771-0.
2
Refinement of AlphaFold2 models against experimental and hybrid cryo-EM density maps.针对实验性和混合冷冻电镜密度图对AlphaFold2模型进行优化。
QRB Discov. 2022;3. doi: 10.1017/qrd.2022.13. Epub 2022 Sep 20.
3
Prediction of inter-chain distance maps of protein complexes with 2D attention-based deep neural networks.基于二维注意力的深度学习神经网络预测蛋白质复合物的链间距离图谱。

本文引用的文献

1
Combination of deep neural network with attention mechanism enhances the explainability of protein contact prediction.深度神经网络与注意力机制的结合增强了蛋白质接触预测的可解释性。
Proteins. 2021 Jun;89(6):697-707. doi: 10.1002/prot.26052. Epub 2021 Feb 16.
2
DeepDist: real-value inter-residue distance prediction with deep residual convolutional network.DeepDist:基于深度残差卷积网络的真实残差距离预测。
BMC Bioinformatics. 2021 Jan 25;22(1):30. doi: 10.1186/s12859-021-03960-9.
3
Improved protein structure prediction using potentials from deep learning.
Nat Commun. 2022 Nov 15;13(1):6963. doi: 10.1038/s41467-022-34600-2.
4
High-Performance Deep Learning Toolbox for Genome-Scale Prediction of Protein Structure and Function.用于蛋白质结构和功能基因组规模预测的高性能深度学习工具箱。
Workshop Mach Learn HPC Environ. 2021 Nov;2021:46-57. doi: 10.1109/mlhpc54614.2021.00010. Epub 2021 Dec 27.
5
Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14.通过深度学习和距离预测改进 CASP14 中的蛋白质三级结构预测。
Proteins. 2022 Jan;90(1):58-72. doi: 10.1002/prot.26186. Epub 2021 Jul 27.
利用深度学习势进行蛋白质结构预测的改进。
Nature. 2020 Jan;577(7792):706-710. doi: 10.1038/s41586-019-1923-7. Epub 2020 Jan 15.
4
Improved protein structure prediction using predicted interresidue orientations.利用预测的残基间取向改进蛋白质结构预测。
Proc Natl Acad Sci U S A. 2020 Jan 21;117(3):1496-1503. doi: 10.1073/pnas.1914677117. Epub 2020 Jan 2.
5
DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins.DeepMSA:构建深度多重序列比对以改进远距离同源蛋白质的接触预测和折叠识别。
Bioinformatics. 2020 Apr 1;36(7):2105-2112. doi: 10.1093/bioinformatics/btz863.
6
Critical assessment of methods of protein structure prediction (CASP)-Round XIII.蛋白质结构预测方法的关键评估(CASP)-第十三轮。
Proteins. 2019 Dec;87(12):1011-1020. doi: 10.1002/prot.25823. Epub 2019 Oct 23.
7
Analysis of several key factors influencing deep learning-based inter-residue contact prediction.分析影响基于深度学习的残基间接触预测的几个关键因素。
Bioinformatics. 2020 Feb 15;36(4):1091-1098. doi: 10.1093/bioinformatics/btz679.
8
Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints.深度学习利用迭代预测的结构约束来扩展从头开始的蛋白质建模对基因组的覆盖范围。
Nat Commun. 2019 Sep 4;10(1):3977. doi: 10.1038/s41467-019-11994-0.
9
Analysis of distance-based protein structure prediction by deep learning in CASP13.基于深度学习的 CASP13 蛋白质结构预测距离分析。
Proteins. 2019 Dec;87(12):1069-1081. doi: 10.1002/prot.25810. Epub 2019 Sep 13.
10
Deep-learning contact-map guided protein structure prediction in CASP13.深度学习接触图指导的 CASP13 蛋白质结构预测。
Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14.