• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分析四重序列比对对基于深度学习的蛋白质残基间距离预测的影响。

Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction.

机构信息

Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA.

Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA.

出版信息

Sci Rep. 2021 Apr 7;11(1):7574. doi: 10.1038/s41598-021-87204-z.

DOI:10.1038/s41598-021-87204-z
PMID:33828153
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8027171/
Abstract

Protein 3D structure prediction has advanced significantly in recent years due to improving contact prediction accuracy. This improvement has been largely due to deep learning approaches that predict inter-residue contacts and, more recently, distances using multiple sequence alignments (MSAs). In this work we present AttentiveDist, a novel approach that uses different MSAs generated with different E-values in a single model to increase the co-evolutionary information provided to the model. To determine the importance of each MSA's feature at the inter-residue level, we added an attention layer to the deep neural network. We show that combining four MSAs of different E-value cutoffs improved the model prediction performance as compared to single E-value MSA features. A further improvement was observed when an attention layer was used and even more when additional prediction tasks of bond angle predictions were added. The improvement of distance predictions were successfully transferred to achieve better protein tertiary structure modeling.

摘要

近年来,由于接触预测准确性的提高,蛋白质 3D 结构预测有了显著进展。这一改进在很大程度上要归功于深度学习方法,这些方法使用多重序列比对 (MSA) 预测残基间的接触,以及最近预测距离。在这项工作中,我们提出了 AttentiveDist,这是一种新颖的方法,它在单个模型中使用不同 E 值生成的不同 MSAs 来增加提供给模型的共进化信息。为了确定每个 MSA 在残基水平上的特征的重要性,我们在深度神经网络中添加了一个注意力层。我们表明,与单个 E 值 MSA 特征相比,结合四个不同 E 值截止值的 MSAs 可提高模型预测性能。当使用注意力层时观察到进一步的改进,而当添加键角预测等额外预测任务时则观察到更多的改进。距离预测的改进成功地转移,以实现更好的蛋白质三级结构建模。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/25e35b69825c/41598_2021_87204_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/eb9cf396979f/41598_2021_87204_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/4db56f4cb63f/41598_2021_87204_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/2c33a8557933/41598_2021_87204_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/7faa6611c88c/41598_2021_87204_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/e5cfe488c820/41598_2021_87204_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/1f91383bce07/41598_2021_87204_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/01b527dc2a6b/41598_2021_87204_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/0f64bad5d9bf/41598_2021_87204_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/2966de18210f/41598_2021_87204_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/25e35b69825c/41598_2021_87204_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/eb9cf396979f/41598_2021_87204_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/4db56f4cb63f/41598_2021_87204_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/2c33a8557933/41598_2021_87204_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/7faa6611c88c/41598_2021_87204_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/e5cfe488c820/41598_2021_87204_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/1f91383bce07/41598_2021_87204_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/01b527dc2a6b/41598_2021_87204_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/0f64bad5d9bf/41598_2021_87204_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/2966de18210f/41598_2021_87204_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d82/8027171/25e35b69825c/41598_2021_87204_Fig10_HTML.jpg

相似文献

1
Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction.分析四重序列比对对基于深度学习的蛋白质残基间距离预测的影响。
Sci Rep. 2021 Apr 7;11(1):7574. doi: 10.1038/s41598-021-87204-z.
2
DeepECA: an end-to-end learning framework for protein contact prediction from a multiple sequence alignment.DeepECA:一种基于多重序列比对的蛋白质接触预测端到端学习框架。
BMC Bioinformatics. 2020 Jan 9;21(1):10. doi: 10.1186/s12859-019-3190-x.
3
DNCON2_Inter: predicting interchain contacts for homodimeric and homomultimeric protein complexes using multiple sequence alignments of monomers and deep learning.DNCON2_Inter:使用单体的多重序列比对和深度学习预测同源二聚体和同源多聚体蛋白复合物的链间接触。
Sci Rep. 2021 Jun 10;11(1):12295. doi: 10.1038/s41598-021-91827-7.
4
ProALIGN: Directly Learning Alignments for Protein Structure Prediction via Exploiting Context-Specific Alignment Motifs.ProALIGN:通过利用上下文特定的对齐基序直接学习蛋白质结构预测的对齐。
J Comput Biol. 2022 Feb;29(2):92-105. doi: 10.1089/cmb.2021.0430. Epub 2022 Jan 21.
5
CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction.CopulaNet:直接从多序列比对中学习残基协同进化用于蛋白质结构预测。
Nat Commun. 2021 May 5;12(1):2535. doi: 10.1038/s41467-021-22869-8.
6
Improving deep learning-based protein distance prediction in CASP14.在蛋白质结构预测关键评估第14轮(CASP14)中改进基于深度学习的蛋白质距离预测
Bioinformatics. 2021 Oct 11;37(19):3190-3196. doi: 10.1093/bioinformatics/btab355.
7
Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14.通过深度学习和距离预测改进 CASP14 中的蛋白质三级结构预测。
Proteins. 2022 Jan;90(1):58-72. doi: 10.1002/prot.26186. Epub 2021 Jul 27.
8
Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14.使用深度学习距离和氢键约束进行 CASP14 中的蛋白质结构预测。
Proteins. 2021 Dec;89(12):1734-1751. doi: 10.1002/prot.26193. Epub 2021 Aug 7.
9
A Web-Based Protocol for Interprotein Contact Prediction by Deep Learning.基于深度学习的蛋白质间接触预测的网络协议。
Methods Mol Biol. 2020;2074:67-80. doi: 10.1007/978-1-4939-9873-9_6.
10
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。
Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.

引用本文的文献

1
Structure Modeling Protocols for Protein Multimer and RNA in CASP16 With Enhanced MSAs, Model Ranking, and Deep Learning.利用增强型多序列比对、模型排序和深度学习的CASP16中蛋白质多聚体和RNA的结构建模协议
Proteins. 2025 Aug 1. doi: 10.1002/prot.70033.
2
BloodProST: prediction of blood-secretory proteins through self-training.BloodProST:通过自我训练预测血液分泌蛋白
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf385.
3
Free-Docking and Template-Based Docking: Physics Versus Knowledge-Based Docking.自由对接和基于模板的对接:物理与基于知识的对接。

本文引用的文献

1
Improved protein structure prediction by deep learning irrespective of co-evolution information.通过深度学习改进蛋白质结构预测,与共进化信息无关。
Nat Mach Intell. 2021 Jul;3:601-609. doi: 10.1038/s42256-021-00348-5. Epub 2021 May 20.
2
Improved protein structure prediction using potentials from deep learning.利用深度学习势进行蛋白质结构预测的改进。
Nature. 2020 Jan;577(7792):706-710. doi: 10.1038/s41586-019-1923-7. Epub 2020 Jan 15.
3
Improved protein structure prediction using predicted interresidue orientations.利用预测的残基间取向改进蛋白质结构预测。
Methods Mol Biol. 2024;2780:27-41. doi: 10.1007/978-1-0716-3985-6_3.
4
GO2Sum: generating human-readable functional summary of proteins from GO terms.GO2Sum:从 GO 术语生成人类可读的蛋白质功能摘要。
NPJ Syst Biol Appl. 2024 Mar 15;10(1):29. doi: 10.1038/s41540-024-00358-0.
5
Pairwise and Multi-chain Protein Docking Enhanced Using LZerD Web Server.使用 LZerD 网络服务器增强的成对和多链蛋白质对接。
Methods Mol Biol. 2023;2690:355-373. doi: 10.1007/978-1-0716-3327-4_28.
6
A Machine Learning Method to Identify Umami Peptide Sequences by Using Multiplicative LSTM Embedded Features.一种利用乘法长短期记忆网络嵌入特征识别鲜味肽序列的机器学习方法。
Foods. 2023 Apr 2;12(7):1498. doi: 10.3390/foods12071498.
7
RL-MLZerD: Multimeric protein docking using reinforcement learning.RL-MLZerD:使用强化学习的多聚体蛋白质对接
Front Mol Biosci. 2022 Aug 26;9:969394. doi: 10.3389/fmolb.2022.969394. eCollection 2022.
8
ContactPFP: Protein function prediction using predicted contact information.ContactPFP:利用预测的接触信息进行蛋白质功能预测。
Front Bioinform. 2022 Jun;2. doi: 10.3389/fbinf.2022.896295. Epub 2022 Jun 2.
9
Protein Structural Modeling for Electron Microscopy Maps Using VESPER and MAINMAST.使用 VESPER 和 MAINMAST 进行电子显微镜图谱的蛋白质结构建模。
Curr Protoc. 2022 Jul;2(7):e494. doi: 10.1002/cpz1.494.
10
Inter-Residue Distance Prediction From Duet Deep Learning Models.基于二重深度学习模型的残基间距离预测
Front Genet. 2022 May 16;13:887491. doi: 10.3389/fgene.2022.887491. eCollection 2022.
Proc Natl Acad Sci U S A. 2020 Jan 21;117(3):1496-1503. doi: 10.1073/pnas.1914677117. Epub 2020 Jan 2.
4
DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins.DeepMSA:构建深度多重序列比对以改进远距离同源蛋白质的接触预测和折叠识别。
Bioinformatics. 2020 Apr 1;36(7):2105-2112. doi: 10.1093/bioinformatics/btz863.
5
Performance and enhancement of the LZerD protein assembly pipeline in CAPRI 38-46.LZerD 蛋白组装流水线在 CAPRI 38-46 中的表现和增强。
Proteins. 2020 Aug;88(8):948-961. doi: 10.1002/prot.25850. Epub 2019 Nov 25.
6
HH-suite3 for fast remote homology detection and deep protein annotation.HH-suite3 用于快速远程同源检测和深度蛋白质注释。
BMC Bioinformatics. 2019 Sep 14;20(1):473. doi: 10.1186/s12859-019-3019-7.
7
Analysis of distance-based protein structure prediction by deep learning in CASP13.基于深度学习的 CASP13 蛋白质结构预测距离分析。
Proteins. 2019 Dec;87(12):1069-1081. doi: 10.1002/prot.25810. Epub 2019 Sep 13.
8
Advances in protein structure prediction and design.蛋白质结构预测和设计的进展。
Nat Rev Mol Cell Biol. 2019 Nov;20(11):681-697. doi: 10.1038/s41580-019-0163-x. Epub 2019 Aug 15.
9
Ensembling multiple raw coevolutionary features with deep residual neural networks for contact-map prediction in CASP13.基于深度残差神经网络的原始共进化特征集成方法在 CASP13 中用于接触图预测。
Proteins. 2019 Dec;87(12):1082-1091. doi: 10.1002/prot.25798. Epub 2019 Aug 22.
10
Distance-based protein folding powered by deep learning.基于深度学习的距离相关蛋白质折叠。
Proc Natl Acad Sci U S A. 2019 Aug 20;116(34):16856-16865. doi: 10.1073/pnas.1821309116. Epub 2019 Aug 9.