• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在 CASP15 中,通过深度学习、线程比对和多 MSAs 策略,实现高质量蛋白质单体和复合物结构预测。

Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15.

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA.

Department of Biological Chemistry, University of Michigan, Ann Arbor, Michigan, USA.

出版信息

Proteins. 2023 Dec;91(12):1684-1703. doi: 10.1002/prot.26585. Epub 2023 Aug 31.

DOI:10.1002/prot.26585
PMID:37650367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10840719/
Abstract

We report the results of the "UM-TBM" and "Zheng" groups in CASP15 for protein monomer and complex structure prediction. These prediction sets were obtained using the D-I-TASSER and DMFold-Multimer algorithms, respectively. For monomer structure prediction, D-I-TASSER introduced four new features during CASP15: (i) a multiple sequence alignment (MSA) generation protocol that combines multi-source MSA searching and a structural modeling-based MSA ranker; (ii) attention-network based spatial restraints; (iii) a multi-domain module containing domain partition and arrangement for domain-level templates and spatial restraints; (iv) an optimized I-TASSER-based folding simulation system for full-length model creation guided by a combination of deep learning restraints, threading alignments, and knowledge-based potentials. For 47 free modeling targets in CASP15, the final models predicted by D-I-TASSER showed average TM-score 19% higher than the standard AlphaFold2 program. We thus showed that traditional Monte Carlo-based folding simulations, when appropriately coupled with deep learning algorithms, can generate models with improved accuracy over end-to-end deep learning methods alone. For protein complex structure prediction, DMFold-Multimer generated models by integrating a new MSA generation algorithm (DeepMSA2) with the end-to-end modeling module from AlphaFold2-Multimer. For the 38 complex targets, DMFold-Multimer generated models with an average TM-score of 0.83 and Interface Contact Score of 0.60, both significantly higher than those of competing complex prediction tools. Our analyses on complexes highlighted the critical role played by MSA generating, ranking, and pairing in protein complex structure prediction. We also discuss future room for improvement in the areas of viral protein modeling and complex model ranking.

摘要

我们报告了“UM-TBM”和“Zheng”小组在 CASP15 中的蛋白质单体和复合物结构预测结果。这些预测集分别使用 D-I-TASSER 和 DMFold-Multimer 算法获得。对于单体结构预测,D-I-TASSER 在 CASP15 中引入了四个新特性:(i)一种多序列比对(MSA)生成协议,它结合了多源 MSA 搜索和基于结构建模的 MSA 排名器;(ii)基于注意力网络的空间限制;(iii)一个多域模块,包含用于域级模板和空间限制的域划分和排列;(iv)一个优化的基于 I-TASSER 的折叠模拟系统,用于在深度学习限制、线程比对和基于知识的势能的组合指导下创建全长模型。对于 CASP15 中的 47 个自由建模目标,D-I-TASSER 预测的最终模型的平均 TM 分数比标准的 AlphaFold2 程序高 19%。因此,我们表明,当与深度学习算法适当结合时,传统的基于蒙特卡罗的折叠模拟可以生成比仅基于端到端深度学习方法更准确的模型。对于蛋白质复合物结构预测,DMFold-Multimer 通过将新的 MSA 生成算法(DeepMSA2)与来自 AlphaFold2-Multimer 的端到端建模模块集成,生成模型。对于 38 个复合物目标,DMFold-Multimer 生成的模型的平均 TM 分数为 0.83,界面接触分数为 0.60,均显著高于竞争复合物预测工具的分数。我们对复合物的分析强调了 MSA 生成、排名和配对在蛋白质复合物结构预测中的关键作用。我们还讨论了在病毒蛋白建模和复合物模型排名领域进一步改进的空间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/f17864cf4a7b/nihms-1925904-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/4ee8c4b1c36c/nihms-1925904-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/1a811682a047/nihms-1925904-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/c55a20754aef/nihms-1925904-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/1d4763560e23/nihms-1925904-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/0ef602127077/nihms-1925904-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/41f6c6e4b307/nihms-1925904-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/e2508da3fd15/nihms-1925904-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/f17864cf4a7b/nihms-1925904-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/4ee8c4b1c36c/nihms-1925904-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/1a811682a047/nihms-1925904-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/c55a20754aef/nihms-1925904-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/1d4763560e23/nihms-1925904-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/0ef602127077/nihms-1925904-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/41f6c6e4b307/nihms-1925904-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/e2508da3fd15/nihms-1925904-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1cb7/10840719/f17864cf4a7b/nihms-1925904-f0008.jpg

相似文献

1
Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15.在 CASP15 中,通过深度学习、线程比对和多 MSAs 策略,实现高质量蛋白质单体和复合物结构预测。
Proteins. 2023 Dec;91(12):1684-1703. doi: 10.1002/prot.26585. Epub 2023 Aug 31.
2
Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14.使用深度学习距离和氢键约束进行 CASP14 中的蛋白质结构预测。
Proteins. 2021 Dec;89(12):1734-1751. doi: 10.1002/prot.26193. Epub 2021 Aug 7.
3
Deep-learning contact-map guided protein structure prediction in CASP13.深度学习接触图指导的 CASP13 蛋白质结构预测。
Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14.
4
Improving deep learning protein monomer and complex structure prediction using DeepMSA2 with huge metagenomics data.利用 DeepMSA2 和海量宏基因组学数据改进深度学习蛋白质单体和复合物结构预测。
Nat Methods. 2024 Feb;21(2):279-289. doi: 10.1038/s41592-023-02130-4. Epub 2024 Jan 2.
5
Template-based protein structure prediction in CASP11 and retrospect of I-TASSER in the last decade.CASP11中基于模板的蛋白质结构预测及I-TASSER在过去十年的回顾。
Proteins. 2016 Sep;84 Suppl 1(Suppl 1):233-46. doi: 10.1002/prot.24918. Epub 2015 Sep 18.
6
Improved protein structure prediction with trRosettaX2, AlphaFold2, and optimized MSAs in CASP15.利用 trRosettaX2、AlphaFold2 和优化的 MSAs 在 CASP15 中提高蛋白质结构预测。
Proteins. 2023 Dec;91(12):1704-1711. doi: 10.1002/prot.26570. Epub 2023 Aug 10.
7
TASSER_WT: a protein structure prediction algorithm with accurate predicted contact restraints for difficult protein targets.TASSER_WT:一种蛋白质结构预测算法,对困难的蛋白质靶标具有准确预测的接触约束。
Biophys J. 2010 Nov 3;99(9):3066-75. doi: 10.1016/j.bpj.2010.09.007.
8
Protein structure prediction by pro-Sp3-TASSER.通过亲Sp3-TASSER进行蛋白质结构预测。
Biophys J. 2009 Mar 18;96(6):2119-27. doi: 10.1016/j.bpj.2008.12.3898.
9
Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15.在蛋白质结构预测关键评估第15轮(CASP15)中利用MULTICOM增强基于AlphaFold-Multimer的蛋白质复合物结构预测
bioRxiv. 2023 May 18:2023.05.16.541055. doi: 10.1101/2023.05.16.541055.
10
Template-based and free modeling of I-TASSER and QUARK pipelines using predicted contact maps in CASP12.在蛋白质结构预测技术评估第12轮(CASP12)中,基于模板以及I-TASSER和QUARK流程的自由建模,并使用预测的接触图。
Proteins. 2018 Mar;86 Suppl 1(Suppl 1):136-151. doi: 10.1002/prot.25414. Epub 2017 Nov 14.

引用本文的文献

1
EZpred: improving deep learning-based enzyme function prediction using unlabeled sequence homologs.EZpred:利用未标记的序列同源物改进基于深度学习的酶功能预测
bioRxiv. 2025 Jul 14:2025.07.09.663945. doi: 10.1101/2025.07.09.663945.
2
Artificial Intelligence (AI)-Based Protein Structure Prediction and Analysis.基于人工智能(AI)的蛋白质结构预测与分析
Methods Mol Biol. 2025;2952:39-57. doi: 10.1007/978-1-0716-4690-8_3.
3
Using multiple computer-predicted structures as molecular replacement models: application to the antiviral mini-protein LCB2.

本文引用的文献

1
Fast and accurate Ab Initio Protein structure prediction using deep learning potentials.使用深度学习势能进行快速准确的从头开始蛋白质结构预测。
PLoS Comput Biol. 2022 Sep 16;18(9):e1010539. doi: 10.1371/journal.pcbi.1010539. eCollection 2022 Sep.
2
Deep learning geometrical potential for high-accuracy protein structure prediction.用于高精度蛋白质结构预测的深度学习几何势
iScience. 2022 May 18;25(6):104425. doi: 10.1016/j.isci.2022.104425. eCollection 2022 Jun 17.
3
ColabFold: making protein folding accessible to all.ColabFold:让蛋白质折叠变得人人可用。
使用多个计算机预测结构作为分子置换模型:应用于抗病毒小蛋白LCB2
IUCrJ. 2025 Jul 1;12(Pt 4):488-501. doi: 10.1107/S2052252525005123.
4
CASP16 protein monomer structure prediction assessment.半胱天冬酶16(CASP16)蛋白单体结构预测评估
bioRxiv. 2025 Jun 2:2025.05.29.656942. doi: 10.1101/2025.05.29.656942.
5
Assessment of Protein Complex Predictions in CASP16: Are we making progress?对蛋白质复合物预测在蛋白质结构预测关键评估第16轮中的评估:我们有进展吗?
bioRxiv. 2025 May 30:2025.05.29.656875. doi: 10.1101/2025.05.29.656875.
6
Multimeric protein interaction and complex prediction: Structure, dynamics and function.多聚体蛋白质相互作用与复合物预测:结构、动力学与功能
Comput Struct Biotechnol J. 2025 May 16;27:1975-1997. doi: 10.1016/j.csbj.2025.05.009. eCollection 2025.
7
Characterization of two GH10 enzymes with ability to hydrolyze pretreated Sorghum bicolor bagasse.两种具有水解预处理高粱双色甘蔗渣能力的GH10酶的特性分析。
Appl Microbiol Biotechnol. 2025 Apr 28;109(1):104. doi: 10.1007/s00253-025-13484-4.
8
Genomic and proteomic analyses of Nus-dependent non-lambdoid phages reveal a novel coliphage group prevalent in gut: mEp.对Nus依赖性非λ样噬菌体的基因组和蛋白质组分析揭示了一种在肠道中普遍存在的新型大肠杆菌噬菌体群:mEp。
Front Microbiol. 2025 Feb 24;16:1480411. doi: 10.3389/fmicb.2025.1480411. eCollection 2025.
9
Physical-aware model accuracy estimation for protein complex using deep learning method.基于深度学习方法的蛋白质复合物物理感知模型准确性估计
Comput Struct Biotechnol J. 2025 Jan 22;27:478-487. doi: 10.1016/j.csbj.2025.01.017. eCollection 2025.
10
A novel FadL family outer membrane transporter is involved in the uptake of polycyclic aromatic hydrocarbons.一种新型的FadL家族外膜转运蛋白参与多环芳烃的摄取。
Appl Environ Microbiol. 2025 Feb 19;91(2):e0082724. doi: 10.1128/aem.00827-24. Epub 2025 Jan 24.
Nat Methods. 2022 Jun;19(6):679-682. doi: 10.1038/s41592-022-01488-1. Epub 2022 May 30.
4
DEMO2: Assemble multi-domain protein structures by coupling analogous template alignments with deep-learning inter-domain restraint prediction.DEMO2:通过将类似的模板比对与深度学习的域间约束预测相结合,组装多结构域蛋白质结构。
Nucleic Acids Res. 2022 Jul 5;50(W1):W235-W245. doi: 10.1093/nar/gkac340.
5
LOMETS3: integrating deep learning and profile alignment for advanced protein template recognition and function annotation.LOMETS3:集成深度学习和轮廓比对,用于高级蛋白质模板识别和功能注释。
Nucleic Acids Res. 2022 Jul 5;50(W1):W454-W464. doi: 10.1093/nar/gkac248.
6
Improved prediction of protein-protein interactions using AlphaFold2.利用 AlphaFold2 提高蛋白质-蛋白质相互作用预测的准确性。
Nat Commun. 2022 Mar 10;13(1):1265. doi: 10.1038/s41467-022-28865-w.
7
Decoding the link of microbiome niches with homologous sequences enables accurately targeted protein structure prediction.解析微生物组生态位与同源序列的联系,能够实现精确靶向的蛋白质结构预测。
Proc Natl Acad Sci U S A. 2021 Dec 7;118(49). doi: 10.1073/pnas.2110828118.
8
DisCovER: distance- and orientation-based covariational threading for weakly homologous proteins.DisCovER:基于距离和方向的弱同源蛋白质共变线程。
Proteins. 2022 Feb;90(2):579-588. doi: 10.1002/prot.26254. Epub 2021 Oct 11.
9
Improving fragment-based ab initio protein structure assembly using low-accuracy contact-map predictions.利用低精度接触图预测改进基于片段的从头蛋白质结构组装。
Nat Commun. 2021 Aug 18;12(1):5011. doi: 10.1038/s41467-021-25316-w.
10
Protein inter-residue contact and distance prediction by coupling complementary coevolution features with deep residual networks in CASP14.通过在 CASP14 中结合互补共进化特征和深度残差网络来预测蛋白质残基间的接触和距离。
Proteins. 2021 Dec;89(12):1911-1921. doi: 10.1002/prot.26211. Epub 2021 Aug 19.