基于深度学习的单域和多域蛋白质结构预测与D-I-TASSER

Deep-learning-based single-domain and multidomain protein structure prediction with D-I-TASSER.

作者信息

Zheng Wei, Wuyun Qiqige, Li Yang, Liu Quancheng, Zhou Xiaogen, Peng Chunxiang, Zhu Yiheng, Freddolino Lydia, Zhang Yang

机构信息

NITFID, School of Statistics and Data Science, AAIS, LPMC and KLMDASR, Nankai University, Tianjin, China.

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.

出版信息

Nat Biotechnol. 2025 May 23. doi: 10.1038/s41587-025-02654-4.

DOI:10.1038/s41587-025-02654-4

PMID:40410405

Abstract

The dominant success of deep learning techniques on protein structure prediction has challenged the necessity and usefulness of traditional force field-based folding simulations. We proposed a hybrid approach, deep-learning-based iterative threading assembly refinement (D-I-TASSER), which constructs atomic-level protein structural models by integrating multisource deep learning potentials with iterative threading fragment assembly simulations. D-I-TASSER introduces a domain splitting and assembly protocol for the automated modeling of large multidomain protein structures. Benchmark tests and the most recent critical assessment of protein structure prediction, 15 experiments demonstrate that D-I-TASSER outperforms AlphaFold2 and AlphaFold3 on both single-domain and multidomain proteins. Large-scale folding experiments further show that D-I-TASSER could fold 81% of protein domains and 73% of full-chain sequences in the human proteome with results highly complementary to recently released models by AlphaFold2. These results highlight a new avenue to integrate deep learning with classical physics-based folding simulations for high-accuracy protein structure and function predictions that are usable in genome-wide applications.

摘要

深度学习技术在蛋白质结构预测方面的显著成功对传统基于力场的折叠模拟的必要性和实用性提出了挑战。我们提出了一种混合方法，即基于深度学习的迭代穿线装配优化（D-I-TASSER），它通过将多源深度学习势与迭代穿线片段装配模拟相结合来构建原子级蛋白质结构模型。D-I-TASSER引入了一种结构域拆分和装配协议，用于大型多结构域蛋白质结构的自动建模。基准测试以及蛋白质结构预测的最新关键评估（15项实验）表明，D-I-TASSER在单结构域和多结构域蛋白质上均优于AlphaFold2和AlphaFold3。大规模折叠实验进一步表明，D-I-TASSER能够折叠人类蛋白质组中81%的蛋白质结构域和73%的全链序列，其结果与AlphaFold2最近发布的模型高度互补。这些结果突出了一条将深度学习与基于经典物理学的折叠模拟相结合的新途径，用于在全基因组应用中可用的高精度蛋白质结构和功能预测。

相似文献

Deep-learning-based single-domain and multidomain protein structure prediction with D-I-TASSER.基于深度学习的单域和多域蛋白质结构预测与D-I-TASSER

Nat Biotechnol. 2025 May 23. doi: 10.1038/s41587-025-02654-4.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Measures implemented in the school setting to contain the COVID-19 pandemic.学校为控制 COVID-19 疫情而采取的措施。

Cochrane Database Syst Rev. 2022 Jan 17;1(1):CD015029. doi: 10.1002/14651858.CD015029.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Consensus structure prediction of A. thaliana's MCTP4 structure using prediction tools and coarse grained simulations of transmembrane domain dynamics.使用预测工具和跨膜结构域动力学的粗粒度模拟对拟南芥MCTP4结构进行共识结构预测。

PLoS One. 2025 Jul 15;20(7):e0326993. doi: 10.1371/journal.pone.0326993. eCollection 2025.

123I-MIBG scintigraphy and 18F-FDG-PET imaging for diagnosing neuroblastoma.用于诊断神经母细胞瘤的123I-间碘苄胍闪烁扫描术和18F-氟代脱氧葡萄糖正电子发射断层显像

Cochrane Database Syst Rev. 2015 Sep 29;2015(9):CD009263. doi: 10.1002/14651858.CD009263.pub2.

Psychological interventions for adults who have sexually offended or are at risk of offending.针对有性犯罪行为或有性犯罪风险的成年人的心理干预措施。

Cochrane Database Syst Rev. 2012 Dec 12;12(12):CD007507. doi: 10.1002/14651858.CD007507.pub2.

Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。

Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.

A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。

Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.

Assessment of Protein Complex Predictions in CASP16: Are we making progress?对蛋白质复合物预测在蛋白质结构预测关键评估第16轮中的评估：我们有进展吗？

bioRxiv. 2025 May 30:2025.05.29.656875. doi: 10.1101/2025.05.29.656875.

引用本文的文献

Controlling Nanonet Morphology via Residue-Specific Modulation of β-Hairpin Peptide for Enhanced Bacterial Trapping.通过对β-发夹肽进行残基特异性调控来控制纳米网形态以增强细菌捕获

Small. 2025 Sep;21(35):e2505823. doi: 10.1002/smll.202505823. Epub 2025 Jul 7.

本文引用的文献

Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。

Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.

Improving deep learning protein monomer and complex structure prediction using DeepMSA2 with huge metagenomics data.利用 DeepMSA2 和海量宏基因组学数据改进深度学习蛋白质单体和复合物结构预测。

Nat Methods. 2024 Feb;21(2):279-289. doi: 10.1038/s41592-023-02130-4. Epub 2024 Jan 2.

Critical assessment of methods of protein structure prediction (CASP)-Round XV.蛋白质结构预测方法的关键评估（CASP）-第十五轮。

Proteins. 2023 Dec;91(12):1539-1549. doi: 10.1002/prot.26617. Epub 2023 Nov 2.

The impact of AI-based modeling on the accuracy of protein assembly prediction: Insights from CASP15.基于人工智能的建模对蛋白质组装预测准确性的影响：来自 CASP15 的见解。

Proteins. 2023 Dec;91(12):1636-1657. doi: 10.1002/prot.26598. Epub 2023 Oct 20.

Improved multimer prediction using massive sampling with AlphaFold in CASP15.使用 AlphaFold 在 CASP15 中进行大规模采样以提高多聚体预测。

Proteins. 2023 Dec;91(12):1734-1746. doi: 10.1002/prot.26562. Epub 2023 Aug 7.

To split or not to split: CASP15 targets and their processing into tertiary structure evaluation units.要分割还是不分割：CASP15 目标及其处理为三级结构评估单元。

Proteins. 2023 Dec;91(12):1558-1570. doi: 10.1002/prot.26533. Epub 2023 May 31.

Fast and accurate Ab Initio Protein structure prediction using deep learning potentials.使用深度学习势能进行快速准确的从头开始蛋白质结构预测。

PLoS Comput Biol. 2022 Sep 16;18(9):e1010539. doi: 10.1371/journal.pcbi.1010539. eCollection 2022 Sep.

Deep learning geometrical potential for high-accuracy protein structure prediction.用于高精度蛋白质结构预测的深度学习几何势

iScience. 2022 May 18;25(6):104425. doi: 10.1016/j.isci.2022.104425. eCollection 2022 Jun 17.

ColabFold: making protein folding accessible to all.ColabFold：让蛋白质折叠变得人人可用。

Nat Methods. 2022 Jun;19(6):679-682. doi: 10.1038/s41592-022-01488-1. Epub 2022 May 30.

DEMO2: Assemble multi-domain protein structures by coupling analogous template alignments with deep-learning inter-domain restraint prediction.DEMO2：通过将类似的模板比对与深度学习的域间约束预测相结合，组装多结构域蛋白质结构。

Nucleic Acids Res. 2022 Jul 5;50(W1):W235-W245. doi: 10.1093/nar/gkac340.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于深度学习的单域和多域蛋白质结构预测与D-I-TASSER

Deep-learning-based single-domain and multidomain protein structure prediction with D-I-TASSER.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献