• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质温度依赖性结构集合的深度生成建模

Deep generative modeling of temperature-dependent structural ensembles of proteins.

作者信息

Janson Giacomo, Jussupow Alexander, Feig Michael

机构信息

Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA.

出版信息

bioRxiv. 2025 Mar 13:2025.03.09.642148. doi: 10.1101/2025.03.09.642148.

DOI:10.1101/2025.03.09.642148
PMID:40161645
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11952339/
Abstract

Deep learning has revolutionized protein structure prediction, but capturing conformational ensembles and structural variability remains an open challenge. While molecular dynamics (MD) is the foundation method for simulating biomolecular dynamics, it is computationally expensive. Recently, deep learning models trained on MD have made progress in generating structural ensembles at reduced cost. However, they remain limited in modeling atomistic details and, crucially, incorporating the effect of environmental factors. Here, we present aSAM (atomistic structural autoencoder model), a latent diffusion model trained on MD to generate heavy atom protein ensembles. Unlike most methods, aSAM models atoms in a latent space, greatly facilitating accurate sampling of side chain and backbone torsion angle distributions. Additionally, we extended aSAM into the first reported transferable generator conditioned on temperature, named aSAMt. Trained on the large and open mdCATH dataset, aSAMt captures temperature-dependent ensemble properties and demonstrates generalization beyond training temperatures. By comparing aSAMt ensembles to long MD simulations of fast folding proteins, we find that high-temperature training enhances the ability of deep generators to explore energy landscapes. Finally, we also show that our MD-based aSAMt can already capture experimentally observed thermal behavior of proteins. Our work is a step towards generalizable ensemble generation to complement physics-based approaches.

摘要

深度学习彻底改变了蛋白质结构预测,但捕捉构象集合和结构变异性仍然是一个悬而未决的挑战。虽然分子动力学(MD)是模拟生物分子动力学的基础方法,但其计算成本很高。最近,基于MD训练的深度学习模型在以降低的成本生成结构集合方面取得了进展。然而,它们在对原子细节建模以及关键地纳入环境因素的影响方面仍然存在局限性。在这里,我们提出了aSAM(原子结构自动编码器模型),这是一种基于MD训练的潜在扩散模型,用于生成重原子蛋白质集合。与大多数方法不同,aSAM在潜在空间中对原子进行建模,极大地促进了侧链和主链扭转角分布的准确采样。此外,我们将aSAM扩展为第一个报道的以温度为条件的可转移生成器,名为aSAMt。在大型开放的mdCATH数据集上进行训练,aSAMt捕捉了温度依赖性集合特性,并展示了超越训练温度的泛化能力。通过将aSAMt集合与快速折叠蛋白质的长时间MD模拟进行比较,我们发现高温训练增强了深度生成器探索能量景观的能力。最后,我们还表明,我们基于MD的aSAMt已经可以捕捉实验观察到的蛋白质热行为。我们的工作是朝着可泛化的集合生成迈出的一步,以补充基于物理的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/8ed01e59b398/nihpp-2025.03.09.642148v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/f62776725f86/nihpp-2025.03.09.642148v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/5bf8206f5df9/nihpp-2025.03.09.642148v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/7eb8f7a6e3f4/nihpp-2025.03.09.642148v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/24a7500af6d8/nihpp-2025.03.09.642148v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/c65aeb1e06a3/nihpp-2025.03.09.642148v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/9776fcaf5505/nihpp-2025.03.09.642148v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/8ed01e59b398/nihpp-2025.03.09.642148v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/f62776725f86/nihpp-2025.03.09.642148v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/5bf8206f5df9/nihpp-2025.03.09.642148v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/7eb8f7a6e3f4/nihpp-2025.03.09.642148v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/24a7500af6d8/nihpp-2025.03.09.642148v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/c65aeb1e06a3/nihpp-2025.03.09.642148v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/9776fcaf5505/nihpp-2025.03.09.642148v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1529/11952339/8ed01e59b398/nihpp-2025.03.09.642148v1-f0007.jpg

相似文献

1
Deep generative modeling of temperature-dependent structural ensembles of proteins.蛋白质温度依赖性结构集合的深度生成建模
bioRxiv. 2025 Mar 13:2025.03.09.642148. doi: 10.1101/2025.03.09.642148.
2
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Active body surface warming systems for preventing complications caused by inadvertent perioperative hypothermia in adults.用于预防成人围手术期意外低温引起并发症的主动体表升温系统。
Cochrane Database Syst Rev. 2016 Apr 21;4(4):CD009016. doi: 10.1002/14651858.CD009016.pub2.
5
Sexual Harassment and Prevention Training性骚扰与预防培训
6
Predicting Affinity Through Homology (PATH): Interpretable Binding Affinity Prediction with Persistent Homology.通过同源性预测亲和力(PATH):基于持久同源性的可解释结合亲和力预测
bioRxiv. 2024 Oct 21:2023.11.16.567384. doi: 10.1101/2023.11.16.567384.
7
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历:系统检索与综述
Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.
8
Physical exercise for people with Parkinson's disease: a systematic review and network meta-analysis.帕金森病患者的身体锻炼:系统评价和网络荟萃分析。
Cochrane Database Syst Rev. 2023 Jan 5;1(1):CD013856. doi: 10.1002/14651858.CD013856.pub2.
9
Macrolide antibiotics (including azithromycin) for cystic fibrosis.大环内酯类抗生素(包括阿奇霉素)治疗囊性纤维化。
Cochrane Database Syst Rev. 2024 Feb 27;2(2):CD002203. doi: 10.1002/14651858.CD002203.pub5.
10
Short-Term Memory Impairment短期记忆障碍

本文引用的文献

1
Scalable emulation of protein equilibrium ensembles with generative deep learning.利用生成式深度学习对蛋白质平衡系综进行可扩展模拟。
Science. 2025 Jul 10:eadv9817. doi: 10.1126/science.adv9817.
2
P2DFlow: A Protein Ensemble Generative Model with SE(3) Flow Matching.P2D流:一种具有SE(3)流匹配的蛋白质集合生成模型。
J Chem Theory Comput. 2025 Mar 25;21(6):3288-3296. doi: 10.1021/acs.jctc.4c01620. Epub 2025 Mar 10.
3
Modeling Boltzmann-weighted structural ensembles of proteins using artificial intelligence-based methods.使用基于人工智能的方法对蛋白质的玻尔兹曼加权结构集合进行建模。
Curr Opin Struct Biol. 2025 Apr;91:103000. doi: 10.1016/j.sbi.2025.103000. Epub 2025 Feb 8.
4
COCOMO2: A Coarse-Grained Model for Interacting Folded and Disordered Proteins.COCOMO2:一种用于相互作用的折叠和无序蛋白质的粗粒度模型。
J Chem Theory Comput. 2025 Feb 25;21(4):2095-2107. doi: 10.1021/acs.jctc.4c01460. Epub 2025 Feb 5.
5
Simulating 500 million years of evolution with a language model.用语言模型模拟5亿年的进化历程。
Science. 2025 Feb 21;387(6736):850-858. doi: 10.1126/science.ads0018. Epub 2025 Jan 16.
6
mdCATH: A Large-Scale MD Dataset for Data-Driven Computational Biophysics.mdCATH:用于数据驱动计算生物物理学的大规模 MD 数据集。
Sci Data. 2024 Nov 28;11(1):1299. doi: 10.1038/s41597-024-04140-z.
7
Structure prediction of alternative protein conformations.蛋白质变构构象预测。
Nat Commun. 2024 Aug 26;15(1):7328. doi: 10.1038/s41467-024-51507-2.
8
TemBERTure: advancing protein thermostability prediction with deep learning and attention mechanisms.TemBERTure:利用深度学习和注意力机制推进蛋白质热稳定性预测
Bioinform Adv. 2024 Jul 13;4(1):vbae103. doi: 10.1093/bioadv/vbae103. eCollection 2024.
9
Transferable deep generative modeling of intrinsically disordered protein conformations.可转移的深度生成模型对固有无序蛋白质构象的建模。
PLoS Comput Biol. 2024 May 23;20(5):e1012144. doi: 10.1371/journal.pcbi.1012144. eCollection 2024 May.
10
OpenFold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization.OpenFold:重新训练 AlphaFold2 可深入了解其学习机制和泛化能力。
Nat Methods. 2024 Aug;21(8):1514-1524. doi: 10.1038/s41592-024-02272-z. Epub 2024 May 14.