A Hybrid Bottom-Up and Data-Driven Machine Learning Approach for Accurate Coarse-Graining of Large Molecular Complexes.

Authors

Liebl Korbinian, Voth Gregory A

Affiliation

Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, United States.

Publication information

J Chem Theory Comput. 2025 May 13;21(9):4846-4854. doi: 10.1021/acs.jctc.5c00063. Epub 2025 Apr 16.

DOI:10.1021/acs.jctc.5c00063

PMID:40241350

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12268871/

Abstract

Bottom-up coarse-graining refers to the development of low-resolution simulation models that are thermodynamically consistent with certain distributions from fully atomistic simulations. Force-matching and relative entropy minimization are two major, frequently applied methods for developing such bottom-up models. Nevertheless, atomistic simulations can often provide only limited sampling of the phase space. For bottom-up coarse-graining, these limitations may result in overfitting to the atomistic reference data, especially for large molecular complexes, where the learning may be agnostic of the actual affinities between binding partners. As a solution to this problem, we devise a data-driven machine learning hybrid coarse-graining concept that represents a regularized version of the relative entropy minimization approach. We demonstrate that this new approach allows one to develop coarse-grained models for molecular complexes that not only reproduce the targeted binding affinity but also describe the underlying complex structure accurately. The trained models therefore show diverse behavior: they can undergo frequent unbinding and binding events and are also transferable to simulations of entire protein lattices, e.g., for a virus capsid.

Figure: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7761/12409876/7964cf3b4bc1/ct5c00063_0001.jpg
