通过机器学习从构象集合中获得的分子见解

Molecular Insights from Conformational Ensembles via Machine Learning.

作者信息

Fleetwood Oliver, Kasimova Marina A, Westerlund Annie M, Delemotte Lucie

机构信息

Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, Solna, Sweden.

出版信息

Biophys J. 2020 Feb 4;118(3):765-780. doi: 10.1016/j.bpj.2019.12.016. Epub 2019 Dec 21.

DOI:10.1016/j.bpj.2019.12.016

PMID:31952811

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7002924/

Abstract

Biomolecular simulations are intrinsically high dimensional and generate noisy data sets of ever-increasing size. Extracting important features from the data is crucial for understanding the biophysical properties of molecular processes, but remains a big challenge. Machine learning (ML) provides powerful dimensionality reduction tools. However, such methods are often criticized as resembling black boxes with limited human-interpretable insight. We use methods from supervised and unsupervised ML to efficiently create interpretable maps of important features from molecular simulations. We benchmark the performance of several methods, including neural networks, random forests, and principal component analysis, using a toy model with properties reminiscent of macromolecular behavior. We then analyze three diverse biological processes: conformational changes within the soluble protein calmodulin, ligand binding to a G protein-coupled receptor, and activation of an ion channel voltage-sensor domain, unraveling features critical for signal transduction, ligand binding, and voltage sensing. This work demonstrates the usefulness of ML in understanding biomolecular states and demystifying complex simulations.

摘要

生物分子模拟本质上是高维的，会生成规模不断增大的噪声数据集。从数据中提取重要特征对于理解分子过程的生物物理特性至关重要，但仍然是一项巨大挑战。机器学习（ML）提供了强大的降维工具。然而，此类方法常被批评为类似于黑箱，人类可解释的洞察力有限。我们使用监督式和无监督式机器学习方法，从分子模拟中高效创建重要特征的可解释图谱。我们使用一个具有类似大分子行为特性的玩具模型，对包括神经网络、随机森林和主成分分析在内的几种方法的性能进行基准测试。然后，我们分析三个不同的生物过程：可溶性蛋白钙调蛋白的构象变化、配体与G蛋白偶联受体的结合以及离子通道电压传感器结构域的激活，揭示对信号转导、配体结合和电压传感至关重要的特征。这项工作证明了机器学习在理解生物分子状态和揭开复杂模拟神秘面纱方面的有用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a52e/7002924/875728c4fc25/gr1.jpg

相似文献

Molecular Insights from Conformational Ensembles via Machine Learning.通过机器学习从构象集合中获得的分子见解

Biophys J. 2020 Feb 4;118(3):765-780. doi: 10.1016/j.bpj.2019.12.016. Epub 2019 Dec 21.

Machine-learning-based methods to generate conformational ensembles of disordered proteins.基于机器学习的方法生成无序蛋白质的构象集合。

Biophys J. 2024 Jan 2;123(1):101-113. doi: 10.1016/j.bpj.2023.12.001. Epub 2023 Dec 5.

Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象：化学与物理邂逅生物学（瑞士阿斯科纳，2012年6月10日至14日）

Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.

Engineering Aspects of Olfaction嗅觉的工程学方面

Resolving Protein Conformational Plasticity and Substrate Binding via Machine Learning.通过机器学习解析蛋白质构象可塑性与底物结合

J Chem Theory Comput. 2023 May 9;19(9):2644-2657. doi: 10.1021/acs.jctc.2c00932. Epub 2023 Apr 17.

Machine Learning Driven Analysis of Large Scale Simulations Reveals Conformational Characteristics of Ubiquitin Chains.机器学习驱动的大规模模拟分析揭示泛素链的构象特征。

J Chem Theory Comput. 2020 May 12;16(5):3205-3220. doi: 10.1021/acs.jctc.0c00045. Epub 2020 Apr 7.

Predicting protein conformational changes for unbound and homology docking: learning from intrinsic and induced flexibility.预测未结合和同源对接的蛋白质构象变化：从内在和诱导柔性中学习。

Proteins. 2017 Mar;85(3):544-556. doi: 10.1002/prot.25212. Epub 2016 Dec 5.

WASCO: A Wasserstein-based Statistical Tool to Compare Conformational Ensembles of Intrinsically Disordered Proteins.WASCO：一种基于 Wasserstein 的统计工具，用于比较天然无序蛋白质的构象集合。

J Mol Biol. 2023 Jul 15;435(14):168053. doi: 10.1016/j.jmb.2023.168053. Epub 2023 Mar 18.

Exploration of Black Boxes of Supervised Machine Learning Models: A Demonstration on Development of Predictive Heart Risk Score.探索监督机器学习模型的“黑箱”：以开发预测心脏风险评分为例。

Comput Intell Neurosci. 2022 May 12;2022:5475313. doi: 10.1155/2022/5475313. eCollection 2022.

Unraveling dynamic protein structures by two-dimensional infrared spectra with a pretrained machine learning model.利用预先训练的机器学习模型通过二维红外光谱揭示动态蛋白质结构。

Proc Natl Acad Sci U S A. 2024 Jul 2;121(27):e2409257121. doi: 10.1073/pnas.2409257121. Epub 2024 Jun 25.

引用本文的文献

Special Issue: "Advanced Research on Molecular Modeling of Protein Structure and Functions".特刊：“蛋白质结构与功能的分子建模高级研究”。

Int J Mol Sci. 2025 Aug 16;26(16):7916. doi: 10.3390/ijms26167916.

Using Machine Learning to Analyze Molecular Dynamics Simulations of Biomolecules.利用机器学习分析生物分子的分子动力学模拟

J Phys Chem B. 2025 Jun 5;129(22):5375-5385. doi: 10.1021/acs.jpcb.4c08824. Epub 2025 May 27.

Machine Learning of Molecular Dynamics Simulations Provides Insights into the Modulation of Viral Capsid Assembly.分子动力学模拟的机器学习为病毒衣壳组装的调控提供了见解。

J Chem Inf Model. 2025 May 26;65(10):4844-4853. doi: 10.1021/acs.jcim.5c00274. Epub 2025 May 8.

A beginner's approach to deep learning applied to VS and MD techniques.深度学习应用于VS和MD技术的初学者方法。

J Cheminform. 2025 Apr 8;17(1):47. doi: 10.1186/s13321-025-00985-7.

Molecular dynamics and machine learning stratify motion-dependent activity profiles of S-layer destabilizing nanobodies.分子动力学和机器学习对S层去稳定纳米抗体的运动依赖性活性谱进行分层。

PNAS Nexus. 2024 Nov 26;3(12):pgae538. doi: 10.1093/pnasnexus/pgae538. eCollection 2024 Dec.

Molecular Dynamics Reveals Altered Interactions between Belzutifan and HIF-2 with Natural Variant G323E or Proximal Phosphorylation at T324.分子动力学揭示了belzutifan与具有天然变体G323E或T324近端磷酸化的HIF-2之间相互作用的改变。

ACS Omega. 2024 Aug 26;9(36):37843-37855. doi: 10.1021/acsomega.4c03777. eCollection 2024 Sep 10.

Thermodynamics-inspired explanations of artificial intelligence.热力学启发的人工智能解释。

Nat Commun. 2024 Sep 9;15(1):7859. doi: 10.1038/s41467-024-51970-x.

Mechanistic insights into P-glycoprotein ligand transport and inhibition revealed by enhanced molecular dynamics simulations.通过增强分子动力学模拟揭示的P-糖蛋白配体转运与抑制的机制性见解。

Comput Struct Biotechnol J. 2024 Jun 13;23:2548-2564. doi: 10.1016/j.csbj.2024.06.010. eCollection 2024 Dec.

Binding to nucleosome poises human SIRT6 for histone H3 deacetylation.与核小体结合使人类 SIRT6 为组蛋白 H3 去乙酰化做好准备。

Elife. 2024 Feb 28;12:RP87989. doi: 10.7554/eLife.87989.

Toward physics-based precision medicine: Exploiting protein dynamics to design new therapeutics and interpret variants.迈向基于物理学的精准医学：利用蛋白质动力学设计新的治疗方法和解释变体。

Protein Sci. 2024 Mar;33(3):e4902. doi: 10.1002/pro.4902.

本文引用的文献

Energy Landscapes Reveal Agonist Control of G Protein-Coupled Receptor Activation via Microswitches.能量景观揭示激动剂通过微转换控制 G 蛋白偶联受体的激活。

Biochemistry. 2020 Feb 25;59(7):880-891. doi: 10.1021/acs.biochem.9b00842. Epub 2020 Feb 7.

Past-future information bottleneck for sampling molecular reaction coordinate simultaneously with thermodynamics and kinetics.过去-未来信息瓶颈用于同时采样分子反应坐标的热力学和动力学。

Nat Commun. 2019 Aug 8;10(1):3573. doi: 10.1038/s41467-019-11405-4.

Nonlinear discovery of slow molecular modes using state-free reversible VAMPnets.使用无状态可逆VAMPnets进行慢分子模式的非线性发现。

J Chem Phys. 2019 Jun 7;150(21):214114. doi: 10.1063/1.5092521.

Applications of deep learning for the analysis of medical data.深度学习在医学数据分析中的应用。

Arch Pharm Res. 2019 Jun;42(6):492-504. doi: 10.1007/s12272-019-01162-9. Epub 2019 May 28.

Anncolvar: Approximation of Complex Collective Variables by Artificial Neural Networks for Analysis and Biasing of Molecular Simulations.安科尔瓦尔：通过人工神经网络逼近复杂集体变量以进行分子模拟分析和偏差计算

Front Mol Biosci. 2019 Apr 18;6:25. doi: 10.3389/fmolb.2019.00025. eCollection 2019.

Coupling Molecular Dynamics and Deep Learning to Mine Protein Conformational Space.耦合分子动力学和深度学习挖掘蛋白质构象空间。

Structure. 2019 Jun 4;27(6):1034-1040.e3. doi: 10.1016/j.str.2019.03.018. Epub 2019 Apr 25.

Relative Principal Components Analysis: Application to Analyzing Biomolecular Conformational Changes.相对主成分分析：在分析生物分子构象变化中的应用。

J Chem Theory Comput. 2019 Apr 9;15(4):2166-2178. doi: 10.1021/acs.jctc.8b01074. Epub 2019 Mar 6.

Deep Learning: Current and Emerging Applications in Medicine and Technology.深度学习：医学和技术中的当前和新兴应用。

IEEE J Biomed Health Inform. 2019 May;23(3):906-920. doi: 10.1109/JBHI.2019.2894713. Epub 2019 Jan 23.

An overview of deep learning in medical imaging focusing on MRI.深度学习在医学影像中的概述，重点是 MRI。

Z Med Phys. 2019 May;29(2):102-127. doi: 10.1016/j.zemedi.2018.11.002. Epub 2018 Dec 13.

Toward Achieving Efficient and Accurate Ligand-Protein Unbinding with Deep Learning and Molecular Dynamics through RAVE.通过 RAVE 实现基于深度学习和分子动力学的高效、准确配体-蛋白解吸。

J Chem Theory Comput. 2019 Jan 8;15(1):708-719. doi: 10.1021/acs.jctc.8b00869. Epub 2018 Dec 24.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过机器学习从构象集合中获得的分子见解

Molecular Insights from Conformational Ensembles via Machine Learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献