• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

原子模拟中的无监督机器学习:预测与理解之间

Unsupervised machine learning in atomistic simulations, between predictions and understanding.

作者信息

Ceriotti Michele

机构信息

Laboratory of Computational Science and Modeling, Institute des Materiaux, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.

出版信息

J Chem Phys. 2019 Apr 21;150(15):150901. doi: 10.1063/1.5091842.

DOI:10.1063/1.5091842
PMID:31005087
Abstract

Automated analyses of the outcome of a simulation have been an important part of atomistic modeling since the early days, addressing the need of linking the behavior of individual atoms and the collective properties that are usually the final quantity of interest. Methods such as clustering and dimensionality reduction have been used to provide a simplified, coarse-grained representation of the structure and dynamics of complex systems from proteins to nanoparticles. In recent years, the rise of machine learning has led to an even more widespread use of these algorithms in atomistic modeling and to consider different classification and inference techniques as part of a coherent toolbox of data-driven approaches. This perspective briefly reviews some of the unsupervised machine-learning methods-that are geared toward classification and coarse-graining of molecular simulations-seen in relation to the fundamental mathematical concepts that underlie all machine-learning techniques. It discusses the importance of using concise yet complete representations of atomic structures as the starting point of the analyses and highlights the risk of introducing preconceived biases when using machine learning to rationalize and understand structure-property relations. Supervised machine-learning techniques that explicitly attempt to predict the properties of a material given its structure are less susceptible to such biases. Current developments in the field suggest that using these two classes of approaches side-by-side and in a fully integrated mode, while keeping in mind the relations between the data analysis framework and the fundamental physical principles, will be key to realizing the full potential of machine learning to help understand the behavior of complex molecules and materials.

摘要

自早期以来,对模拟结果的自动化分析一直是原子模型的重要组成部分,满足了将单个原子的行为与通常作为最终关注量的集体性质联系起来的需求。诸如聚类和降维等方法已被用于提供从蛋白质到纳米颗粒等复杂系统的结构和动力学的简化、粗粒度表示。近年来,机器学习的兴起导致这些算法在原子模型中得到更广泛的应用,并将不同的分类和推理技术视为数据驱动方法的连贯工具箱的一部分。本观点简要回顾了一些无监督机器学习方法,这些方法旨在对分子模拟进行分类和粗粒度处理,并结合了所有机器学习技术所基于的基本数学概念进行阐述。它讨论了使用简洁而完整的原子结构表示作为分析起点的重要性,并强调了在使用机器学习来合理化和理解结构-性质关系时引入先入为主偏差的风险。明确尝试根据材料结构预测其性质的监督机器学习技术较不易受到此类偏差的影响。该领域的当前发展表明,将这两类方法并行且以完全集成的方式使用,同时牢记数据分析框架与基本物理原理之间的关系,将是充分发挥机器学习潜力以帮助理解复杂分子和材料行为的关键。

相似文献

1
Unsupervised machine learning in atomistic simulations, between predictions and understanding.原子模拟中的无监督机器学习:预测与理解之间
J Chem Phys. 2019 Apr 21;150(15):150901. doi: 10.1063/1.5091842.
2
Addressing uncertainty in atomistic machine learning.解决原子机器学习中的不确定性。
Phys Chem Chem Phys. 2017 May 10;19(18):10978-10985. doi: 10.1039/c7cp00375g.
3
Proceedings of the Second Workshop on Theory meets Industry (Erwin-Schrödinger-Institute (ESI), Vienna, Austria, 12-14 June 2007).第二届理论与产业研讨会会议录(2007年6月12日至14日,奥地利维也纳埃尔温·薛定谔研究所)
J Phys Condens Matter. 2008 Feb 13;20(6):060301. doi: 10.1088/0953-8984/20/06/060301. Epub 2008 Jan 24.
4
Expanding density-correlation machine learning representations for anisotropic coarse-grained particles.为各向异性粗粒化粒子扩展密度关联机器学习表示法。
J Chem Phys. 2024 Aug 21;161(7). doi: 10.1063/5.0210910.
5
Atom-density representations for machine learning.用于机器学习的原子密度表示。
J Chem Phys. 2019 Apr 21;150(15):154110. doi: 10.1063/1.5090481.
6
Creating Machine Learning-Driven Material Recipes Based on Crystal Structure.基于晶体结构创建机器学习驱动的材料配方。
J Phys Chem Lett. 2019 Jan 17;10(2):283-288. doi: 10.1021/acs.jpclett.8b03527. Epub 2019 Jan 8.
7
Unlocking the Potential of Clustering and Classification Approaches: Navigating Supervised and Unsupervised Chemical Similarity.解锁聚类和分类方法的潜力:探索有监督和无监督的化学相似性。
Environ Health Perspect. 2024 Aug;132(8):85002. doi: 10.1289/EHP14001. Epub 2024 Aug 6.
8
Introduction to machine learning.机器学习导论
Methods Mol Biol. 2014;1107:105-28. doi: 10.1007/978-1-62703-748-8_7.
9
Integrating Machine Learning in the Coarse-Grained Molecular Simulation of Polymers.机器学习在聚合物粗粒度分子模拟中的集成
J Phys Chem B. 2023 Mar 23;127(11):2302-2322. doi: 10.1021/acs.jpcb.2c06354. Epub 2023 Mar 8.
10
Machine learnt bond order potential to model metal-organic (Co-C) heterostructures.机器学习键级势模型用于模拟金属有机(Co-C)杂化结构。
Nanoscale. 2017 Nov 30;9(46):18229-18239. doi: 10.1039/c7nr06038f.

引用本文的文献

1
NeuralTSNE: A Python Package for the Dimensionality Reduction of Molecular Dynamics Data Using Neural Networks.NeuralTSNE:一个使用神经网络对分子动力学数据进行降维的Python软件包。
J Chem Inf Model. 2025 Jul 28;65(14):7347-7351. doi: 10.1021/acs.jcim.5c01107. Epub 2025 Jul 14.
2
Automating the Analysis of Substrate Reactivity through Environment Interaction Mapping.通过环境相互作用映射实现底物反应性分析的自动化
J Chem Inf Model. 2025 Jun 9;65(11):5395-5410. doi: 10.1021/acs.jcim.5c00474. Epub 2025 May 28.
3
Understanding Defects in Amorphous Silicon with Million-Atom Simulations and Machine Learning.
通过百万原子模拟和机器学习理解非晶硅中的缺陷
Angew Chem Int Ed Engl. 2024 May 27;63(22):e202403842. doi: 10.1002/anie.202403842. Epub 2024 Apr 18.
4
Structures, Electronic, and Magnetic Properties of CoK ( = 2-12) Clusters: A Particle Swarm Optimization Prediction Jointed with First-Principles Investigation.CoK(=2-12)团簇的结构、电子和磁性性质:结合第一性原理研究的粒子群优化预测
Nanomaterials (Basel). 2023 Jul 25;13(15):2155. doi: 10.3390/nano13152155.
5
Spectral Map: Embedding Slow Kinetics in Collective Variables.谱图:在集体变量中嵌入慢动力学。
J Phys Chem Lett. 2023 Jun 8;14(22):5216-5220. doi: 10.1021/acs.jpclett.3c01101. Epub 2023 Jun 1.
6
Visualizing the Residue Interaction Landscape of Proteins by Temporal Network Embedding.通过时间网络嵌入可视化蛋白质的残基相互作用景观。
J Chem Theory Comput. 2023 May 23;19(10):2985-2995. doi: 10.1021/acs.jctc.2c01228. Epub 2023 Apr 25.
7
Algorithmic Graph Theory, Reinforcement Learning and Game Theory in MD Simulations: From 3D Structures to Topological 2D-Molecular Graphs (2D-MolGraphs) and Vice Versa.MD 模拟中的算法图论、强化学习和博弈论:从 3D 结构到拓扑 2D-分子图(2D-MolGraphs),反之亦然。
Molecules. 2023 Mar 23;28(7):2892. doi: 10.3390/molecules28072892.
8
Computational Design of Peptides for Biomaterials Applications.用于生物材料应用的肽的计算设计。
ACS Appl Bio Mater. 2024 Feb 19;7(2):617-625. doi: 10.1021/acsabm.2c01023. Epub 2023 Mar 27.
9
Neural network potentials for chemistry: concepts, applications and prospects.化学中的神经网络势:概念、应用与展望。
Digit Discov. 2022 Dec 21;2(1):28-58. doi: 10.1039/d2dd00102k. eCollection 2023 Feb 13.
10
Driving and characterizing nucleation of urea and glycine polymorphs in water.在水中驱动和描述尿素和甘氨酸多晶型物的成核。
Proc Natl Acad Sci U S A. 2023 Feb 14;120(7):e2216099120. doi: 10.1073/pnas.2216099120. Epub 2023 Feb 9.