• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

科学驱动的原子级机器学习。

Science-Driven Atomistic Machine Learning.

机构信息

Fritz-Haber-Institute of the Max-Planck-Society, Faradayweg 4-6, 14195, Berlin, Germany.

出版信息

Angew Chem Int Ed Engl. 2023 Jun 26;62(26):e202219170. doi: 10.1002/anie.202219170. Epub 2023 Apr 13.

DOI:10.1002/anie.202219170
PMID:36896758
Abstract

Machine learning (ML) algorithms are currently emerging as powerful tools in all areas of science. Conventionally, ML is understood as a fundamentally data-driven endeavour. Unfortunately, large well-curated databases are sparse in chemistry. In this contribution, I therefore review science-driven ML approaches which do not rely on "big data", focusing on the atomistic modelling of materials and molecules. In this context, the term science-driven refers to approaches that begin with a scientific question and then ask what training data and model design choices are appropriate. As key features of science-driven ML, the automated and purpose-driven collection of data and the use of chemical and physical priors to achieve high data-efficiency are discussed. Furthermore, the importance of appropriate model evaluation and error estimation is emphasized.

摘要

机器学习(ML)算法目前在各个科学领域崭露头角,成为强大的工具。传统上,ML 被理解为一种完全依赖数据的努力。不幸的是,化学领域的大型、精心策划的数据库却很稀疏。在这篇综述中,我因此回顾了不依赖“大数据”的基于科学的 ML 方法,重点关注材料和分子的原子建模。在这种情况下,“基于科学的”一词是指从科学问题开始,然后询问哪些训练数据和模型设计选择是合适的方法。作为基于科学的 ML 的关键特征,讨论了自动化和有目的的数据收集以及使用化学和物理先验知识来实现高效数据利用。此外,还强调了适当的模型评估和误差估计的重要性。

相似文献

1
Science-Driven Atomistic Machine Learning.科学驱动的原子级机器学习。
Angew Chem Int Ed Engl. 2023 Jun 26;62(26):e202219170. doi: 10.1002/anie.202219170. Epub 2023 Apr 13.
2
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
3
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
4
A Design-to-Device Pipeline for Data-Driven Materials Discovery.数据驱动的材料发现的设计到器件的流水线。
Acc Chem Res. 2020 Mar 17;53(3):599-610. doi: 10.1021/acs.accounts.9b00470. Epub 2020 Feb 25.
5
Applications of Artificial Intelligence and Machine Learning Algorithms to Crystallization.人工智能和机器学习算法在结晶中的应用。
Chem Rev. 2022 Aug 10;122(15):13006-13042. doi: 10.1021/acs.chemrev.2c00141. Epub 2022 Jun 27.
6
Data Science in Chemical Engineering: Applications to Molecular Science.化学工程中的数据科学:在分子科学中的应用。
Annu Rev Chem Biomol Eng. 2021 Jun 7;12:15-37. doi: 10.1146/annurev-chembioeng-101220-102232. Epub 2021 Mar 12.
7
Artificial intelligence in spine care: current applications and future utility.人工智能在脊柱护理中的应用:当前的应用和未来的效用。
Eur Spine J. 2022 Aug;31(8):2057-2081. doi: 10.1007/s00586-022-07176-0. Epub 2022 Mar 27.
8
Machine Learning and Artificial Intelligence: A Paradigm Shift in Big Data-Driven Drug Design and Discovery.机器学习和人工智能:大数据驱动的药物设计与发现的范式转变。
Curr Top Med Chem. 2022;22(20):1692-1727. doi: 10.2174/1568026622666220701091339.
9
How Artificial Intelligence and Machine Learning Is Assisting Us to Extract Meaning from Data on Bone Mechanics?人工智能和机器学习如何帮助我们从骨力学数据中提取信息?
Adv Exp Med Biol. 2022;1356:195-221. doi: 10.1007/978-3-030-87779-8_9.
10
Biomaterialomics: Data science-driven pathways to develop fourth-generation biomaterials.生物材料组学:数据科学驱动的第四代生物材料开发途径
Acta Biomater. 2022 Apr 15;143:1-25. doi: 10.1016/j.actbio.2022.02.027. Epub 2022 Feb 23.

引用本文的文献

1
Artificial Intelligence in Traditional Chinese Medicine: Multimodal Fusion and Machine Learning for Enhanced Diagnosis and Treatment Efficacy.中医中的人工智能:多模态融合与机器学习以提高诊疗效果
Curr Med Sci. 2025 Aug 7. doi: 10.1007/s11596-025-00103-6.
2
Development and Validation of a Machine Learning-Based Online Prognostic Model for Cervical Spondylosis Patients After Anterior Cervical Discectomy and Fusion: A Multicenter Study.基于机器学习的颈椎前路椎间盘切除融合术后颈椎病患者在线预后模型的开发与验证:一项多中心研究
JOR Spine. 2025 Jul 28;8(3):e70090. doi: 10.1002/jsp2.70090. eCollection 2025 Sep.
3
The future of critical care: AI-powered mortality prediction for acute variceal gastrointestinal bleeding and acute non-variceal gastrointestinal bleeding patients.
重症监护的未来:人工智能助力预测急性静脉曲张性胃肠道出血和急性非静脉曲张性胃肠道出血患者的死亡率
Front Med (Lausanne). 2025 May 16;12:1580094. doi: 10.3389/fmed.2025.1580094. eCollection 2025.
4
Beyond Numerical Hessians: Higher-Order Derivatives for Machine Learning Interatomic Potentials via Automatic Differentiation.超越数值海森矩阵:通过自动微分实现机器学习原子间势的高阶导数
J Chem Theory Comput. 2025 May 13;21(9):4742-4752. doi: 10.1021/acs.jctc.4c01790. Epub 2025 Apr 24.
5
Efficient Composite Infrared Spectroscopy: Combining the Double-Harmonic Approximation with Machine Learning Potentials.高效复合红外光谱:将双谐波近似与机器学习势相结合
J Chem Theory Comput. 2024 Dec 24;20(24):10986-11004. doi: 10.1021/acs.jctc.4c01157. Epub 2024 Dec 12.
6
Assessment of fine-tuned large language models for real-world chemistry and material science applications.用于实际化学和材料科学应用的微调大语言模型评估。
Chem Sci. 2024 Nov 22;16(2):670-684. doi: 10.1039/d4sc04401k. eCollection 2025 Jan 2.
7
Prediction model for spinal cord injury in spinal tuberculosis patients using multiple machine learning algorithms: a multicentric study.基于多机器学习算法的脊柱结核患者脊髓损伤预测模型:一项多中心研究。
Sci Rep. 2024 Apr 2;14(1):7691. doi: 10.1038/s41598-024-56711-0.
8
Physics-inspired machine learning of localized intensive properties.基于物理启发的局部强度性质的机器学习
Chem Sci. 2023 Apr 10;14(18):4913-4922. doi: 10.1039/d3sc00841j. eCollection 2023 May 10.