利用深度学习对分子光学峰进行多保真度预测。

Multi-fidelity prediction of molecular optical peaks with deep learning.

作者信息

Greenman Kevin P, Green William H, Gómez-Bombarelli Rafael

机构信息

Department of Chemical Engineering, Massachusetts Institute of Technology 77 Massachusetts Ave Cambridge MA 02139 USA.

Department of Materials Science and Engineering, Massachusetts Institute of Technology 77 Massachusetts Ave Cambridge MA 02139 USA

出版信息

Chem Sci. 2022 Jan 4;13(4):1152-1162. doi: 10.1039/d1sc05677h. eCollection 2022 Jan 26.

DOI:10.1039/d1sc05677h

PMID:35211282

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8790778/

Abstract

Optical properties are central to molecular design for many applications, including solar cells and biomedical imaging. A variety of and statistical methods have been developed for their prediction, each with a trade-off between accuracy, generality, and cost. Existing theoretical methods such as time-dependent density functional theory (TD-DFT) are generalizable across chemical space because of their robust physics-based foundations but still exhibit random and systematic errors with respect to experiment despite their high computational cost. Statistical methods can achieve high accuracy at a lower cost, but data sparsity and unoptimized molecule and solvent representations often limit their ability to generalize. Here, we utilize directed message passing neural networks (D-MPNNs) to represent both dye molecules and solvents for predictions of molecular absorption peaks in solution. Additionally, we demonstrate a multi-fidelity approach based on an auxiliary model trained on over 28 000 TD-DFT calculations that further improves accuracy and generalizability, as shown through rigorous splitting strategies. Combining several openly-available experimental datasets, we benchmark these methods against a state-of-the-art regression tree algorithm and compare the D-MPNN solvent representation to several alternatives. Finally, we explore the interpretability of the learned representations using dimensionality reduction and evaluate the use of ensemble variance as an estimator of the epistemic uncertainty in our predictions of molecular peak absorption in solution. The prediction methods proposed herein can be integrated with active learning, generative modeling, and experimental workflows to enable the more rapid design of molecules with targeted optical properties.

摘要

光学性质对于包括太阳能电池和生物医学成像在内的许多应用中的分子设计至关重要。已经开发了各种方法和统计方法来进行预测，每种方法在准确性、通用性和成本之间都存在权衡。现有的理论方法，如含时密度泛函理论（TD-DFT），由于其基于物理的坚实基础，可在化学空间中通用，但尽管计算成本高昂，相对于实验仍表现出随机和系统误差。统计方法可以以较低成本实现高精度，但数据稀疏以及分子和溶剂表示未优化常常限制了它们的泛化能力。在这里，我们利用定向消息传递神经网络（D-MPNN）来表示染料分子和溶剂，以预测溶液中的分子吸收峰。此外，我们展示了一种基于在超过28000次TD-DFT计算上训练的辅助模型的多保真方法，通过严格的拆分策略进一步提高了准确性和泛化能力。结合几个公开可用的实验数据集，我们将这些方法与一种先进的回归树算法进行基准测试，并将D-MPNN溶剂表示与几种替代方法进行比较。最后，我们使用降维探索所学表示的可解释性，并评估使用总体方差作为我们对溶液中分子峰值吸收预测中认知不确定性的估计器。本文提出的预测方法可以与主动学习、生成建模和实验工作流程相结合，以实现具有目标光学性质的分子的更快速设计。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e67/8790778/df91d486eaf0/d1sc05677h-f1.jpg

相似文献

Multi-fidelity prediction of molecular optical peaks with deep learning.

Chem Sci. 2022 Jan 4;13(4):1152-1162. doi: 10.1039/d1sc05677h. eCollection 2022 Jan 26.

Automatic Prediction of Peak Optical Absorption Wavelengths in Molecules Using Convolutional Neural Networks.

J Chem Inf Model. 2024 Mar 11;64(5):1486-1501. doi: 10.1021/acs.jcim.3c01792. Epub 2024 Feb 29.

ABT-MPNN: an atom-bond transformer-based message-passing neural network for molecular property prediction.

J Cheminform. 2023 Feb 26;15(1):29. doi: 10.1186/s13321-023-00698-9.

When Do Quantum Mechanical Descriptors Help Graph Neural Networks to Predict Chemical Properties?

J Am Chem Soc. 2024 Aug 21;146(33):23103-23120. doi: 10.1021/jacs.4c04670. Epub 2024 Aug 6.

Message-passing neural networks for high-throughput polymer screening.

J Chem Phys. 2019 Jun 21;150(23):234111. doi: 10.1063/1.5099132.

Prediction of Frequency-Dependent Optical Spectrum for Solid Materials: A Multioutput and Multifidelity Machine Learning Approach.

ACS Appl Mater Interfaces. 2024 Aug 7;16(31):41145-41156. doi: 10.1021/acsami.4c07328. Epub 2024 Jul 24.

Chemprop: A Machine Learning Package for Chemical Property Prediction.

J Chem Inf Model. 2024 Jan 8;64(1):9-17. doi: 10.1021/acs.jcim.3c01250. Epub 2023 Dec 26.

Proceedings of the Second Workshop on Theory meets Industry (Erwin-Schrödinger-Institute (ESI), Vienna, Austria, 12-14 June 2007).

J Phys Condens Matter. 2008 Feb 13;20(6):060301. doi: 10.1088/0953-8984/20/06/060301. Epub 2008 Jan 24.

Enhancing geometric representations for molecules with equivariant vector-scalar interactive message passing.

Nat Commun. 2024 Jan 5;15(1):313. doi: 10.1038/s41467-023-43720-2.

Evidential Deep Learning for Guided Molecular Property Prediction and Discovery.

ACS Cent Sci. 2021 Aug 25;7(8):1356-1367. doi: 10.1021/acscentsci.1c00546. Epub 2021 Jul 27.

引用本文的文献

Multi-fidelity graph neural networks for predicting toluene/water partition coefficients.

J Cheminform. 2025 Aug 8;17(1):123. doi: 10.1186/s13321-025-01057-6.

Exploring Optimized Organic Fluorophore Search through Experimental Data-Driven Adaptive β‑VAE.

JACS Au. 2025 Jun 30;5(7):3082-3091. doi: 10.1021/jacsau.5c00052. eCollection 2025 Jul 28.

PAH101: A GW+BSE Dataset of 101 Polycyclic Aromatic Hydrocarbon (PAH) Molecular Crystals.

Sci Data. 2025 Apr 23;12(1):679. doi: 10.1038/s41597-025-04959-0.

Enhancing Activation Energy Predictions under Data Constraints Using Graph Neural Networks.

J Chem Inf Model. 2025 Feb 10;65(3):1367-1377. doi: 10.1021/acs.jcim.4c02319. Epub 2025 Jan 25.

Automatic Prediction of Molecular Properties Using Substructure Vector Embeddings within a Feature Selection Workflow.

J Chem Inf Model. 2025 Jan 13;65(1):133-152. doi: 10.1021/acs.jcim.4c01862. Epub 2024 Dec 23.

Enhancing chemistry-intuitive feature learning to improve prediction performance of optical properties.

Chem Sci. 2024 Sep 26;15(42):17533-46. doi: 10.1039/d4sc02781g.

Differentiable modeling and optimization of non-aqueous Li-based battery electrolyte solutions using geometric deep learning.

Nat Commun. 2024 Oct 5;15(1):8649. doi: 10.1038/s41467-024-51653-7.

ADMETlab 3.0: an updated comprehensive online ADMET prediction platform enhanced with broader coverage, improved performance, API functionality and decision support.

Nucleic Acids Res. 2024 Jul 5;52(W1):W422-W431. doi: 10.1093/nar/gkae236.

Automatic Prediction of Peak Optical Absorption Wavelengths in Molecules Using Convolutional Neural Networks.

J Chem Inf Model. 2024 Mar 11;64(5):1486-1501. doi: 10.1021/acs.jcim.3c01792. Epub 2024 Feb 29.

Chemprop: A Machine Learning Package for Chemical Property Prediction.

J Chem Inf Model. 2024 Jan 8;64(1):9-17. doi: 10.1021/acs.jcim.3c01250. Epub 2023 Dec 26.

本文引用的文献

Learning properties of ordered and disordered materials from multi-fidelity data.

Nat Comput Sci. 2021 Jan;1(1):46-53. doi: 10.1038/s43588-020-00002-x. Epub 2021 Jan 14.

Molecular excited states through a machine learning lens.

Nat Rev Chem. 2021 Jun;5(6):388-405. doi: 10.1038/s41570-021-00278-1. Epub 2021 May 20.

Deep Learning Optical Spectroscopy Based on Experimental Database: Potential Applications to Molecular Design.

JACS Au. 2021 Mar 17;1(4):427-438. doi: 10.1021/jacsau.1c00035. eCollection 2021 Apr 26.

Ab Initio Machine Learning in Chemical Compound Space.

Chem Rev. 2021 Aug 25;121(16):10001-10036. doi: 10.1021/acs.chemrev.0c01303. Epub 2021 Aug 13.

Assigning confidence to molecular property prediction.

Expert Opin Drug Discov. 2021 Sep;16(9):1009-1023. doi: 10.1080/17460441.2021.1925247. Epub 2021 Jun 15.

UV/Vis photochemistry database: Structure, content and applications.

J Quant Spectrosc Radiat Transf. 2020 Sep 1;253. doi: 10.1016/j.jqsrt.2020.107056.

Best practices in machine learning for chemistry.

Nat Chem. 2021 Jun;13(6):505-508. doi: 10.1038/s41557-021-00716-z.

Machine Learning Enables Highly Accurate Predictions of Photophysical Properties of Organic Fluorescent Materials: Emission Wavelengths and Quantum Yields.

J Chem Inf Model. 2021 Mar 22;61(3):1053-1065. doi: 10.1021/acs.jcim.0c01203. Epub 2021 Feb 23.

Machine Learning for Electronically Excited States of Molecules.

Chem Rev. 2021 Aug 25;121(16):9873-9926. doi: 10.1021/acs.chemrev.0c00749. Epub 2020 Nov 19.

QM-symex, update of the QM-sym database with excited state information for 173 kilo molecules.

Sci Data. 2020 Nov 18;7(1):400. doi: 10.1038/s41597-020-00746-1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用深度学习对分子光学峰进行多保真度预测。

Multi-fidelity prediction of molecular optical peaks with deep learning.

作者信息

Greenman Kevin P, Green William H, Gómez-Bombarelli Rafael

机构信息

Department of Chemical Engineering, Massachusetts Institute of Technology 77 Massachusetts Ave Cambridge MA 02139 USA.

Department of Materials Science and Engineering, Massachusetts Institute of Technology 77 Massachusetts Ave Cambridge MA 02139 USA

出版信息

Chem Sci. 2022 Jan 4;13(4):1152-1162. doi: 10.1039/d1sc05677h. eCollection 2022 Jan 26.

DOI:10.1039/d1sc05677h

PMID:35211282

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8790778/

Abstract

摘要

利用深度学习对分子光学峰进行多保真度预测。

Multi-fidelity prediction of molecular optical peaks with deep learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

利用深度学习对分子光学峰进行多保真度预测。

Multi-fidelity prediction of molecular optical peaks with deep learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献