• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用卷积神经网络自动预测分子中的峰值光吸收波长

Automatic Prediction of Peak Optical Absorption Wavelengths in Molecules Using Convolutional Neural Networks.

作者信息

Jung Son Gyo, Jung Guwon, Cole Jacqueline M

机构信息

Cavendish Laboratory, Department of Physics, University of Cambridge, J. J. Thomson Avenue, Cambridge CB3 0HE, U.K.

ISIS Neutron and Muon Source, STFC Rutherford Appleton Laboratory, Harwell Science and Innovation Campus, Didcot, Oxfordshire OX11 0QX, U.K.

出版信息

J Chem Inf Model. 2024 Mar 11;64(5):1486-1501. doi: 10.1021/acs.jcim.3c01792. Epub 2024 Feb 29.

DOI:10.1021/acs.jcim.3c01792
PMID:38422386
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10934802/
Abstract

Molecular design depends heavily on optical properties for applications such as solar cells and polymer-based batteries. Accurate prediction of these properties is essential, and multiple predictive methods exist, from to data-driven techniques. Although theoretical methods, such as time-dependent density functional theory (TD-DFT) calculations, have well-established physical relevance and are among the most popular methods in computational physics and chemistry, they exhibit errors that are inherent in their approximate nature. These high-throughput electronic structure calculations also incur a substantial computational cost. With the emergence of big-data initiatives, cost-effective, data-driven methods have gained traction, although their usability is highly contingent on the degree of data quality and sparsity. In this study, we present a workflow that employs deep residual convolutional neural networks (DR-CNN) and gradient boosting feature selection to predict peak optical absorption wavelengths (λ) exclusively from SMILES representations of dye molecules and solvents; one would normally measure λ using UV-vis absorption spectroscopy. We use a multifidelity modeling approach, integrating 34,893 DFT calculations and 26,395 experimentally derived λ data, to deliver more accurate predictions via a Bayesian-optimized gradient boosting machine. Our approach is benchmarked against the state of the art that is reported in the scientific literature; results demonstrate that learnt representations via a DR-CNN workflow that is integrated with other machine learning methods can accelerate the design of molecules for specific optical characteristics.

摘要

分子设计在很大程度上依赖于光学特性,以用于太阳能电池和聚合物基电池等应用。准确预测这些特性至关重要,并且存在多种预测方法,从[具体方法未提及]到数据驱动技术。尽管理论方法,如含时密度泛函理论(TD-DFT)计算,具有公认的物理相关性,并且是计算物理和化学中最流行的方法之一,但它们存在近似性质所固有的误差。这些高通量电子结构计算还会产生巨大的计算成本。随着大数据计划的出现,具有成本效益的数据驱动方法受到了关注,尽管它们的可用性高度取决于数据质量和稀疏程度。在本研究中,我们提出了一种工作流程,该流程采用深度残差卷积神经网络(DR-CNN)和梯度提升特征选择,仅从染料分子和溶剂的SMILES表示中预测峰值光吸收波长(λ);通常会使用紫外可见吸收光谱法测量λ。我们使用多保真度建模方法,整合34,893次DFT计算和26,395个实验得出的λ数据,通过贝叶斯优化的梯度提升机进行更准确的预测。我们的方法以科学文献中报道的现有技术为基准;结果表明,通过与其他机器学习方法集成的DR-CNN工作流程学习到的表示可以加速针对特定光学特性的分子设计。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/bf5890902135/ci3c01792_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/b31c7356a747/ci3c01792_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/60129e8a0cb4/ci3c01792_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/c649f9c9956d/ci3c01792_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/12d06b00846e/ci3c01792_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/5e701abc6a9a/ci3c01792_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/d478ee56c34b/ci3c01792_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/e69f54fe389d/ci3c01792_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/bf5890902135/ci3c01792_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/b31c7356a747/ci3c01792_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/60129e8a0cb4/ci3c01792_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/c649f9c9956d/ci3c01792_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/12d06b00846e/ci3c01792_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/5e701abc6a9a/ci3c01792_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/d478ee56c34b/ci3c01792_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/e69f54fe389d/ci3c01792_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/10934802/bf5890902135/ci3c01792_0008.jpg

相似文献

1
Automatic Prediction of Peak Optical Absorption Wavelengths in Molecules Using Convolutional Neural Networks.使用卷积神经网络自动预测分子中的峰值光吸收波长
J Chem Inf Model. 2024 Mar 11;64(5):1486-1501. doi: 10.1021/acs.jcim.3c01792. Epub 2024 Feb 29.
2
Multi-fidelity prediction of molecular optical peaks with deep learning.利用深度学习对分子光学峰进行多保真度预测。
Chem Sci. 2022 Jan 4;13(4):1152-1162. doi: 10.1039/d1sc05677h. eCollection 2022 Jan 26.
3
Automatic Prediction of Band Gaps of Inorganic Materials Using a Gradient Boosted and Statistical Feature Selection Workflow.使用梯度提升和统计特征选择工作流程自动预测无机材料的能带隙。
J Chem Inf Model. 2024 Feb 26;64(4):1187-1200. doi: 10.1021/acs.jcim.3c01897. Epub 2024 Feb 6.
4
Uncertainty-Quantified Hybrid Machine Learning/Density Functional Theory High Throughput Screening Method for Crystals.不确定性量化的混合机器学习/密度泛函理论高通量筛选晶体方法。
J Chem Inf Model. 2020 Apr 27;60(4):1996-2003. doi: 10.1021/acs.jcim.0c00003. Epub 2020 Apr 6.
5
Prediction of Frequency-Dependent Optical Spectrum for Solid Materials: A Multioutput and Multifidelity Machine Learning Approach.固体材料频率相关光谱的预测:一种多输出和多保真度机器学习方法。
ACS Appl Mater Interfaces. 2024 Aug 7;16(31):41145-41156. doi: 10.1021/acsami.4c07328. Epub 2024 Jul 24.
6
Synergistic Approach of Ultrafast Spectroscopy and Molecular Simulations in the Characterization of Intramolecular Charge Transfer in Push-Pull Molecules.超快光谱与分子模拟相结合的方法用于表征推拉分子中的分子内电荷转移
Molecules. 2020 Jan 20;25(2):430. doi: 10.3390/molecules25020430.
7
Toward Predicting Intermetallics Surface Properties with High-Throughput DFT and Convolutional Neural Networks.通过高通量 DFT 和卷积神经网络预测金属间化合物表面性质。
J Chem Inf Model. 2019 Nov 25;59(11):4742-4749. doi: 10.1021/acs.jcim.9b00550. Epub 2019 Nov 5.
8
Machine Learning Enables Highly Accurate Predictions of Photophysical Properties of Organic Fluorescent Materials: Emission Wavelengths and Quantum Yields.机器学习能够高度准确地预测有机荧光材料的光物理性质:发射波长和量子产率。
J Chem Inf Model. 2021 Mar 22;61(3):1053-1065. doi: 10.1021/acs.jcim.0c01203. Epub 2021 Feb 23.
9
QSAR modeling without descriptors using graph convolutional neural networks: the case of mutagenicity prediction.使用图卷积神经网络的无描述符定量构效关系建模:以致突变性预测为例
Mol Divers. 2021 Aug;25(3):1283-1299. doi: 10.1007/s11030-021-10250-2. Epub 2021 Jun 19.
10
Excitations, optical absorption spectra, and optical excitonic gaps of heterofullerenes. I. C60, C59N+, and C48N12: theory and experiment.异质富勒烯的激发、光吸收光谱和光学激子能隙。I. C60、C59N+和C48N12:理论与实验
J Chem Phys. 2004 Mar 15;120(11):5133-47. doi: 10.1063/1.1647532.

引用本文的文献

1
Deep learning for property prediction of natural fiber polymer composites.用于天然纤维聚合物复合材料性能预测的深度学习
Sci Rep. 2025 Jul 30;15(1):27837. doi: 10.1038/s41598-025-10841-1.
2
Automatic Prediction of Molecular Properties Using Substructure Vector Embeddings within a Feature Selection Workflow.在特征选择工作流程中使用子结构向量嵌入自动预测分子性质
J Chem Inf Model. 2025 Jan 13;65(1):133-152. doi: 10.1021/acs.jcim.4c01862. Epub 2024 Dec 23.
3
Enhancing chemistry-intuitive feature learning to improve prediction performance of optical properties.

本文引用的文献

1
Automatic Prediction of Band Gaps of Inorganic Materials Using a Gradient Boosted and Statistical Feature Selection Workflow.使用梯度提升和统计特征选择工作流程自动预测无机材料的能带隙。
J Chem Inf Model. 2024 Feb 26;64(4):1187-1200. doi: 10.1021/acs.jcim.3c01897. Epub 2024 Feb 6.
2
Learning properties of ordered and disordered materials from multi-fidelity data.从多保真度数据中学习有序和无序材料的特性。
Nat Comput Sci. 2021 Jan;1(1):46-53. doi: 10.1038/s43588-020-00002-x. Epub 2021 Jan 14.
3
Gradient boosted and statistical feature selection workflow for materials property predictions.
增强化学直观特征学习以提高光学性质的预测性能。
Chem Sci. 2024 Sep 26;15(42):17533-46. doi: 10.1039/d4sc02781g.
4
Predictive Modeling of High-Entropy Alloys and Amorphous Metallic Alloys Using Machine Learning.使用机器学习对高熵合金和非晶态金属合金进行预测建模。
J Chem Inf Model. 2024 Oct 14;64(19):7313-7336. doi: 10.1021/acs.jcim.4c00873. Epub 2024 Oct 1.
5
Machine-Learning Predictions of Critical Temperatures from Chemical Compositions of Superconductors.机器学习预测超导材料化学成分的临界温度。
J Chem Inf Model. 2024 Oct 14;64(19):7349-7375. doi: 10.1021/acs.jcim.4c01137. Epub 2024 Sep 17.
6
Machine-Learning Prediction of Curie Temperature from Chemical Compositions of Ferromagnetic Materials.机器学习预测铁磁材料化学成分的居里温度。
J Chem Inf Model. 2024 Aug 26;64(16):6388-6409. doi: 10.1021/acs.jcim.4c00947. Epub 2024 Aug 7.
7
Automatic Prediction of Band Gaps of Inorganic Materials Using a Gradient Boosted and Statistical Feature Selection Workflow.使用梯度提升和统计特征选择工作流程自动预测无机材料的能带隙。
J Chem Inf Model. 2024 Feb 26;64(4):1187-1200. doi: 10.1021/acs.jcim.3c01897. Epub 2024 Feb 6.
用于材料性能预测的梯度提升和统计特征选择工作流程。
J Chem Phys. 2023 Nov 21;159(19). doi: 10.1063/5.0171540.
4
Molecular excited states through a machine learning lens.机器学习视角下的分子激发态
Nat Rev Chem. 2021 Jun;5(6):388-405. doi: 10.1038/s41570-021-00278-1. Epub 2021 May 20.
5
Automatic materials characterization from infrared spectra using convolutional neural networks.使用卷积神经网络从红外光谱中进行自动材料表征。
Chem Sci. 2023 Feb 23;14(13):3600-3609. doi: 10.1039/d2sc05892h. eCollection 2023 Mar 29.
6
Clustering a database of optically absorbing organic molecules via a hierarchical fingerprint scheme that categorizes similar functional molecular fragments.通过一种对相似功能分子片段进行分类的分层指纹方案,对光吸收有机分子数据库进行聚类。
J Chem Phys. 2022 Apr 21;156(15):154110. doi: 10.1063/5.0087603.
7
Multi-fidelity prediction of molecular optical peaks with deep learning.利用深度学习对分子光学峰进行多保真度预测。
Chem Sci. 2022 Jan 4;13(4):1152-1162. doi: 10.1039/d1sc05677h. eCollection 2022 Jan 26.
8
Organic materials repurposing, a data set for theoretical predictions of new applications for existing compounds.有机材料再利用,一个用于对现有化合物新应用进行理论预测的数据集。
Sci Data. 2022 Feb 14;9(1):54. doi: 10.1038/s41597-022-01142-7.
9
Reversible Electrochemical Charging of n-Type Conjugated Polymer Electrodes in Aqueous Electrolytes.n型共轭聚合物电极在水性电解质中的可逆电化学充电
J Am Chem Soc. 2021 Sep 15;143(36):14795-14805. doi: 10.1021/jacs.1c06713. Epub 2021 Sep 1.
10
Deep Learning Optical Spectroscopy Based on Experimental Database: Potential Applications to Molecular Design.基于实验数据库的深度学习光学光谱学:在分子设计中的潜在应用
JACS Au. 2021 Mar 17;1(4):427-438. doi: 10.1021/jacsau.1c00035. eCollection 2021 Apr 26.