Machine Learning Methods to Predict Density Functional Theory B3LYP Energies of HOMO and LUMO Orbitals.

Author information

Pereira Florbela, Xiao Kaixia, Latino Diogo A R S, Wu Chengcheng, Zhang Qingyou, Aires-de-Sousa Joao

Affiliations

LAQV and REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, 2829-516 Caparica, Portugal.

Henan Engineering Research Center of Industrial Circulating Water Treatment, College of Chemistry and Chemical Engineering, Henan University, Kaifeng, 475004, PR China.

Publication information

J Chem Inf Model. 2017 Jan 23;57(1):11-21. doi: 10.1021/acs.jcim.6b00340. Epub 2016 Dec 29.

DOI:10.1021/acs.jcim.6b00340

PMID:28033004

Abstract

Machine learning algorithms were explored for the fast estimation of HOMO and LUMO orbital energies calculated by DFT B3LYP, using molecular descriptors based exclusively on connectivity. The project involved the retrieval and generation of molecular structures, quantum chemical calculations for a database of >111,000 structures, development of new molecular descriptors, and training/validation of machine learning models. Several machine learning algorithms were screened, and an applicability domain was defined based on Euclidean distances to the training set. Random forest models predicted an external test set of 9989 compounds with mean absolute errors (MAE) as low as 0.15 and 0.16 eV for the HOMO and LUMO orbitals, respectively. The impact of the quantum chemical calculation protocol was assessed with a subset of compounds. Including the orbital energy calculated by PM7 as an additional descriptor significantly improved the quality of the estimates (reducing the MAE by >30%).
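The applicability-domain idea mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact rule, so the nearest-neighbor criterion, the `threshold` value, the function names, and the toy descriptor vectors and energies below are all assumptions made for illustration, not the authors' protocol.

```python
import math

def nearest_train_distance(x, train):
    """Smallest Euclidean distance from descriptor vector x to any training vector."""
    return min(math.dist(x, t) for t in train)

def in_domain(x, train, threshold):
    """Assumed rule: a compound is inside the domain if its nearest
    training neighbor lies within `threshold` (hypothetical cutoff)."""
    return nearest_train_distance(x, train) <= threshold

def mean_absolute_error(y_true, y_pred):
    """MAE in the same units as the inputs (eV here)."""
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)

# Toy two-dimensional descriptors; real connectivity descriptors have many more dimensions.
train_X = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
test_X = [(0.1, 0.1), (3.0, 4.0)]

# Made-up reference and predicted HOMO energies (eV) for the two test compounds.
y_true = [-6.10, -5.40]
y_pred = [-6.00, -5.90]
threshold = 0.5  # hypothetical distance cutoff

mask = [in_domain(x, train_X, threshold) for x in test_X]
mae_all = mean_absolute_error(y_true, y_pred)

# Recompute the MAE using only compounds inside the applicability domain.
inside = [(t, p) for t, p, m in zip(y_true, y_pred, mask) if m]
mae_inside = mean_absolute_error(*zip(*inside))

print(mask, round(mae_all, 3), round(mae_inside, 3))
```

In this toy case the second test compound is far from every training point, so it falls outside the domain, and the MAE over the in-domain subset is lower than the MAE over the full test set.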

Similar articles

Mapping the frontier orbital energies of imidazolium-based cations using machine learning.

J Chem Phys. 2023 Aug 14;159(6). doi: 10.1063/5.0155775.

Predictive Models for HOMO and LUMO Energies of N-Donor Heterocycles as Ligands for Lanthanides Separation.

Mol Inform. 2018 Nov;37(11):e1800025. doi: 10.1002/minf.201800025. Epub 2018 Jul 4.

Machine learning for the prediction of molecular dipole moments obtained by density functional theory.

J Cheminform. 2018 Aug 22;10(1):43. doi: 10.1186/s13321-018-0296-5.

Comparison of DFT methods for molecular orbital eigenvalue calculations.

J Phys Chem A. 2007 Mar 1;111(8):1554-61. doi: 10.1021/jp061633o. Epub 2007 Feb 6.

Relating molecular descriptors to frontier orbital energy levels, singlet and triplet excited states of fused tricyclics using machine learning.

J Mol Graph Model. 2021 Jun;105:107891. doi: 10.1016/j.jmgm.2021.107891. Epub 2021 Mar 13.

Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error.

J Chem Theory Comput. 2017 Nov 14;13(11):5255-5264. doi: 10.1021/acs.jctc.7b00577. Epub 2017 Oct 10.

A big data approach to the ultra-fast prediction of DFT-calculated bond energies.

J Cheminform. 2013 Jul 12;5:34. doi: 10.1186/1758-2946-5-34. eCollection 2013.

Externally predictive quantitative modeling of supercooled liquid vapor pressure of polychlorinated-naphthalenes through electron-correlation based quantum-mechanical descriptors.

Chemosphere. 2014 Jan;95:448-54. doi: 10.1016/j.chemosphere.2013.09.093. Epub 2013 Oct 26.

SAMPL6 logP challenge: machine learning and quantum mechanical approaches.

J Comput Aided Mol Des. 2020 May;34(5):495-510. doi: 10.1007/s10822-020-00287-0. Epub 2020 Jan 30.

Cited by

Introducing Virtual Points in Equivariant Networks by Extending Atom Representation for Effective Prediction.

J Chem Theory Comput. 2025 Sep 9;21(17):8468-8477. doi: 10.1021/acs.jctc.5c00701. Epub 2025 Aug 25.

Graph Convolutional Neural Network-Enabled Frontier Molecular Orbital Prediction: A Case Study with Neurotransmitters and Antidepressants.

J Chem Inf Model. 2025 Jul 28;65(14):7447-7462. doi: 10.1021/acs.jcim.5c00724. Epub 2025 Jul 17.

A Chemoinformatics Investigation of Spectral and Quantum Chemistry Patterns for Discovering New Drug Leads from Natural Products Targeting the PD-1/PD-L1 Immune Checkpoint, with a Particular Focus on Naturally Occurring Marine Products.

Mar Drugs. 2025 Jun 10;23(6):247. doi: 10.3390/md23060247.

Exploring the nonlinear conductive properties of polymer/graphene composites at the molecular level: a machine learning approach.

RSC Adv. 2025 May 28;15(22):17711-17719. doi: 10.1039/d5ra00705d. eCollection 2025 May 21.

Machine learning models for predicting in-hospital mortality from acute pancreatitis in intensive care unit.

BMC Med Inform Decis Mak. 2025 May 27;25(1):198. doi: 10.1186/s12911-025-03033-4.

Effect of Molecular Structure on the B3LYP-Computed HOMO-LUMO Gap: A Structure-Property Relationship Using Atomic Signatures.

ACS Omega. 2025 Jan 15;10(3):2799-2808. doi: 10.1021/acsomega.4c08626. eCollection 2025 Jan 28.

Quantum Chemistry Dataset with Ground- and Excited-state Properties of 450 Kilo Molecules.

Sci Data. 2024 Aug 29;11(1):948. doi: 10.1038/s41597-024-03788-x.

Effect of chalcogen atoms on the electronic band gaps of the quinoxaline containing donor-acceptor-donor type semiconducting polymers: a systematic DFT investigation.

J Mol Model. 2024 May 22;30(6):179. doi: 10.1007/s00894-024-05985-2.

Novel isatin-triazole based thiosemicarbazones as potential anticancer agents: synthesis, DFT and molecular docking studies.

RSC Adv. 2024 Apr 29;14(20):14051-14067. doi: 10.1039/d4ra01937g. eCollection 2024 Apr 25.

Synthesis, crystal structure and in-silico evaluation of arylsulfonamide Schiff bases for potential activity against colon cancer.

Acta Crystallogr C Struct Chem. 2024 Apr 1;80(Pt 4):129-142. doi: 10.1107/S205322962400233X. Epub 2024 Mar 28.
