利用分子拓扑预测含能化合物的密度。

Density Prediction Models for Energetic Compounds Merely Using Molecular Topology.

机构信息

School of Computer Science and Technology, Southwest University of Science & Technology, Mianyang 621010, Sichuan, China.

Institute of Chemical Materials, China Academy of Engineering Physics (CAEP), P.O. Box 919-311, Mianyang 621999, Sichuan, China.

出版信息

J Chem Inf Model. 2021 Jun 28;61(6):2582-2593. doi: 10.1021/acs.jcim.0c01393. Epub 2021 Apr 12.

DOI:10.1021/acs.jcim.0c01393

PMID:33844526

Abstract

Newly developed high-throughput methods for property predictions make the process of materials design faster and more efficient. Density is an important physical property for energetic compounds to assess detonation velocity and detonation pressure, but the time cost of recent density prediction models is still high owing to the time-consuming processes to calculate molecular descriptors. To improve the screening efficiency of potential energetic compounds, new methods for density prediction with more accuracy and less time cost are urgently needed, and a possible solution is to establish direct mappings between the molecular structure and density. We propose three machine learning (ML) models, support vector machine (SVM), random forest (RF), and Graph neural network (GNN), using molecular topology as the only known input. The widely applied quantitative structure-property relationship based on the density functional theory (DFT-QSPR) is adopted as the benchmark to evaluate the accuracies of the models. All these four models are trained and tested by using the same data set enclosing over 2000 reported nitro compounds searched out from the Cambridge Structural Database. The proportions of compounds with prediction error less than 5% are evaluated by using the independent test set, and the values for the models of SVM, RF, DFT-QSPR, and GNN are 48, 63, 85, and 88%, respectively. The results show that, for the models of SVM and RF, fingerprint bit vectors alone are not facilitated to obtain good QSPRs. Mapping between the molecular structure and density can be well established by using GNN and molecular topology, and its accuracy is slightly better than that of the time-consuming DFT-QSPR method. The GNN-based model has higher accuracy and lower computational resource cost than the widely accepted DFT-QSPR model, so it is more suitable for high-throughput screening of energetic compounds.

摘要

新开发的高通量物性预测方法使材料设计过程更快、更高效。密度是评估爆轰速度和爆轰压力的含能化合物的重要物理性质，但由于计算分子描述符的耗时过程，最近的密度预测模型的时间成本仍然很高。为了提高潜在含能化合物的筛选效率，迫切需要具有更高精度和更低时间成本的密度预测新方法，一种可能的解决方案是建立分子结构和密度之间的直接映射。我们提出了三种机器学习 (ML) 模型，支持向量机 (SVM)、随机森林 (RF) 和图神经网络 (GNN)，仅使用分子拓扑作为唯一已知输入。广泛应用的基于密度泛函理论的定量结构-性质关系 (DFT-QSPR) 被用作基准来评估模型的准确性。所有这四个模型都是通过使用包含从剑桥结构数据库中搜索出的 2000 多个报道的硝基化合物的相同数据集进行训练和测试的。通过使用独立测试集评估预测误差小于 5%的化合物的比例，SVM、RF、DFT-QSPR 和 GNN 模型的值分别为 48%、63%、85%和 88%。结果表明，对于 SVM 和 RF 模型，单独使用指纹位向量不利于获得良好的 QSPR。通过使用 GNN 和分子拓扑可以很好地建立分子结构和密度之间的映射，其准确性略优于耗时的 DFT-QSPR 方法。基于 GNN 的模型比广泛接受的 DFT-QSPR 模型具有更高的准确性和更低的计算资源成本，因此更适合含能化合物的高通量筛选。

相似文献

Density Prediction Models for Energetic Compounds Merely Using Molecular Topology.利用分子拓扑预测含能化合物的密度。

J Chem Inf Model. 2021 Jun 28;61(6):2582-2593. doi: 10.1021/acs.jcim.0c01393. Epub 2021 Apr 12.

[Research on QSPR for n-octanol-water partition coefficients of organic compounds based on genetic algorithms-support vector machine and genetic algorithms-radial basis function neural networks].基于遗传算法-支持向量机和遗传算法-径向基函数神经网络的有机化合物正辛醇-水分配系数的定量构效关系研究

Huan Jing Ke Xue. 2008 Jan;29(1):212-8.

QSPR studies of impact sensitivity of nitro energetic compounds using three-dimensional descriptors.采用三维描述符对硝胺类炸药撞击感度的定量构效关系研究。

J Mol Graph Model. 2012 Jun;36:10-9. doi: 10.1016/j.jmgm.2012.03.002. Epub 2012 Mar 20.

Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure-Property Relationship for Density Prediction of Energetic Materials.结合定量结构-性质关系的新型随机森林集成建模策略用于含能材料密度预测

ACS Omega. 2023 Jan 4;8(2):2752-2759. doi: 10.1021/acsomega.2c07436. eCollection 2023 Jan 17.

QSPR modelling for intrinsic viscosity in polymer-solvent combinations based on density functional theory.基于密度泛函理论的聚合物-溶剂体系特性黏度的 QSPR 建模。

SAR QSAR Environ Res. 2021 May;32(5):379-393. doi: 10.1080/1062936X.2021.1902387. Epub 2021 Apr 7.

QSPR study of Setschenow constants of organic compounds using MLR, ANN, and SVM analyses.采用多元线性回归（MLR）、人工神经网络（ANN）和支持向量机（SVM）分析对有机化合物的 Setschenow 常数进行 QSPR 研究。

J Comput Chem. 2011 Nov 30;32(15):3241-52. doi: 10.1002/jcc.21907. Epub 2011 Aug 12.

Six global and local QSPR models of aqueous solubility at pH = 7.4 based on structural similarity and physicochemical descriptors.基于结构相似性和物理化学描述符的 6 个全球和局部 pH=7.4 下的水溶解度 QSPR 模型。

SAR QSAR Environ Res. 2017 Aug;28(8):661-676. doi: 10.1080/1062936X.2017.1368704. Epub 2017 Sep 11.

QSPR modeling of thermal stability of nitroaromatic compounds: DFT vs. AM1 calculated descriptors.基于密度泛函理论（DFT）与 AM1 计算描述符的硝基芳香族化合物热稳定性的 QSPR 建模

J Mol Model. 2010 Apr;16(4):805-12. doi: 10.1007/s00894-009-0634-7. Epub 2010 Jan 5.

Prediction of impact sensitivity of nitro energetic compounds by neural network based on electrotopological-state indices.基于电拓扑状态指数的神经网络对硝基含能化合物撞击感度的预测

J Hazard Mater. 2009 Jul 15;166(1):155-86. doi: 10.1016/j.jhazmat.2008.11.005. Epub 2008 Nov 13.

QSPR modeling of detonation parameters and sensitivity of some energetic materials: DFT vs. PM3 calculations.某些含能材料爆轰参数和感度的定量结构-性质关系建模：密度泛函理论与PM3计算对比

J Mol Model. 2017 Jun;23(6):193. doi: 10.1007/s00894-017-3357-1. Epub 2017 May 22.

引用本文的文献

Machine Learning Densities, Detonation Velocities, and Formation Enthalpies of Energetic Materials Using Quantum Chemistry Descriptors.利用量子化学描述符预测含能材料的机器学习密度、爆速和生成焓

J Chem Theory Comput. 2025 Sep 9;21(17):8406-8419. doi: 10.1021/acs.jctc.5c00865. Epub 2025 Aug 28.

Design and computational screening of high-energy, low-sensitivity bistetrazole-based energetic molecules.基于双四唑的高能、低感度含能分子的设计与计算筛选

RSC Adv. 2025 Apr 14;15(15):11645-11654. doi: 10.1039/d5ra01604e. eCollection 2025 Apr 9.

Machine Learning Models for High Explosive Crystal Density and Performance.用于高爆炸药晶体密度和性能的机器学习模型

Chem Mater. 2024 Nov 7;36(22):11109-11118. doi: 10.1021/acs.chemmater.4c01978. eCollection 2024 Nov 26.

High precision deep-learning model combined with high-throughput screening to discover fused [5,5] biheterocyclic energetic materials with excellent comprehensive properties.高精度深度学习模型结合高通量筛选以发现具有优异综合性能的稠合[5,5]双杂环含能材料。

RSC Adv. 2024 Jul 29;14(33):23672-23682. doi: 10.1039/d4ra03233k. eCollection 2024 Jul 26.

Accurate Density Prediction of Sesquiterpenoid HEDFs and the Multiproperty Computing Server SesquiterPre.倍半萜类高能量密度化合物的精确密度预测及多性质计算服务器SesquiterPre

ACS Omega. 2024 Jun 5;9(24):26213-26221. doi: 10.1021/acsomega.4c01898. eCollection 2024 Jun 18.

Atom-Based Machine Learning Model for Quantitative Property-Structure Relationship of Electronic Properties of Fusenes and Substituted Fusenes.基于原子的机器学习模型用于并苯及其取代物电子性质的定量构效关系

ACS Omega. 2023 Oct 2;8(41):38441-38451. doi: 10.1021/acsomega.3c05212. eCollection 2023 Oct 17.

Force field-inspired transformer network assisted crystal density prediction for energetic materials.基于力场启发的变压器网络辅助高能材料晶体密度预测

J Cheminform. 2023 Jul 19;15(1):65. doi: 10.1186/s13321-023-00736-6.

Prediction and Construction of Energetic Materials Based on Machine Learning Methods.基于机器学习方法的高能材料的预测与构建。

Molecules. 2022 Dec 31;28(1):322. doi: 10.3390/molecules28010322.

Crystal Structure and Noncovalent Interactions of Heterocyclic Energetic Molecules.杂环含能分子的晶体结构和非共价相互作用。

Molecules. 2022 Aug 4;27(15):4969. doi: 10.3390/molecules27154969.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用分子拓扑预测含能化合物的密度。

Density Prediction Models for Energetic Compounds Merely Using Molecular Topology.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献