CEA, DAM, Le Ripault, 37260, Monts, France.
Mol Inform. 2022 May;41(5):e2100064. doi: 10.1002/minf.202100064. Epub 2021 Dec 10.
Total electronic energies and frequencies predicted using the deep learning models ANI-1x and ANI-1ccx are converted to gas-phase formation enthalpies Δ H using an atom equivalent (AE) scheme for a database of CHNO compounds. As expected from the accuracy of those models in predicting reference DFT frequencies and DLPNO-CCSD(T)/CBS energies, this procedure usually outperforms DFT-based AE schemes. However, for some compounds, including energetic molecules, significant deviations from experiment are observed, larger than obtained using DFT procedures. A close examination of the GDB-11 database from which the training data was drawn reveals that many structures of interest in the energetic materials community are excluded from this extensive compilation primarily focused on drug discovery and designed with stability constraints in mind. This points to the urgent need to set up a comparable database including energetic species of interest for the design of energetic materials such as propellants or explosives.
使用深度学习模型 ANI-1x 和 ANI-1ccx 预测的总电子能和频率通过原子等效(AE)方案转换为 CHNO 化合物数据库的气相生成焓 ΔH。从这些模型在预测参考 DFT 频率和 DLPNO-CCSD(T)/CBS 能量的准确性来看,这种方法通常优于基于 DFT 的 AE 方案。然而,对于一些化合物,包括高能分子,与实验相比观察到显著的偏差,比使用 DFT 程序获得的偏差大。对训练数据取自的 GDB-11 数据库的仔细检查表明,能量物质界感兴趣的许多结构被排除在这个主要专注于药物发现并考虑到稳定性限制的广泛编目中。这表明迫切需要建立一个类似的数据库,其中包括设计推进剂或爆炸物等高能物质时感兴趣的高能物质。