Yoon Jonghoon, Choi Eunseong, Min Kyoungmin
School of Mechanical Engineering, Soongsil University, 369 Sangdo-ro, Dongjak-gu, Seoul 06978, Republic of Korea.
J Phys Chem A. 2021 Nov 25;125(46):10103-10110. doi: 10.1021/acs.jpca.1c05292. Epub 2021 Nov 12.
Gibbs free energy is a fundamental physical property for understanding the stability and synthesizability of materials under various thermodynamic conditions, but its accessibility and availability are still limited. In this study, we used 9880 phonon databases to construct a machine learning model to predict approximately 40 000 Inorganic Crystalline Solid Database (ICSD) materials, whose free energy information has not been fully explored. To improve the prediction accuracy, a sampling strategy was implemented by including structures with low accuracy metrics, leading to and mean absolute error values of 0.99 and 18.7 kJ/mol, respectively, in the testing set. Uncertainty analysis was followed for unexplored ICSD materials by obtaining the standard deviation in predictions from 10 surrogate models with different samplings in the training set. Based on this, an optimization process was conducted: density functional calculations were performed for 50 structures with high uncertainty and the training database was updated; this loop was repeated 15 times. This demonstrates the reduction and saturation in the uncertainty, confirming that the constructed model can provide a comprehensive map of the Gibbs free energy for inorganic solid materials. This can accelerate the material screening process by providing information on thermodynamic stability.
吉布斯自由能是理解材料在各种热力学条件下的稳定性和可合成性的基本物理性质,但其可及性和可用性仍然有限。在本研究中,我们使用9880个声子数据库构建了一个机器学习模型,以预测约40000种无机晶体固体数据库(ICSD)材料,这些材料的自由能信息尚未得到充分探索。为了提高预测准确性,我们实施了一种采样策略,纳入了具有低精度指标的结构,测试集中的均方根误差和平均绝对误差值分别为0.99和18.7kJ/mol。通过从训练集中具有不同采样的10个替代模型获得预测中的标准差,对未探索的ICSD材料进行了不确定性分析。在此基础上,进行了一个优化过程:对50个具有高不确定性的结构进行密度泛函计算,并更新训练数据库;这个循环重复了15次。这表明不确定性降低并趋于饱和,证实了所构建的模型可以提供无机固体材料吉布斯自由能的全面图谱。这可以通过提供热力学稳定性信息来加速材料筛选过程。