Gong Sheng, Wang Shuo, Xie Tian, Chae Woo Hyun, Liu Runze, Shao-Horn Yang, Grossman Jeffrey C
Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.
Department of Materials Science and Engineering, University of Maryland, College Park, Maryland 20742, United States.
JACS Au. 2022 Sep 9;2(9):1964-1977. doi: 10.1021/jacsau.2c00235. eCollection 2022 Sep 26.
The application of machine learning to predict materials properties measured by experiments are valuable yet difficult due to the limited amount of experimental data. In this work, we use a multifidelity random forest model to learn the experimental formation enthalpy of materials with prediction accuracy higher than the Perdew-Burke-Ernzerhof (PBE) functional with linear correction, PBEsol, and meta-generalized gradient approximation (meta-GGA) functionals (SCAN and rSCAN), and it outperforms the hotly studied deep neural network-based representation learning and transfer learning. We then use the model to calibrate the DFT formation enthalpy in the Materials Project database and discover materials with underestimated stability. The multifidelity model is also used as a data-mining approach to find how DFT deviates from experiments by explaining the model output.
由于实验数据量有限,应用机器学习来预测通过实验测量的材料特性虽有价值但颇具难度。在这项工作中,我们使用多保真度随机森林模型来学习材料的实验形成焓,其预测精度高于带有线性校正的佩德韦-伯克-恩泽霍夫(PBE)泛函、PBEsol和元广义梯度近似(meta-GGA)泛函(SCAN和rSCAN),并且它优于备受研究的基于深度神经网络的表示学习和迁移学习。然后,我们使用该模型校准材料项目数据库中的密度泛函理论(DFT)形成焓,并发现稳定性被低估的材料。多保真度模型还被用作一种数据挖掘方法,通过解释模型输出结果来发现DFT与实验的偏差。