Taniguchi Takuya
Center for Data Science, Waseda University, 1-6-1 Nishiwaseda, Shinjuku-ku, Tokyo 169-8050, Japan.
Faraday Discuss. 2025 Jan 14;256(0):139-155. doi: 10.1039/d4fd00090k.
Organic molecular crystals exhibit various functions due to their diverse molecular structures and arrangements. Computational approaches are necessary to explore novel molecular crystals from the material space, but quantum chemical calculations are costly and time-consuming. Neural network potentials (NNPs), trained on vast amounts of data, have recently gained attention for their ability to perform energy calculations with accuracy comparable to quantum chemical methods at high speed. However, NNPs trained on datasets primarily consisting of inorganic crystals, such as the Materials Project, may introduce bias when applied to organic molecular crystals. This study investigates the strategies to improve the accuracy of a pre-trained NNP for organic molecular crystals by distilling knowledge from a teacher model. The most effective knowledge transfer was achieved when fine-tuning using only soft targets, , the teacher model's inference values. As the ratio of hard target loss increased, the efficiency of knowledge transfer decreased, leading to overfitting. As a proof of concept, the NNP created through knowledge distillation was used to predict elastic properties, resulting in improved accuracy compared to the pre-trained model.
有机分子晶体由于其多样的分子结构和排列方式而展现出各种功能。从材料空间探索新型分子晶体需要计算方法,但量子化学计算成本高昂且耗时。基于大量数据训练的神经网络势(NNP),因其能够以与量子化学方法相当的精度高速进行能量计算的能力,近来受到关注。然而,在主要由无机晶体(如材料项目)组成的数据集上训练的NNP,应用于有机分子晶体时可能会引入偏差。本研究通过从教师模型中提炼知识,探究提高预训练NNP对有机分子晶体预测准确性的策略。当仅使用软目标(即教师模型的推理值)进行微调时,实现了最有效的知识转移。随着硬目标损失比例的增加,知识转移效率降低,导致过拟合。作为概念验证,通过知识蒸馏创建的NNP被用于预测弹性性质,与预训练模型相比,预测准确性得到了提高。