Department of Materials Science and Engineering, Indian Institute of Technology Kanpur, Kalyanpur, Kanpur, Uttar Pradesh-208016, India.
Department of Electrical Engineering, Indian Institute of Technology Kanpur, Kalyanpur, Kanpur, Uttar Pradesh-208016, India.
J Chem Inf Model. 2024 Nov 25;64(22):8404-8413. doi: 10.1021/acs.jcim.4c01329. Epub 2024 Nov 3.
Traditional methods of materials discovery, often relying on intuition and trial-and-error experimentation, are time-consuming and limited in their ability to explore the vast design space effectively. The emergence of machine learning (ML) as a powerful tool for pattern recognition has opened exciting opportunities to revolutionize materials discovery. This work explores the application of ML techniques to assist in the discovery of materials using band structure data. The electronic band structure, which describes the energy levels of electrons in a material, holds vital information regarding its electronic and optical properties. The band structure data of 63,588 materials, including metals and insulators, have been retrieved from the Materials Project database. The data were grouped into 85 batches based on the band path in the first Brillouin zone. Three ML clustering algorithms were trained on the band structure data after performing feature selection and engineering, followed by noise reduction. The models were validated by comparing the materials' properties in a cluster.
传统的材料发现方法,通常依赖于直觉和反复试验的实验,既耗时又在有效探索设计空间方面能力有限。机器学习 (ML) 的出现作为一种强大的模式识别工具,为材料发现带来了令人兴奋的变革机会。这项工作探讨了应用 ML 技术来辅助使用能带结构数据进行材料发现。电子能带结构描述了材料中电子的能级,它包含了有关其电子和光学性质的重要信息。从 Materials Project 数据库中检索了 63,588 种材料(包括金属和绝缘体)的能带结构数据。根据第一布里渊区中的能带路径,将数据分为 85 批。在进行特征选择和工程处理以及降噪后,在能带结构数据上训练了三个 ML 聚类算法。通过比较聚类中材料的性质来验证模型。