Hu Chuan-Shen, Mayengbam Rishikanta, Xia Kelin, Sum Tze Chien
Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, 21 Nanyang Link, Singapore 637371, Singapore.
Division of Physics & Applied Physics, School of Physical and Mathematical Sciences, Nanyang Technological University, 21 Nanyang Link, Singapore 637371, Singapore.
J Chem Inf Model. 2025 Jan 27;65(2):660-671. doi: 10.1021/acs.jcim.4c02033. Epub 2025 Jan 9.
With remarkable stability and exceptional optoelectronic properties, two-dimensional (2D) halide layered perovskites hold immense promise for revolutionizing photovoltaic technology. Effective data representations are key to the success of all learning models. Currently, the lack of comprehensive and accurate material representations has hindered AI-based design and discovery of 2D perovskites, limiting their potential for advanced photovoltaic applications. In this context, this work introduces a novel computational topology framework termed the quotient complex (QC), which serves as the foundation for the material representation. The proposed QC-based features are seamlessly integrated with learning models for the advancement of 2D perovskite design. At the heart of this framework lies the quotient complex descriptors (QCDs), representing a quotient variation of simplicial complexes derived from materials' unit cell and periodic boundary conditions. Differing from prior material representations, this approach encodes higher-order interactions and periodicity information simultaneously. Based on the well-established new materials for solar energetics (NMSE) databank, the proposed QC-based machine learning models exhibit superior performance against all existing counterparts. This underscores the paramount role of periodicity information in predicting material functionality, while also showcasing the remarkable efficiency of the QC-based model in characterizing materials' structural attributes.
二维(2D)卤化物层状钙钛矿具有卓越的稳定性和优异的光电性能,在光伏技术变革方面有着巨大潜力。有效的数据表示是所有学习模型成功的关键。目前,缺乏全面准确的材料表示阻碍了基于人工智能的二维钙钛矿设计与发现,限制了它们在先进光伏应用中的潜力。在此背景下,这项工作引入了一种名为商复形(QC)的新型计算拓扑框架,作为材料表示的基础。所提出的基于QC的特征与学习模型无缝集成,以推动二维钙钛矿设计的发展。该框架的核心是商复形描述符(QCD),它表示从材料的晶胞和周期性边界条件导出的单纯复形的商变化。与先前的材料表示不同,这种方法同时编码高阶相互作用和周期性信息。基于成熟的太阳能新材料(NMSE)数据库,所提出的基于QC的机器学习模型相对于所有现有同类模型表现出卓越性能。这突出了周期性信息在预测材料功能方面的至关重要作用,同时也展示了基于QC的模型在表征材料结构属性方面的显著效率。