Cao Xiaofeng, Luo Wenjia, Liu Huimin
School of Chemistry and Chemical Engineering, Southwest Petroleum University Chengdu 610500 P. R. China
RSC Adv. 2024 Apr 16;14(17):12235-12246. doi: 10.1039/d4ra00710g. eCollection 2024 Apr 10.
Despite the rapid development of computational methods, including density functional theory (DFT), predicting the performance of a catalytic material merely based on its atomic arrangements remains challenging. Although quantum mechanics-based methods can model 'real' materials with dopants, grain boundaries, and interfaces with acceptable accuracy, the high demand for computational resources no longer meets the needs of modern scientific research. On the other hand, Machine Learning (ML) method can accelerate the screening of alloy-based catalytic materials. In this study, an ML model was developed to predict the CO and CO adsorption affinity on single-atom doped binary alloys based on the thermochemical properties of component metals. By using a greedy algorithm, the best combination of features was determined, and the ML model was trained and verified based on a data set containing 78 alloys on which the adsorption energy values of CO and CO were calculated from DFT. Comparison between predicted and DFT calculated adsorption energy values suggests that the extreme gradient boosting (XGBoost) algorithm has excellent generalization performance, and the -squared () for CO and CO adsorption energy prediction are 0.96 and 0.91, respectively. The errors of predicted adsorption energy are 0.138 eV and 0.075 eV for CO and CO, respectively. This model can be expected to advance our understanding of structure-property relationships at the fundamental level and be used in large-scale screening of alloy-based catalysts.
尽管包括密度泛函理论(DFT)在内的计算方法发展迅速,但仅基于其原子排列来预测催化材料的性能仍然具有挑战性。虽然基于量子力学的方法可以以可接受的精度对含有掺杂剂、晶界和界面的“真实”材料进行建模,但对计算资源的高需求已不再满足现代科学研究的需要。另一方面,机器学习(ML)方法可以加速基于合金的催化材料的筛选。在本研究中,基于组成金属的热化学性质,开发了一种ML模型来预测单原子掺杂二元合金上CO和CO的吸附亲和力。通过使用贪心算法确定了最佳特征组合,并基于一个包含78种合金的数据集对ML模型进行了训练和验证,在该数据集上从DFT计算了CO和CO的吸附能值。预测的吸附能值与DFT计算的吸附能值之间的比较表明,极端梯度提升(XGBoost)算法具有出色的泛化性能,CO和CO吸附能预测的决定系数(R²)分别为0.96和0.91。CO和CO的预测吸附能误差分别为0.138 eV和0.075 eV。该模型有望在基础层面推进我们对结构-性能关系的理解,并用于基于合金的催化剂的大规模筛选。