Wu Song, Song Zihao, Wang Jianwei, Niu Xiaobin, Chen Haiyuan
School of Materials and Energy, University of Electronic Science and Technology of China, Chengdu 611731, China.
Phys Chem Chem Phys. 2025 Jan 2;27(2):717-729. doi: 10.1039/d4cp04496g.
The phase structure information of high-entropy alloys (HEAs) is critical for their design and application, as different phase configurations are associated with distinct chemical and physical properties. However, the broad range of elements in HEAs presents significant challenges for precise experimental design and rational theoretical modeling and simulation. To address these challenges, machine learning (ML) methods have emerged as powerful tools for phase structure prediction. In this study, we use a dataset of 544 HEA configurations to predict phases, including 248 intermetallic, 131 solid solution, and 165 amorphous phases. To mitigate the limitations imposed by the small dataset size, we employ a Generative Adversarial Network (GAN) to augment the existing data. Our results show a significant improvement in model performance with data augmentation, achieving an average accuracy of 94.77% across ten random seeds. Validation on an independent dataset confirms the model's reliability and real-world applicability, achieving 100% prediction accuracy. We also predict FCC and BCC phases for SS HEAs based on elemental composition, achieving a peak accuracy of 98%. Furthermore, feature importance analysis identifies correlations between compositional features and phase formation tendencies, which are consistent with experimental observations. This work proposes an effective strategy to enhance the accuracy and generalizability of machine learning models in phase structure prediction, thus promoting the accelerated design of HEAs for a wide range of applications.
高熵合金(HEAs)的相结构信息对于其设计和应用至关重要,因为不同的相构型与不同的化学和物理性质相关联。然而,高熵合金中的大量元素给精确的实验设计以及合理的理论建模与模拟带来了重大挑战。为应对这些挑战,机器学习(ML)方法已成为相结构预测的强大工具。在本研究中,我们使用一个包含544种高熵合金构型的数据集来预测相,其中包括248种金属间化合物相、131种固溶体相和165种非晶相。为减轻小数据集规模带来的限制,我们采用生成对抗网络(GAN)来扩充现有数据。我们的结果表明,通过数据扩充,模型性能有显著提升,在十个随机种子上的平均准确率达到94.77%。在独立数据集上的验证证实了该模型的可靠性和实际适用性,预测准确率达到100%。我们还基于元素组成预测了单相高熵合金的面心立方(FCC)相和体心立方(BCC)相,峰值准确率达到98%。此外,特征重要性分析确定了成分特征与相形成趋势之间的相关性,这与实验观察结果一致。这项工作提出了一种有效策略,以提高机器学习模型在相结构预测中的准确性和通用性,从而推动高熵合金在广泛应用中的加速设计。