Liu Pan, Song Yan, Chai Mengyu, Han Zelin, Zhang Yu
School of Chemical Engineering and Technology, Xi'an Jiaotong University, Xi'an 710049, China.
School of Computer Science, Shaanxi Normal University, Xi'an 710119, China.
Materials (Basel). 2021 Dec 7;14(24):7504. doi: 10.3390/ma14247504.
The precise identification of micro-features on 2.25Cr1Mo0.25V steel is of great significance for understanding the mechanism of hydrogen embrittlement (HE) and evaluating the alloy's properties of HE resistance. Presently, the convolution neural network (CNN) of deep learning is widely applied in the micro-features identification of alloy. However, with the development of the transformer in image recognition, the transformer-based neural network performs better on the learning of global and long-range semantic information than CNN and achieves higher prediction accuracy. In this work, a new transformer-based neural network model Swin-UNet++ was proposed. Specifically, the architecture of the decoder was redesigned to more precisely detect and identify the micro-feature with complex morphology (i.e., dimples) of 2.25Cr1Mo0.25V steel fracture surface. Swin-UNet++ and other segmentation models performed state-of-the-art (SOTA) were compared on the dimple dataset constructed in this work, which consists of 830 dimple scanning electron microscopy (SEM) images on 2.25Cr1Mo0.25V steel fracture surface. The segmentation results show Swin-UNet++ not only realizes the accurate identification of dimples but displays a much higher prediction accuracy and stronger robustness than Swin-Unet and UNet. Moreover, efforts from this work will also provide an important reference value to the identification of other micro-features with complex morphologies.
精确识别2.25Cr1Mo0.25V钢的微观特征对于理解氢脆(HE)机理和评估该合金的抗氢脆性能具有重要意义。目前,深度学习中的卷积神经网络(CNN)广泛应用于合金微观特征识别。然而,随着图像识别领域中Transformer的发展,基于Transformer的神经网络在全局和长程语义信息学习方面比CNN表现更优,且实现了更高的预测精度。在这项工作中,提出了一种基于Transformer的新型神经网络模型Swin-UNet++。具体而言,重新设计了解码器架构,以更精确地检测和识别2.25Cr1Mo0.25V钢断口表面具有复杂形态(即韧窝)的微观特征。在本工作构建的韧窝数据集上,将Swin-UNet++与其他表现最优(SOTA)的分割模型进行了比较,该数据集由830张2.25Cr1Mo0.25V钢断口表面韧窝的扫描电子显微镜(SEM)图像组成。分割结果表明,Swin-UNet++不仅实现了韧窝的准确识别,而且与Swin-Unet和UNet相比,具有更高的预测精度和更强的鲁棒性。此外,这项工作也将为其他具有复杂形态的微观特征识别提供重要的参考价值。