Weng Shizhuang, Wang Cong, Zhu Rui, Wu Yehang, Yang Rui, Zheng Ling, Li Pan, Zhao Jinling, Zheng Shouguo
School of Electronic and Information Engineering, Anhui University, Anhui, Hefei 230601, China; National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Hefei 230601, China.
School of Electronic and Information Engineering, Anhui University, Anhui, Hefei 230601, China; National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Hefei 230601, China.
Spectrochim Acta A Mol Biomol Spectrosc. 2024 Aug 5;316:124295. doi: 10.1016/j.saa.2024.124295. Epub 2024 Apr 16.
Surface-enhanced Raman Spectroscopy (SERS) is extensively implemented in drug detection due to its sensitivity and non-destructive nature. Deep learning methods, which are represented by convolutional neural network (CNN), have been widely applied in identifying the spectra from SERS for powerful learning ability. However, the local receptive field of CNN limits the feature extraction of sequential spectra for suppressing the analysis results. In this study, a hybrid Transformer network, TMNet, was developed to identify SERS spectra by integrating the Transformer encoder and the multi-layer perceptron. The Transformer encoder can obtain precise feature representations of sequential spectra with the aid of self-attention, and the multi-layer perceptron efficiently transforms the representations to the final identification results. TMNet performed excellently, with identification accuracies of 99.07% for the spectra of hair containing drugs and 97.12% for those of urine containing drugs. For the spectra with additive white Gaussian, baseline background, and mixed noises, TMNet still exhibited the best performance among all the methods. Overall, the proposed method can accurately identify SERS spectra with outstanding noise resistance and excellent generalization and holds great potential for the analysis of other spectroscopy data.
表面增强拉曼光谱(SERS)因其灵敏度高和非破坏性的特点而在药物检测中得到广泛应用。以卷积神经网络(CNN)为代表的深度学习方法,因其强大的学习能力,已被广泛应用于识别SERS光谱。然而,CNN的局部感受野限制了对连续光谱的特征提取,从而影响了分析结果。在本研究中,通过整合Transformer编码器和多层感知器,开发了一种混合Transformer网络TMNet来识别SERS光谱。Transformer编码器借助自注意力机制可以获得连续光谱的精确特征表示,多层感知器则有效地将这些表示转换为最终的识别结果。TMNet表现出色,对含药毛发光谱的识别准确率为99.07%,对含药尿液光谱的识别准确率为97.12%。对于加性高斯白噪声、基线背景和混合噪声的光谱,TMNet在所有方法中仍表现出最佳性能。总体而言,所提出的方法能够准确识别SERS光谱,具有出色的抗噪声能力和良好的泛化能力,在其他光谱数据分析方面具有巨大潜力。