Key Laboratory of Optical System Advanced Manufacturing Technology, Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun, 130033, PR China; University of Chinese Academy of Sciences, Beijing, 100049, PR China.
Key Laboratory of Optical System Advanced Manufacturing Technology, Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun, 130033, PR China; University of Chinese Academy of Sciences, Beijing, 100049, PR China.
Spectrochim Acta A Mol Biomol Spectrosc. 2025 Feb 5;326:125207. doi: 10.1016/j.saa.2024.125207. Epub 2024 Sep 26.
Raman spectroscopy has emerged as a highly sensitive, rapid, and label-free detection method, extensively utilized in biological research. Presently, it is frequently paired with artificial intelligence (AI) algorithms to facilitate identification and classification tasks. However, variations in the settings across different Raman spectrometers, along with the sensitive and continuous nature of biological Raman signals, can subtly alter the acquisition of these signals. This can potentially impact the classification outcomes of the spectra. Moreover, Raman spectra with disparate resolutions pose challenges for effective model training. In this study, we introduce a modularized Siamese neural network, equipped with multiple projection layers to segregate the model components. This design allows our model to support the core module spectral encoder's pluggability. The model determines the classification results by extracting the features of Raman spectra with inter-instrument variation, mapping these feature distances into spectral similarities, and finally, comparing a set of similarities. Our experimental results demonstrate the feasibility of training the model with only 10 spectra per category, using bacterial datasets we created. We compared the classification outcomes of three distinct spectral encoders, with the most effective model achieving a classification accuracy exceeding 90%. Furthermore, we successfully implemented the fusion training and prediction of Raman spectra with different resolutions. In conclusion, our model enhances the validity and comparability of Raman spectral acquisition for biological applications and diversifies the methods of Raman spectral acquisition.
拉曼光谱已成为一种高度敏感、快速且无需标记的检测方法,广泛应用于生物研究领域。目前,它经常与人工智能(AI)算法结合使用,以促进识别和分类任务。然而,不同拉曼光谱仪之间的设置变化,以及生物拉曼信号的敏感和连续性质,可能会微妙地改变这些信号的采集。这可能会影响光谱的分类结果。此外,具有不同分辨率的拉曼光谱对有效模型训练构成挑战。在本研究中,我们引入了一种模块化的孪生神经网络,配备了多个投影层来分离模型组件。这种设计允许我们的模型支持核心模块光谱编码器的可插拔性。该模型通过提取具有仪器间变化的拉曼光谱的特征,将这些特征距离映射到光谱相似性,并最终比较一组相似性,来确定分类结果。我们的实验结果表明,使用我们创建的细菌数据集,仅用每类 10 个光谱即可训练模型。我们比较了三个不同光谱编码器的分类结果,其中最有效的模型的分类准确率超过 90%。此外,我们成功地实现了不同分辨率的拉曼光谱的融合训练和预测。总之,我们的模型增强了生物应用中拉曼光谱采集的有效性和可比性,并多样化了拉曼光谱采集的方法。