Mo Wenbo, Wen Jiaxing, Huang Jinglin, Yang Yue, Zhou Minjie, Ni Shuang, Le Wei, Wei Lai, Qi Daojian, Wang Shaoyi, Su Jingqin, Wu Yuchi, Zhou Weimin, Du Kai, Wang Xuewu, Zhao Zongqing
Laser Fusion Research Center at China Academy of Engineering Physics, Mianyang, China.
Department of Engineering Physics, Tsinghua University, Beijing, China.
J Appl Spectrosc. 2023;89(6):1203-1211. doi: 10.1007/s10812-023-01487-w. Epub 2023 Jan 26.
The outbreak of COVID-19 has spread worldwide, causing great damage to the global economy. Raman spectroscopy is expected to become a rapid and accurate method for the detection of coronavirus. A classification method of coronavirus spike proteins by Raman spectroscopy based on deep learning was implemented. A Raman spectra dataset of the spike proteins of five coronaviruses (including MERS-CoV, SARS-CoV, SARS-CoV-2, HCoVHKU1, and HCoV-OC43) was generated to establish the neural network model for classification. Even for rapidly acquired spectra with a low signal-to-noise ratio, the average accuracy exceeded 97%. An interpretive analysis of the classification results of the neural network was performed, which indicated that the differences in spectral characteristics captured by the neural network were consistent with the experimental analysis. The interpretative analysis method provided a valuable reference for identifying complex Raman spectra using deep-learning techniques. Our approach exhibited the potential to be applied in clinical practice to identify COVID-19 and other coronaviruses, and it can also be applied to other identification problems such as the identification of viruses or chemical agents, as well as in industrial areas such as oil and gas exploration.
新型冠状病毒肺炎(COVID-19)疫情已在全球蔓延,给全球经济造成了巨大破坏。拉曼光谱有望成为一种快速、准确检测冠状病毒的方法。基于深度学习实现了一种利用拉曼光谱对冠状病毒刺突蛋白进行分类的方法。生成了包含五种冠状病毒(中东呼吸综合征冠状病毒、严重急性呼吸综合征冠状病毒、严重急性呼吸综合征冠状病毒2、人冠状病毒HKU1和人冠状病毒OC43)刺突蛋白的拉曼光谱数据集,以建立用于分类的神经网络模型。即使对于快速获取的低信噪比光谱,平均准确率也超过了97%。对神经网络的分类结果进行了解释性分析,结果表明神经网络捕捉到的光谱特征差异与实验分析一致。该解释性分析方法为利用深度学习技术识别复杂拉曼光谱提供了有价值的参考。我们的方法显示出在临床实践中用于识别COVID-19和其他冠状病毒的潜力,也可应用于其他识别问题,如病毒或化学制剂的识别,以及石油和天然气勘探等工业领域。