Department of Mechanical and Industrial Engineering, Louisiana State University, Baton Rouge, LA 70803, USA.
Department of Pediatrics, Center for Blood Oxygen Transport and Hemostasis, University of Maryland Baltimore School of Medicine, Baltimore, MD 21201, USA.
Biosensors (Basel). 2022 Aug 2;12(8):589. doi: 10.3390/bios12080589.
In this study, we explored machine learning approaches for predictive diagnosis using surface-enhanced Raman scattering (SERS), applied to the detection of COVID-19 infection in biological samples. To do this, we utilized SERS data collected from 20 patients at the University of Maryland Baltimore School of Medicine. As a preprocessing step, the positive-negative labels are obtained using Polymerase Chain Reaction (PCR) testing. First, we compared the performance of linear and nonlinear dimensionality techniques for projecting the high-dimensional Raman spectra to a low-dimensional space where a smaller number of variables defines each sample. The appropriate number of reduced features used was obtained by comparing the mean accuracy from a 10-fold cross-validation. Finally, we employed Gaussian process (GP) classification, a probabilistic machine learning approach, to correctly predict the occurrence of a negative or positive sample as a function of the low-dimensional space variables. As opposed to providing rigid class labels, the GP classifier provides a probability (ranging from zero to one) that a given sample is positive or negative. In practice, the proposed framework can be used to provide high-throughput rapid testing, and a follow-up PCR can be used for confirmation in cases where the model's uncertainty is unacceptably high.
在这项研究中,我们探索了使用表面增强拉曼散射(SERS)进行预测诊断的机器学习方法,应用于生物样本中 COVID-19 感染的检测。为此,我们利用马里兰大学巴尔的摩医学院 20 名患者的 SERS 数据。作为预处理步骤,使用聚合酶链反应(PCR)测试获得正负标签。首先,我们比较了线性和非线性降维技术在将高维拉曼光谱投影到低维空间的性能,其中每个样本由较少的变量定义。通过比较 10 倍交叉验证的平均准确性来获得适当的降维特征数量。最后,我们采用高斯过程(GP)分类,一种概率机器学习方法,根据低维空间变量正确预测负样本或正样本的发生。与提供严格的类别标签相反,GP 分类器提供了一个给定样本为正或负的概率(范围从 0 到 1)。在实践中,所提出的框架可用于提供高通量快速测试,并且可以在模型不确定性不可接受高的情况下使用后续的 PCR 进行确认。