Faculty of Science and Technology, Norwegian University of Life Sciences, 1430 Ås, Norway.
Research Unit of Medical Imaging, Physics and Technology, Faculty of Medicine, University of Oulu, 90570 Oulu, Finland.
Molecules. 2022 Apr 1;27(7):2298. doi: 10.3390/molecules27072298.
Preclassification of raw infrared spectra has often been neglected in scientific literature. Separating spectra of low spectral quality, due to low signal-to-noise ratio, presence of artifacts, and low analyte presence, is crucial for accurate model development. Furthermore, it is very important for sparse data, where it becomes challenging to visually inspect spectra of different natures. Hence, a preclassification approach to separate infrared spectra for sparse data is needed. In this study, we propose a preclassification approach based on Multiplicative Signal Correction (MSC). The MSC approach was applied on human and the bovine knee cartilage broadband Fourier Transform Infrared (FTIR) spectra and on a sparse data subset comprising of only seven wavelengths. The goal of the preclassification was to separate spectra with analyte-rich signals (i.e., cartilage) from spectra with analyte-poor (and high-matrix) signals (i.e., water). The human datasets 1 and 2 contained 814 and 815 spectra, while the bovine dataset contained 396 spectra. A pure water spectrum was used as a reference spectrum in the MSC approach. A threshold for the root mean square error (RMSE) was used to separate cartilage from water spectra for broadband and the sparse spectral data. Additionally, standard noise-to-ratio and principle component analysis were applied on broadband spectra. The fully automated MSC preclassification approach, using water as reference spectrum, performed as well as the manual visual inspection. Moreover, it enabled not only separation of cartilage from water spectra in broadband spectral datasets, but also in sparse datasets where manual visual inspection cannot be applied.
在科学文献中,原始红外光谱的预分类常常被忽视。对于低信噪比、存在伪影和低分析物存在的低光谱质量的光谱进行分离,对于准确的模型开发至关重要。此外,对于稀疏数据来说,这一点非常重要,因为在稀疏数据中,不同性质的光谱很难进行可视化检查。因此,需要一种针对稀疏数据的红外光谱预分类方法。在本研究中,我们提出了一种基于多重信号校正(MSC)的预分类方法。该 MSC 方法应用于人类和牛膝关节软骨宽带傅里叶变换红外(FTIR)光谱,以及仅包含七个波长的稀疏数据子集。预分类的目的是将具有分析物丰富信号(即软骨)的光谱与具有分析物贫(和高基质)信号(即水)的光谱分离。数据集 1 和 2 分别包含 814 个和 815 个光谱,而牛数据集包含 396 个光谱。在 MSC 方法中,纯水光谱被用作参考光谱。使用均方根误差(RMSE)的阈值来分离宽带和稀疏光谱数据中的软骨和水光谱。此外,标准噪声比和主成分分析也应用于宽带光谱。使用水作为参考光谱的全自动 MSC 预分类方法与手动目视检查一样有效。此外,它不仅能够在宽带光谱数据集中分离软骨和水光谱,而且能够在手动目视检查无法应用的稀疏数据集中进行分离。