Morais Camilo L M, Giamougiannis Panagiotis, Grabowska Rita, Wood Nicholas J, Martin-Hirsch Pierre L, Martin Francis L
School of Pharmacy and Biomedical Sciences, University of Central Lancashire, Preston PR1 2HE, UK.
Analyst. 2020 Aug 24;145(17):5915-5924. doi: 10.1039/d0an01328e.
Raman hyperspectral imaging is a powerful technique that provides both chemical and spatial information of a sample matrix being studied. The generated data are composed of three-dimensional (3D) arrays containing the spatial information across the x- and y-axis, and the spectral information in the z-axis. Unfolding procedures are commonly employed to analyze this type of data in a multivariate fashion, where the spatial dimension is reshaped and the spectral data fits into a two-dimensional (2D) structure and, thereafter, common first-order chemometric algorithms are applied to process the data. There are only a few algorithms capable of working with the full 3D array. Herein, we propose new algorithms for 3D discriminant analysis of hyperspectral images based on a three-dimensional principal component analysis linear discriminant analysis (3D-PCA-LDA) and a three-dimensional discriminant analysis quadratic discriminant analysis (3D-PCA-QDA) approach. The analysis was performed in order to discriminate simulated and real-world data, comprising benign controls and ovarian cancer samples based on Raman hyperspectral imaging, in which 3D-PCA-LDA and 3D-PCA-QDA achieved far superior performance than classical algorithms using unfolding procedures (PCA-LDA, PCA-QDA, partial lest squares discriminant analysis [PLS-DA], and support vector machines [SVM]), where the classification accuracies improved from 66% to 83% (simulated data) and from 50% to 100% (real-world dataset) after employing the 3D techniques. 3D-PCA-LDA and 3D-PCA-QDA are new approaches for discriminant analysis of hyperspectral images multisets to provide faster and superior classification performance than traditional techniques.
拉曼高光谱成像是一种强大的技术,可提供所研究样本基质的化学和空间信息。生成的数据由三维(3D)阵列组成,其中包含x轴和y轴上的空间信息以及z轴上的光谱信息。通常采用展开程序以多变量方式分析此类数据,即将空间维度重塑,使光谱数据适合二维(2D)结构,然后应用常见的一阶化学计量学算法来处理数据。只有少数算法能够处理完整的3D阵列。在此,我们基于三维主成分分析线性判别分析(3D - PCA - LDA)和三维判别分析二次判别分析(3D - PCA - QDA)方法,提出了用于高光谱图像三维判别分析的新算法。进行该分析是为了区分模拟数据和真实世界数据,这些数据包括基于拉曼高光谱成像的良性对照和卵巢癌样本,其中3D - PCA - LDA和3D - PCA - QDA的性能远优于使用展开程序的经典算法(PCA - LDA、PCA - QDA、偏最小二乘判别分析[PLS - DA]和支持向量机[SVM]),采用三维技术后,分类准确率从66%提高到83%(模拟数据),从50%提高到100%(真实世界数据集)。3D - PCA - LDA和3D - PCA - QDA是用于高光谱图像多集判别分析的新方法,比传统技术提供更快且更优的分类性能。