Hao Hui-min, Tang Xiao-jun, Bai Peng, Liu Jun-hua, Zhu Chang-chun
School of Electrical Engineering, Xi'an Jiaotong University, Xi'an 710049, China.
Guang Pu Xue Yu Guang Pu Fen Xi. 2008 Jun;28(6):1286-9.
In the present paper, the authors present a new quantitative analysis method of mid-infrared spectrum. The method combines the kernel principal component analysis (KPCA) technique with support vector regress machine (SVR) to createa quantitative analysis model of multi-component gas mixtures. Firstly, the spectra of multi-component gas mixtures samples were mapped nonlinearly into a high-dimensional feature space through the use of Gaussian kernels. And then, PCA technique was employed to compute efficiently the principal components in the high-dimensional feature spaces. After determining the optimal numbers of principal components, the extracted features (principal components) were used as the inputs of SVR to create the quantitative analysis model of seven-component gas mixtures. The prediction RMSE (phi x 10(-6))of seven-component gases of prediction set samples by use of KPCA-SVR model were respectively 124.37, 72.44, 136.51, 87.29, 153.01, 57.12, and 81.72, ten times less than that by use of SVR model. The elapsed time of modeling and prediction by using KPCA-SVR were respectively 46.59 (s) and 4.94 (s), which was consumedly less than 752.52 (s) and 26.21 (s) by using only SVR These results show that KPCA has an excellent ability of nonlinear feature extraction. It can make the most of the information of entire spectra range and effectively reduce noise and the dimension of the spectra. The KPCA combined with SVR can improve the model's analysis precision and cut the elapsed time of modeling and analysis. From our research and experiments, we conclude that KPCA-SVR is an effective new method for infrared spectroscopic quantitative analysis.
在本文中,作者提出了一种新的中红外光谱定量分析方法。该方法将核主成分分析(KPCA)技术与支持向量回归机(SVR)相结合,建立了多组分气体混合物的定量分析模型。首先,通过使用高斯核将多组分气体混合物样品的光谱非线性映射到高维特征空间。然后,采用主成分分析(PCA)技术在高维特征空间中高效计算主成分。确定主成分的最佳数量后,将提取的特征(主成分)用作支持向量回归的输入,建立七组分气体混合物的定量分析模型。使用KPCA-SVR模型对预测集样品的七组分气体的预测均方根误差(φ×10⁻⁶)分别为124.37、72.44、136.51、87.29、153.01、57.12和81.72,比使用支持向量回归模型时小十倍。使用KPCA-SVR进行建模和预测的耗时分别为46.59(秒)和4.94(秒),大大少于仅使用支持向量回归时的752.52(秒)和26.21(秒)。这些结果表明,KPCA具有出色的非线性特征提取能力。它可以充分利用整个光谱范围的信息,有效降低噪声和光谱维度。KPCA与支持向量回归相结合可以提高模型的分析精度,并缩短建模和分析的耗时。通过我们的研究和实验,我们得出结论,KPCA-SVR是一种用于红外光谱定量分析的有效新方法。