National Engineering Research Center for Information Technologies in Agriculture, Beijing 100097, China.
Appl Spectrosc. 2011 Sep;65(9):1062-7. doi: 10.1366/11-06287.
Epigallocatechin-3-gallate (EGCG) is credited with the majority of the health benefits associated with green tea consumption. It has a high economic and medicinal value. The feasibility of using different variable selection approaches in Fourier transform near-infrared (FT-NIR) spectroscopy for a rapid and conclusive quantitative determination of EGCG in green tea was investigated. Graphically oriented multivariate calibration modeling procedures such as interval partial least squares (iPLS), synergy interval partial least squares (siPLS), and genetic algorithm optimization combined with siPLS (siPLS-GA) were applied to select the most efficient spectral variables that provided the lowest prediction error. The performance of the final model was evaluated according to the root mean square error of prediction (RMSEP) and coefficient of determination (R(2)) for the prediction set. Experimental results showed that the siPLS-GA model obtained the best results in comparison to other models. The optimal models were achieved with R(2)(p) = 0.97 and RMSEP = 0.32. The model can be obtained with only 36 variables retained and it provides a robust model with good estimation accuracy. This demonstrates the potential of NIR spectroscopy with multivariate calibration methods to quickly detect the bioactive component in green tea.
没食子儿茶素-3-没食子酸酯(EGCG)被认为是绿茶消费与健康益处相关的主要成分。它具有很高的经济和药用价值。本研究旨在探讨傅里叶变换近红外(FT-NIR)光谱中不同变量选择方法在快速、明确地定量测定绿茶中 EGCG 方面的可行性。采用了图形导向的多元校正建模方法,如间隔偏最小二乘法(iPLS)、协同间隔偏最小二乘法(siPLS)和遗传算法优化结合 siPLS(siPLS-GA),以选择提供最低预测误差的最有效的光谱变量。根据预测集的预测均方根误差(RMSEP)和决定系数(R²)对最终模型的性能进行评估。实验结果表明,与其他模型相比,siPLS-GA 模型的结果最佳。最佳模型的 R²(p)=0.97,RMSEP=0.32。该模型仅用 36 个保留变量即可获得,并且提供了具有良好估计精度的稳健模型。这表明近红外光谱与多元校正方法相结合具有快速检测绿茶中生物活性成分的潜力。