Universidade Federal da Paraíba, CCEN, Departamento de Química, Caixa Postal 5093, CEP 58051-970-João Pessoa, PB, Brazil.
Anal Chim Acta. 2011 Mar 9;689(1):22-8. doi: 10.1016/j.aca.2011.01.022. Epub 2011 Jan 19.
This work proposes a modification to the successive projections algorithm (SPA) aimed at selecting spectral variables for multiple linear regression (MLR) in the presence of unknown interferents not included in the calibration data set. The modified algorithm favours the selection of variables in which the effect of the interferent is less pronounced. The proposed procedure can be regarded as an adaptive modelling technique, because the spectral features of the samples to be analyzed are considered in the variable selection process. The advantages of this new approach are demonstrated in two analytical problems, namely (1) ultraviolet-visible spectrometric determination of tartrazine, allure red and sunset yellow in aqueous solutions under the interference of erythrosine, and (2) near-infrared spectrometric determination of ethanol in gasoline under the interference of toluene. In these case studies, the performance of conventional MLR-SPA models is substantially degraded by the presence of the interferent. This problem is circumvented by applying the proposed Adaptive MLR-SPA approach, which results in prediction errors smaller than those obtained by three other multivariate calibration techniques, namely stepwise regression, full-spectrum partial-least-squares (PLS) and PLS with variables selected by a genetic algorithm. An inspection of the variable selection results reveals that the Adaptive approach successfully avoids spectral regions in which the interference is more intense.
本工作提出了一种改进的连续投影算法(SPA),旨在选择存在于校准数据集未包含的未知干扰物的多元线性回归(MLR)的光谱变量。改进的算法有利于选择干扰物影响较小的变量。所提出的方法可以被视为一种自适应建模技术,因为在变量选择过程中考虑了待分析样品的光谱特征。该新方法的优点在两个分析问题中得到了证明,即(1)在赤藓红、诱惑红和日落黄在水溶液中的紫外可见光谱测定,在干扰物丽春红存在下;(2)在甲苯存在下,近红外光谱法测定汽油中的乙醇。在这些案例研究中,由于干扰物的存在,常规的 MLR-SPA 模型的性能大大降低。通过应用所提出的自适应 MLR-SPA 方法,可以避免这个问题,其预测误差小于其他三种多元校准技术,即逐步回归、全谱偏最小二乘法(PLS)和通过遗传算法选择变量的 PLS。对变量选择结果的检查表明,自适应方法成功地避免了干扰更强烈的光谱区域。