Harrington Peter de B, Kister Jacky, Artaud Jacques, Dupuy Nathalie
OHIO University Center for Intelligent Chemical Instrumentation, Department of Chemistry and Biochemistry, Clippinger Laboratories, Athens, Ohio 45701-2979, USA.
Anal Chem. 2009 Sep 1;81(17):7160-9. doi: 10.1021/ac900538n.
An approach for automating the determination of the number of components in orthogonal signal correction (OSC) has been devised. In addition, a novel principal component OSC (PC-OSC) is reported that builds softer models for removing background from signals and is much faster than the partial least-squares (PLS) based OSC algorithm. These signal correction methods were evaluated by classifying fused near- and mid-infrared spectra of French olive oils by geographic origin. Two classification methods, partial least-squares-discriminant analysis (PLS-DA) and a fuzzy rule-building expert system (FuRES), were used to evaluate the signal correction of the fused vibrational spectra from the olive oils. The number of components was determined by using bootstrap Latin partitions (BLPs) in the signal correction routine and maximizing the average projected difference resolution (PDR). The same approach was used to select the number of latent variables in the PLS-DA evaluation and perfect classification was obtained. Biased PLS-DA models were also evaluated that optimized the number of latent variables to yield the minimum prediction error. Fuzzy or soft classification systems benefit from background removal. The FuRES prediction results did not differ significantly from the results that were obtained using either the unbiased or biased PLS-DA methods, but was an order of magnitude faster in the evaluations when a sufficient number of PC-OSC components were selected. The importance of bootstrapping was demonstrated for the automated OSC and PC-OSC methods. In addition, the PLS-DA algorithms were also automated using BLPs and proved effective.
已经设计出一种用于自动确定正交信号校正(OSC)中成分数量的方法。此外,还报道了一种新型主成分OSC(PC - OSC),它能构建更灵活的模型以去除信号背景,并且比基于偏最小二乘法(PLS)的OSC算法快得多。通过按地理来源对法国橄榄油的近红外和中红外融合光谱进行分类,对这些信号校正方法进行了评估。使用了两种分类方法,偏最小二乘判别分析(PLS - DA)和模糊规则构建专家系统(FuRES)来评估橄榄油融合振动光谱的信号校正。在信号校正程序中使用自助拉丁划分(BLP)并最大化平均投影差异分辨率(PDR)来确定成分数量。在PLS - DA评估中使用相同的方法选择潜在变量的数量,并获得了完美分类。还评估了有偏PLS - DA模型,该模型优化了潜在变量的数量以产生最小预测误差。模糊或软分类系统受益于背景去除。FuRES的预测结果与使用无偏或有偏PLS - DA方法获得的结果没有显著差异,但在选择了足够数量的PC - OSC成分时,评估速度快一个数量级。证明了自助法对于自动OSC和PC - OSC方法的重要性。此外,PLS - DA算法也使用BLP实现了自动化,并证明是有效的。