Center for Intelligent Chemical Instrumentation, Clippinger Laboratories, Department of Chemistry and Biochemistry, Ohio University, Athens, Ohio 45701-2979, United States.
Anal Chem. 2013 Mar 5;85(5):2945-53. doi: 10.1021/ac303445v. Epub 2013 Feb 22.
Basil plants cultivated by organic and conventional farming practices were accurately classified by pattern recognition of gas chromatography/mass spectrometry (GC/MS) data. A novel extraction procedure was devised to extract characteristic compounds from ground basil powders. Two in-house fuzzy classifiers, i.e., the fuzzy rule-building expert system (FuRES) and the fuzzy optimal associative memory (FOAM) for the first time, were used to build classification models. Two crisp classifiers, i.e., soft independent modeling by class analogy (SIMCA) and the partial least-squares discriminant analysis (PLS-DA), were used as control methods. Prior to data processing, baseline correction and retention time alignment were performed. Classifiers were built with the two-way data sets, the total ion chromatogram representation of data sets, and the total mass spectrum representation of data sets, separately. Bootstrapped Latin partition (BLP) was used as an unbiased evaluation of the classifiers. By using two-way data sets, average classification rates with FuRES, FOAM, SIMCA, and PLS-DA were 100 ± 0%, 94.4 ± 0.4%, 93.3 ± 0.4%, and 100 ± 0%, respectively, for 100 independent evaluations. The established classifiers were used to classify a new validation set collected 2.5 months later with no parametric changes except that the training set and validation set were individually mean-centered. For the new two-way validation set, classification rates with FuRES, FOAM, SIMCA, and PLS-DA were 100%, 93%, 97%, and 100%, respectively. Thereby, the GC/MS analysis was demonstrated as a viable approach for organic basil authentication. It is the first time that a FOAM has been applied to classification. A novel baseline correction method was used also for the first time. The FuRES and the FOAM are demonstrated as powerful tools for modeling and classifying GC/MS data of complex samples, and the data pretreatments are demonstrated to be useful to improve the performance of classifiers.
采用模式识别技术,通过气相色谱/质谱(GC/MS)数据分析,对有机种植和传统种植的罗勒植物进行了准确分类。设计了一种新的提取方法,从罗勒粉中提取特征化合物。首次使用两种内部模糊分类器,即模糊规则生成专家系统(FuRES)和模糊最优联想记忆(FOAM)构建分类模型。还使用了两种清晰分类器,即软独立建模类比(SIMCA)和偏最小二乘判别分析(PLS-DA)作为对照方法。在数据处理之前,进行了基线校正和保留时间对齐。分类器分别使用双向数据集、数据集的总离子色谱图表示和数据集的总质谱图表示构建。Bootstrapped Latin partition (BLP) 被用作分类器的无偏评估。使用双向数据集,FuRES、FOAM、SIMCA 和 PLS-DA 的平均分类率分别为 100 ± 0%、94.4 ± 0.4%、93.3 ± 0.4%和 100 ± 0%,在 100 次独立评估中。建立的分类器用于分类 2.5 个月后收集的新验证集,除了训练集和验证集分别进行均值中心化外,没有进行任何参数更改。对于新的双向验证集,FuRES、FOAM、SIMCA 和 PLS-DA 的分类率分别为 100%、93%、97%和 100%。因此,GC/MS 分析被证明是一种可行的有机罗勒认证方法。这是首次将 FOAM 应用于分类。还首次使用了一种新的基线校正方法。FuRES 和 FOAM 被证明是用于建模和分类复杂样品 GC/MS 数据的强大工具,并且数据预处理被证明有助于提高分类器的性能。