Center for Intelligent Chemical Instrumentation, Department of Chemistry and Biochemistry, Clippinger Laboratories, Ohio University, Athens, Ohio 45701-2979, USA.
Anal Chem. 2011 Oct 1;83(19):7464-71. doi: 10.1021/ac2016745. Epub 2011 Aug 30.
A baseline correction method that uses basis set projection to estimate spectral backgrounds has been developed and applied to gas chromatography/mass spectrometry (GC/MS) data. An orthogonal basis was constructed using singular value decomposition (SVD) for each GC/MS two-way data object from a set of baseline mass spectra. A novel aspect of this baseline correction method is the regularization parameter that prevents overfitting that may produce negative peaks in the corrected mass spectra or ion chromatograms. The number of components in the basis, the regularization parameter, and the mass spectral range from which the spectra were sampled to construct the basis were optimized so that the projected difference resolution (PDR) or signal-to-noise ratio (SNR) was maximized. PDR is a metric similar to chromatographic resolution that indicates the separation of classes in a multivariate data space. This new baseline correction method was evaluated with two synthetic data sets and a real GC/MS data set. The prediction accuracies obtained by using the fuzzy rule-building expert system (FuRES) and partial least-squares-discriminant analysis (PLS-DA) as classifiers were compared and validated through bootstrapped Latin partition (BLP) between data before and after baseline correction. The results indicate that baseline correction of the two-way GC/MS data using the proposed methods resulted in a significant increase in average PDR values and prediction accuracies.
一种使用基组投影估计光谱背景的基线校正方法已经开发并应用于气相色谱/质谱 (GC/MS) 数据。对于一组基线质谱中的每个 GC/MS 二维数据对象,使用奇异值分解 (SVD) 构建正交基。这种基线校正方法的一个新颖方面是正则化参数,它可以防止过度拟合,从而在校正后的质谱或离子色谱图中产生负峰。基的成分数、正则化参数以及用于构建基的光谱采样的质量谱范围进行了优化,以使投影差分辨率 (PDR) 或信噪比 (SNR) 最大化。PDR 是一种类似于色谱分辨率的指标,它表示多元数据空间中类别的分离。使用模糊规则生成专家系统 (FuRES) 和偏最小二乘判别分析 (PLS-DA) 作为分类器评估了这种新的基线校正方法,并通过数据前后的自举拉丁分区 (BLP) 进行了验证校正。结果表明,使用所提出的方法对双向 GC/MS 数据进行基线校正,可显著提高平均 PDR 值和预测精度。