Watson Nathanial E, Vanwingerden Matthew M, Pierce Karisa M, Wright Bob W, Synovec Robert E
Department of Chemistry, University of Washington, Seattle, WA 98195, USA.
J Chromatogr A. 2006 Sep 29;1129(1):111-8. doi: 10.1016/j.chroma.2006.06.087. Epub 2006 Jul 24.
A useful methodology is introduced for the analysis of data obtained via gas chromatography with mass spectrometry (GC-MS) utilizing a complete mass spectrum at each retention time interval in which a mass spectrum was collected. Principal component analysis (PCA) with preprocessing by both piecewise retention time alignment and analysis of variance (ANOVA) feature selection is applied to all mass channels collected. The methodology involves concatenating all concurrently measured individual m/z chromatograms from m/z 20 to 120 for each GC-MS separation into a row vector. All of the sample row vectors are incorporated into a matrix where each row is a sample vector. This matrix is piecewise aligned and reduced by ANOVA feature selection. Application of the preprocessing steps (retention time alignment and feature selection) to all mass channels collected during the chromatographic separation allows considerably more selective chemical information to be incorporated in the PCA classification, and is the primary novelty of the report. This methodology is objective and requires no knowledge of the specific analytes of interest, as in selective ion monitoring (SIM), and does not restrict the mass spectral data used, as in both SIM and total ion current (TIC) methods. Significantly, the methodology allows for the classification of data with low resolution in the chromatographic dimension because of the added selectivity from the complete mass spectral dimension. This allows for the successful classification of data over significantly decreased chromatographic separation times, since high-speed separations can be employed. The methodology is demonstrated through the analysis of a set of four differing gasoline samples that serve as model complex samples. For comparison, the gasoline samples are analyzed by GC-MS over both 10-min and 10-s separation times. The successfully classified 10-min GC-MS TIC data served as the benchmark analysis to compare to the 10-s data. When only alignment and feature selection was applied to the 10-s gasoline separations using GC-MS TIC data, PCA failed. PCA was successful for 10-s gasoline separations when the methodology was applied with all the m/z information. With ANOVA feature selection, chromatographic regions with Fisher ratios greater than 1500 were retained in a new matrix and subjected to PCA yielding successful classification for the 10-s separations.
介绍了一种有用的方法,用于分析通过气相色谱-质谱联用(GC-MS)获得的数据,该方法在收集质谱图的每个保留时间间隔利用完整的质谱图。主成分分析(PCA)结合分段保留时间对齐和方差分析(ANOVA)特征选择进行预处理,应用于所有收集的质量通道。该方法包括将每次GC-MS分离中从m/z 20到120同时测量的各个m/z色谱图连接成一个行向量。所有样本行向量被整合到一个矩阵中,其中每行是一个样本向量。该矩阵通过ANOVA特征选择进行分段对齐和降维。将预处理步骤(保留时间对齐和特征选择)应用于色谱分离过程中收集的所有质量通道,可使PCA分类中纳入更多具有选择性的化学信息,这是本报告的主要创新点。该方法是客观的,不像选择性离子监测(SIM)那样需要了解感兴趣的特定分析物,也不像SIM和总离子流(TIC)方法那样限制使用的质谱数据。值得注意的是,由于完整质谱维度增加了选择性,该方法允许对色谱维度分辨率较低的数据进行分类。这使得在显著缩短色谱分离时间的情况下仍能成功对数据进行分类,因为可以采用高速分离。通过分析一组四个不同的汽油样品(作为模型复杂样品)来演示该方法。为作比较,对汽油样品进行了10分钟和10秒分离时间的GC-MS分析。成功分类的10分钟GC-MS TIC数据用作与10秒数据比较的基准分析。当仅对10秒汽油分离的GC-MS TIC数据应用对齐和特征选择时,PCA失败。当将该方法应用于所有m/z信息时,PCA对10秒汽油分离成功。通过ANOVA特征选择,将Fisher比率大于1500的色谱区域保留在一个新矩阵中,并进行PCA,从而实现了对10秒分离的成功分类。