Department of Chemistry and Biochemistry, Old Dominion University, Norfolk, Virginia 23529, USA.
Environ Sci Technol. 2010 Oct 1;44(19):7576-82. doi: 10.1021/es1002204.
We apply multivariate statistics to explore the large data sets encountered from Fourier transform ion cyclotron resonance mass spectra of dissolved organic matter (DOM). Molecular formula assignments for the individual constituents of DOM are examined by hierarchal cluster analysis (HCA) and principal component analysis (PCA), to measure the relationships between numerous DOM samples. We compare two approaches: (1) using averages of elemental ratios and double bond equivalents calculated from the formulas, and (2) employing individual formulas and either their presence/absence or relative magnitude in each sample. With approach 2, PCA deciphers which of the thousands of formulas are significant to particular samples, and then a van Krevelen diagram highlights what types of compounds are molecular signatures to the samples. Our dual approach, especially approach 2, allows for complex data sets to be more easily interpreted, aiding in the characterization of DOM from various sources. By applying this methodology, clear trends can be delineated, trends that are not apparent from currently employed methods. Terrestrial DOM contains various lignin-derived compounds, tannins, and condensed aromatics. Marine DOM contains aliphatic compounds with heteroatom functionalities, as well as lignin-like molecules.
我们应用多元统计学来探索傅里叶变换离子回旋共振质谱法所获得的大量溶解有机物 (DOM) 数据集。通过层次聚类分析 (HCA) 和主成分分析 (PCA),对 DOM 的各个成分的分子公式进行检验,以衡量大量 DOM 样品之间的关系。我们比较了两种方法:(1) 使用从公式计算得出的元素比值和双键当量的平均值;(2) 采用单个公式,并根据其在每个样品中的存在/不存在或相对大小进行处理。使用方法 2,PCA 可以解析出数千个公式中哪些对特定样品具有重要意义,然后范德雷伦图突出显示了哪些类型的化合物是样品的分子特征。我们的双方法,尤其是方法 2,使得更易于解释复杂的数据集,有助于从各种来源对 DOM 进行特征描述。通过应用这种方法,可以描绘出清晰的趋势,这些趋势是目前使用的方法所不明显的。陆地 DOM 包含各种木质素衍生化合物、单宁和缩合芳烃。海洋 DOM 含有具有杂原子官能团的脂肪族化合物,以及类似木质素的分子。