Gao Shuxian, Jennings Elaine K, Han Limei, Koch Boris P, Herzsprung Peter, Lechtenfeld Oliver J
Department Environmental Analytical Chemistry, Research Group BioGeoOmics, Helmholtz Centre for Environmental Research─UFZ, Permoserstr. 15, Leipzig D-04318, Germany.
Department of Biosciences, Ecological Chemistry, Helmholtz Centre for Polar and Marine Research─AWI, Am Handelshafen 12, Bremerhaven D-27570, Germany.
Anal Chem. 2024 Jun 25;96(25):10210-10218. doi: 10.1021/acs.analchem.4c00489. Epub 2024 Jun 13.
Ultrahigh resolution mass spectrometry (UHRMS) routinely detects and identifies thousands of mass peaks in complex mixtures, such as natural organic matter (NOM) and petroleum. The assignment of several chemically plausible molecular formulas (MFs) for a single accurate mass still poses a major problem for the reliable interpretation of NOM composition in a biogeochemical context. Applying sensible chemical rules for MF validation is often insufficient to eliminate multiple assignments (MultiAs)─especially for mass peaks with low abundance or if ample heteroatoms or isotopes are included - and requires manual inspection or expert judgment. Here, we present a new approach based on mass error distributions for the identification of true and false assignments among MultiAs. To this end, we used the mass error in millidalton (mDa), which was superior to the commonly used relative mass error in ppm. We developed an automatic workflow to group MultiAs based on their shared formula units and Kendrick mass defect values and to evaluate the mass error distribution. In this way, the number of valid assignments of chlorinated disinfection byproducts was increased by 8-fold as compared to only applying Cl/Cl isotope ratio filters. Likewise, phosphorus-containing MFs can be differentiated against chlorine-containing MFs with high confidence. Further, false assignments of highly aromatic sulfur-containing MFs ("black sulfur") to sodium adducts in negative ionization mode can be excluded by applying our approach. Overall, MFs for mass peaks that are close to the detection limit or where naturally occurring isotopes are rare (e.g., N) or absent (e.g., P and F) can now be validated, substantially increasing the reliability of MF assignments and broadening the applicability of UHRMS analysis to even more complex samples and processes.
超高分辨率质谱(UHRMS)通常能检测和识别复杂混合物中的数千个质量峰,如天然有机物(NOM)和石油。对于单个精确质量,确定几个化学上合理的分子式(MFs)仍然是在生物地球化学背景下可靠解释NOM组成的一个主要问题。应用合理的化学规则进行MF验证往往不足以消除多个归属(多重归属,MultiAs)——特别是对于低丰度的质量峰,或者如果包含大量杂原子或同位素时——并且需要人工检查或专家判断。在这里,我们提出了一种基于质量误差分布的新方法,用于在多重归属中识别真实和错误的归属。为此,我们使用了以毫道尔顿(mDa)为单位的质量误差,它优于常用的百万分之一(ppm)相对质量误差。我们开发了一个自动工作流程,根据共享的分子式单元和肯德里克质量缺陷值对多重归属进行分组,并评估质量误差分布。通过这种方式,与仅应用Cl/Cl同位素比率过滤器相比,氯化消毒副产物的有效归属数量增加了8倍。同样,可以高度自信地区分含磷的MFs和含氯 的MFs。此外,通过应用我们的方法,可以排除在负离子模式下高芳香族含硫MFs(“黑硫”)对钠加合物的错误归属。总体而言,现在可以验证接近检测限或天然存在的同位素稀少(如N)或不存在(如P和F)的质量峰的MFs,大大提高了MF归属的可靠性,并拓宽了UHRMS分析对更复杂样品和过程的适用性。