Zhao Tingting, Shen Qiming, Li Xing-Fang, Huan Tao
Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver Campus, 2036 Main Mall, Vancouver, British Columbia V6T 1Z1, Canada.
Division of Analytical and Environmental Toxicology, Department of Laboratory Medicine and Pathology, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, Alberta T6G 2G3, Canada.
Environ Sci Technol. 2025 Mar 11;59(9):4530-4539. doi: 10.1021/acs.est.4c12698. Epub 2025 Feb 27.
Iodinated disinfection byproducts (I-DBPs) pose significant health concerns due to their high toxicity. Current approaches to recognize unknown I-DBPs in mass spectrometry (MS) analysis rely on negative ionization mode, in which the characteristic I fragment can be observed in tandem mass spectra (MS/MS). Still, many I-DBPs ionize exclusively in positive ionization mode, where the I fragment is absent. To address this gap, this work developed a machine learning-based strategy to recognize iodinated compounds (I-compounds) from their MS/MS in both electrospray positive (ESI+) and negative ionization (ESI-) modes. Investigating over 6000 MS/MS spectra of 381 I-compounds, we first identified five characteristic I-containing neutral losses and one diagnostic I fragment in ESI+ and ESI- modes, respectively. We then trained Random Forest models and integrated them into IodoFinder, a Python program, to streamline the recognition of I-compounds from raw LC-MS data. IodoFinder accurately recognized over 96% of the 161 I-compound standards in both ionization modes. In its application to DBP mixtures, IodoFinder discovered 19 I-DBPs with annotated structures and an additional 17 with assigned formulas, including 12 novel and 3 confirmed I-DBPs. We envision that IodoFinder will advance the identification of both known and unknown I-compounds in exposome studies.
碘化消毒副产物(I-DBPs)因其高毒性而引发了重大的健康问题。目前在质谱(MS)分析中识别未知I-DBPs的方法依赖于负离子模式,在该模式下,可在串联质谱(MS/MS)中观察到特征性的碘离子碎片。然而,许多I-DBPs仅在正离子模式下电离,在该模式下不存在碘离子碎片。为了弥补这一差距,本研究开发了一种基于机器学习的策略,用于在电喷雾正离子(ESI+)和负离子(ESI-)模式下从其MS/MS中识别碘化化合物(I-化合物)。通过研究381种I-化合物的6000多个MS/MS谱图,我们首先分别在ESI+和ESI-模式下确定了五种含碘特征性中性丢失和一个诊断性碘离子碎片。然后,我们训练了随机森林模型,并将其集成到一个名为IodoFinder的Python程序中,以简化从原始液相色谱-质谱数据中识别I-化合物的过程。IodoFinder在两种电离模式下准确识别了161种I-化合物标准品中的96%以上。在将其应用于消毒副产物混合物时,IodoFinder发现了19种具有注释结构的I-DBPs和另外17种具有指定分子式的I-DBPs,包括12种新型I-DBPs和3种已确认的I-DBPs。我们设想,IodoFinder将推动在暴露组研究中对已知和未知I-化合物的识别。