Gelao Vito, Fornasaro Stefano, Briguglio Sara C, Mattiussi Michele, De Martin Stefano, Astel Aleksander M, Barbieri Pierluigi, Licen Sabina
Regional Environmental Protection Agency-ARPA-FVG, Via Cairoli 14, 33057 Palmanova, Italy.
Department of Chemical and Pharmaceutical Sciences, University of Trieste, Via Giorgieri 1, 34127 Trieste, Italy.
Toxics. 2024 Jan 29;12(2):113. doi: 10.3390/toxics12020113.
(1) Background: Monitoring effluent in water treatment plants has a key role in identifying potential pollutants that might be released into the environment. A non-target analysis approach can be used for identifying unknown substances and source-specific multipollutant signatures. (2) Methods: Urban and industrial wastewater effluent were analyzed by HPLC-HRMS for non-target analysis. The anomalous infiltration of industrial wastewater into urban wastewater was investigated by analyzing the mass spectra data of "unknown common" compounds using principal component analysis (PCA) and the Self-Organizing Map (SOM) AI tool. The outcomes of the models were compared. (3) Results: The outlier detection was more straightforward in the SOM model than in the PCA one. The differences among the samples could not be completely perceived in the PCA model. Moreover, since PCA involves the calculation of new variables based on the original experimental ones, it is not possible to reconstruct a chromatogram that displays the recurring patterns in the urban WTP samples. This can be achieved using the SOM outcomes. (4) Conclusions: When comparing a large number of samples, the SOM AI tool is highly efficient in terms of calculation, visualization, and identifying outliers. Interpreting PCA visualization and outlier detection becomes challenging when dealing with a large sample size.
(1) 背景:监测污水处理厂的废水对于识别可能释放到环境中的潜在污染物具有关键作用。非目标分析方法可用于识别未知物质和特定来源的多污染物特征。(2) 方法:采用高效液相色谱-高分辨质谱法(HPLC-HRMS)对城市和工业废水进行非目标分析。利用主成分分析(PCA)和自组织映射(SOM)人工智能工具分析“未知共同”化合物的质谱数据,研究工业废水向城市废水的异常渗入情况。对模型结果进行比较。(3) 结果:SOM模型中的异常值检测比PCA模型更直接。PCA模型无法完全识别样本之间的差异。此外,由于PCA涉及基于原始实验变量计算新变量,因此无法重建显示城市污水处理厂样本中重复模式的色谱图。而使用SOM结果则可以实现这一点。(4) 结论:在比较大量样本时,SOM人工智能工具在计算、可视化和识别异常值方面效率很高。处理大量样本时,解释PCA可视化和异常值检测具有挑战性。