Suppr超能文献

采用多变量统计方法评估非靶向长时 LC-HRMS 时间序列数据。

Evaluation of Nontarget Long-Term LC-HRMS Time Series Data Using Multivariate Statistical Approaches.

机构信息

Environmental Analysis, Currenta GmbH & Co. OHG, CHEMPARK BLG Q18, D-51368 Leverkusen, Germany.

Instrumental Analytical Chemistry (IAC) and Centre for Water and Environmental Research (ZWU), University of Duisburg-Essen, UnivFersitaetsstrasse 5, D-45141 Essen, Germany.

出版信息

Anal Chem. 2020 Sep 15;92(18):12273-12281. doi: 10.1021/acs.analchem.0c01897. Epub 2020 Sep 2.

Abstract

The use of liquid chromatography coupled with high-resolution mass spectrometry (LC-HRMS) has steadily increased in many application fields ranging from metabolomics to environmental science. HRMS data are frequently used for nontarget screening (NTS), i.e., the search for compounds that are not previously known and where no reference substances are available. However, the large quantity of data produced by NTS analytical workflows makes data interpretation and time-dependent monitoring of samples very sophisticated and necessitates exploiting chemometric data processing techniques. Consequently, in this study, a prioritization method to handle time series in nontarget data was established. As proof of concept, industrial wastewater was investigated. As routine industrial wastewater analyses monitor the occurrence of a limited number of targeted water contaminants, NTS provides the opportunity to detect also unknown trace organic compounds (TrOCs) that are not in the focus of routine target analysis. The developed prioritization method enables reducing raw data and including identification of prioritized unknown contaminants. To that end, a five-month time series for industrial wastewaters was utilized, analyzed by liquid chromatography-time-of-flight mass spectrometry (LC-qTOF-MS), and evaluated by NTS. Following peak detection, alignment, grouping, and blank subtraction, 3303 features were obtained of wastewater treatment plant (WWTP) influent samples. Subsequently, two complementary ways for exploratory time trend detection and feature prioritization are proposed. Therefore, following a prefiltering step, featurewise principal component analysis (PCA) and groupwise PCA (GPCA) of the matrix (temporal wise) were used to annotate trends of relevant wastewater contaminants. With sparse factorization of data matrices using GPCA, groups of correlated features/mass fragments or adducts were detected, recovered, and prioritized. Similarities and differences in the chemical composition of wastewater samples were observed over time to reveal hidden factors accounting for the structure of the data. The detected features were reduced to 130 relevant time trends related to TrOCs for identification. Exemplarily, as proof of concept, one nontarget pollutant was identified as -methylpyrrolidone. The developed chemometric strategies of this study are not only suitable for industrial wastewater but also could be efficiently employed for time trend exploration in other scientific fields.

摘要

液相色谱与高分辨质谱联用(LC-HRMS)在代谢组学、环境科学等多个应用领域的应用不断增加。HRMS 数据经常用于非靶向筛选(NTS),即寻找以前未知且没有参考物质的化合物。然而,NTS 分析工作流程产生的大量数据使得数据解释和样品的时间依赖性监测变得非常复杂,需要利用化学计量学数据处理技术。因此,在这项研究中,建立了一种用于处理非靶向数据中时间序列的优先级方法。作为概念验证,研究了工业废水。由于常规工业废水分析监测的是有限数量的目标水污染物的发生情况,因此 NTS 提供了检测常规目标分析不关注的未知痕量有机化合物(TrOC)的机会。开发的优先级方法能够减少原始数据并包括对优先未知污染物的识别。为此,利用了为期五个月的工业废水时间序列,通过液相色谱-飞行时间质谱(LC-qTOF-MS)进行分析,并通过 NTS 进行评估。在进行峰检测、对齐、分组和空白扣除后,获得了废水处理厂(WWTP)进水样品的 3303 个特征。随后,提出了两种用于探索性时间趋势检测和特征优先级的互补方法。因此,在进行预过滤步骤之后,使用特征主成分分析(PCA)和矩阵的分组 PCA(GPCA)对矩阵(时间)进行特征级 PCA 和分组 PCA,以注释相关废水污染物的趋势。通过使用 GPCA 对数据矩阵进行稀疏分解,检测、恢复和优先处理相关的特征/质量碎片或加合物组。随着时间的推移,观察废水样品的化学成分的相似性和差异,以揭示隐藏的因素,这些因素解释了数据的结构。检测到的特征被减少到 130 个与 TrOC 相关的与时间相关的相关趋势,用于鉴定。作为概念验证,例如,鉴定出一种非目标污染物为 -甲基吡咯烷酮。本研究开发的化学计量学策略不仅适用于工业废水,而且还可以有效地用于其他科学领域的时间趋势探索。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验