Kronik Oskar Munk, Christensen Jan H, Nielsen Nikoline Juul, Tisler Selina, Tomasi Giorgio
Department of Plant and Environmental Science, University of Copenhagen, Thorvaldsensvej 40, DK-1871, Frederiksberg, Denmark.
Anal Bioanal Chem. 2025 Jan 9. doi: 10.1007/s00216-024-05718-7.
Liquid chromatography coupled to high-resolution mass spectrometry (LC-HRMS) is commonly used for identification of compounds in complex samples due to the high chromatographic and mass spectral resolution provided. In subsequent data processing workflows, it is imperative to preserve this resolution to fully exploit the data. "Region of interest" (ROI) algorithms were introduced as a better alternative to equidistant binning for compressing HRMS data because they better preserve the mass spectral resolution. In this paper, we present a new ROI algorithm that improves on the selection of contiguous m/z traces, amongst others by introducing the concept of chromatographic filter, allows for an automated approach to optimise the admissible mass-to-charge deviation (δ) and can be used to match ROIs across multiple samples. The algorithm was tested on a LC-HRMS dataset comprised of 21 replicate injections of a wastewater effluent extract and assessed on its ability to correctly retrieve the ROI's relative to 57 compounds and match them across all injections. In summary, it achieved a ten-fold compression rate in on-disk storage at a noise threshold of 200 counts, and the median ROI length matched the observed chromatographic peak width (12-23 points). Correct ROI matching with a mass accuracy of 9 ppm was observed for 52 compounds across all 21 injections with only one compound split between two adjacent m/z traces in six runs. Overall, the new algorithm performed favourably compared to the ROI algorithm currently used in the well-established ROI-MCR (multivariate curve resolution) workflow for deconvolution of HRMS chromatographic data.
液相色谱与高分辨率质谱联用(LC-HRMS)由于能提供高色谱分辨率和质谱分辨率,常用于复杂样品中化合物的鉴定。在后续的数据处理工作流程中,必须保留这种分辨率以充分利用数据。“感兴趣区域”(ROI)算法作为一种比等距分箱更好的替代方法被引入,用于压缩HRMS数据,因为它们能更好地保留质谱分辨率。在本文中,我们提出了一种新的ROI算法,该算法改进了连续质荷比(m/z)轨迹的选择,特别是通过引入色谱过滤器的概念,允许采用自动化方法优化允许的质荷比偏差(δ),并且可用于跨多个样品匹配ROI。该算法在一个由21次重复进样的废水流出物提取物组成的LC-HRMS数据集上进行了测试,并评估了其相对于57种化合物正确检索ROI并在所有进样中进行匹配的能力。总之,在200计数的噪声阈值下,它在磁盘存储中实现了10倍的压缩率,并且ROI长度的中位数与观察到的色谱峰宽相匹配(12 - 23个点)。在所有21次进样中,观察到52种化合物的ROI匹配质量准确度为9 ppm,只有一种化合物在6次运行中被分割在两个相邻的m/z轨迹之间。总体而言,与目前在成熟的ROI-MCR(多元曲线分辨率)工作流程中用于HRMS色谱数据去卷积的ROI算法相比,新算法表现良好。