• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于液相色谱-质谱联用代谢组学中常见峰识别算法差异的机制理解

Mechanistic Understanding of the Discrepancies between Common Peak Picking Algorithms in Liquid Chromatography-Mass Spectrometry-Based Metabolomics.

作者信息

Guo Jian, Huan Tao

机构信息

Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver Campus, 2036 Main Mall, Vancouver V6T 1Z1, BC, Canada.

出版信息

Anal Chem. 2023 Apr 11;95(14):5894-5902. doi: 10.1021/acs.analchem.2c04887. Epub 2023 Mar 27.

DOI:10.1021/acs.analchem.2c04887
PMID:36972195
Abstract

Inconsistent peak picking outcomes are a critical concern in processing liquid chromatography-mass spectrometry (LC-MS)-based untargeted metabolomics data. This work systematically studied the mechanisms behind the discrepancies among five commonly used peak picking algorithms, including CentWave in XCMS, linear-weighted moving average in MS-DIAL, automated data analysis pipeline (ADAP) in MZmine 2, Savitzky-Golay in El-MAVEN, and FeatureFinderMetabo in OpenMS. We first collected 10 public metabolomics datasets representing various LC-MS analytical conditions. We then incorporated several novel strategies to (i) acquire the optimal peak picking parameters of each algorithm for a fair comparison, (ii) automatically recognize false metabolic features with poor chromatographic peak shapes, and (iii) evaluate the real metabolic features that are missed by the algorithms. By applying these strategies, we compared the true, false, and undetected metabolic features in each data processing outcome. Our results show that linear-weighted moving average consistently outperforms the other peak picking algorithms. To facilitate a mechanistic understanding of the differences, we proposed six peak attributes: ideal slope, sharpness, peak height, mass deviation, peak width, and scan number. We also developed an R program to automatically measure these attributes for detected and undetected true metabolic features. From the results of the 10 datasets, we concluded that four peak attributes, including ideal slope, scan number, peak width, and mass deviation, are critical for the detectability of a peak. For instance, the focus on ideal slope critically hinders the extraction of true metabolic features with low ideal slope scores in linear-weighted moving average, Savitzky-Golay, and ADAP. The relationships between peak picking algorithms and peak attributes were also visualized in a principal component analysis biplot. Overall, the clear comparison and explanation of the differences between peak picking algorithms can lead to the design of better peak picking strategies in the future.

摘要

在基于液相色谱 - 质谱联用(LC - MS)的非靶向代谢组学数据处理中,不一致的峰识别结果是一个关键问题。这项工作系统地研究了五种常用峰识别算法之间差异背后的机制,这些算法包括XCMS中的CentWave、MS - DIAL中的线性加权移动平均、MZmine 2中的自动数据分析管道(ADAP)、El - MAVEN中的Savitzky - Golay以及OpenMS中的FeatureFinderMetabo。我们首先收集了10个代表各种LC - MS分析条件的公开代谢组学数据集。然后,我们采用了几种新颖的策略:(i)获取每种算法的最佳峰识别参数以进行公平比较;(ii)自动识别色谱峰形状不佳的假代谢特征;(iii)评估算法遗漏的真实代谢特征。通过应用这些策略,我们比较了每个数据处理结果中的真、假和未检测到的代谢特征。我们的结果表明,线性加权移动平均始终优于其他峰识别算法。为了便于从机制上理解这些差异,我们提出了六个峰属性:理想斜率、尖锐度、峰高、质量偏差、峰宽和扫描次数。我们还开发了一个R程序来自动测量检测到的和未检测到的真实代谢特征的这些属性。从10个数据集的结果来看,我们得出结论,包括理想斜率、扫描次数、峰宽和质量偏差在内的四个峰属性对峰的可检测性至关重要。例如,对理想斜率的关注严重阻碍了线性加权移动平均、Savitzky - Golay和ADAP中具有低理想斜率分数的真实代谢特征的提取。峰识别算法与峰属性之间的关系也在主成分分析双标图中进行了可视化展示。总体而言,对峰识别算法之间差异的清晰比较和解释能够在未来促成更好的峰识别策略的设计。

相似文献

1
Mechanistic Understanding of the Discrepancies between Common Peak Picking Algorithms in Liquid Chromatography-Mass Spectrometry-Based Metabolomics.基于液相色谱-质谱联用代谢组学中常见峰识别算法差异的机制理解
Anal Chem. 2023 Apr 11;95(14):5894-5902. doi: 10.1021/acs.analchem.2c04887. Epub 2023 Mar 27.
2
Automated optimization of XCMS parameters for improved peak picking of liquid chromatography-mass spectrometry data using the coefficient of variation and parameter sweeping for untargeted metabolomics.使用变异系数和参数扫描对液相色谱-质谱联用数据进行无靶标代谢组学分析时,自动优化 XCMS 参数以提高峰提取效率。
Drug Test Anal. 2019 Jun;11(6):752-761. doi: 10.1002/dta.2552. Epub 2018 Dec 25.
3
Detailed Investigation and Comparison of the XCMS and MZmine 2 Chromatogram Construction and Chromatographic Peak Detection Methods for Preprocessing Mass Spectrometry Metabolomics Data.用于质谱代谢组学数据预处理的XCMS和MZmine 2色谱图构建及色谱峰检测方法的详细研究与比较
Anal Chem. 2017 Sep 5;89(17):8689-8695. doi: 10.1021/acs.analchem.7b01069. Epub 2017 Aug 17.
4
Comparison of peak-picking workflows for untargeted liquid chromatography/high-resolution mass spectrometry metabolomics data analysis.非靶向液相色谱/高分辨率质谱代谢组学数据分析中峰挑选工作流程的比较
Rapid Commun Mass Spectrom. 2015 Jan 15;29(1):119-27. doi: 10.1002/rcm.7094.
5
Impact of three different peak picking software tools on the quality of untargeted metabolomics data.三种不同的峰挑选软件工具对非靶向代谢组学数据质量的影响
J Pharm Biomed Anal. 2024 Sep 15;248:116302. doi: 10.1016/j.jpba.2024.116302. Epub 2024 Jun 10.
6
Metabolomics Data Preprocessing Using ADAP and MZmine 2.基于 ADAP 和 MZmine 2 的代谢组学数据预处理
Methods Mol Biol. 2020;2104:25-48. doi: 10.1007/978-1-0716-0239-3_3.
7
JPA: Joint Metabolic Feature Extraction Increases the Depth of Chemical Coverage for LC-MS-Based Metabolomics and Exposomics.JPA:联合代谢特征提取增加了基于液相色谱-质谱联用的代谢组学和暴露组学的化学覆盖深度。
Metabolites. 2022 Feb 26;12(3):212. doi: 10.3390/metabo12030212.
8
Optimization of XCMS parameters for LC-MS metabolomics: an assessment of automated versus manual tuning and its effect on the final results.LC-MS 代谢组学中 XCMS 参数的优化:自动调谐与手动调谐的评估及其对最终结果的影响。
Metabolomics. 2020 Jan 10;16(1):14. doi: 10.1007/s11306-020-1636-9.
9
IPO: a tool for automated optimization of XCMS parameters.IPO:一种用于自动优化XCMS参数的工具。
BMC Bioinformatics. 2015 Apr 16;16:118. doi: 10.1186/s12859-015-0562-8.
10
Paramounter: Direct Measurement of Universal Parameters To Process Metabolomics Data in a "White Box".至上主义者:在“白盒”中直接测量通用参数以处理代谢组学数据。
Anal Chem. 2022 Mar 15;94(10):4260-4268. doi: 10.1021/acs.analchem.1c04758. Epub 2022 Mar 4.

引用本文的文献

1
Exploring the Cytokinin Profile of (Aubl.) Standl. From Guyana and Its Relationship with Secondary Metabolites: Insights into Potential Therapeutic Benefits.探索圭亚那(Aubl.) Standl.的细胞分裂素谱及其与次生代谢产物的关系:对潜在治疗益处的见解
Metabolites. 2025 Aug 6;15(8):533. doi: 10.3390/metabo15080533.
2
Unraveling non-target screening variability for LC-HRMS data: a chemometric comparative analysis of river water samples impacted by treated wastewater.解析液相色谱-高分辨质谱数据的非目标筛查变异性:受处理后废水影响的河水样本的化学计量学比较分析
Anal Bioanal Chem. 2025 Jun 25. doi: 10.1007/s00216-025-05966-1.
3
Exposome-Scale Investigation of Cl-/Br-Containing Chemicals Using High-Resolution Mass Spectrometry, Multistage Machine Learning, and Cloud Computing.
使用高分辨率质谱、多级机器学习和云计算对含氯/溴化学品进行暴露组规模调查。
Anal Chem. 2025 Jun 3;97(21):11099-11109. doi: 10.1021/acs.analchem.5c00503. Epub 2025 May 22.