代谢组合器2.0：用于液相色谱-质谱代谢组学的多数据集特征对齐

metabCombiner 2.0: Disparate Multi-Dataset Feature Alignment for LC-MS Metabolomics.

作者信息

Habra Hani, Meijer Jennifer L, Shen Tong, Fiehn Oliver, Gaul David A, Fernández Facundo M, Rempfert Kaitlin R, Metz Thomas O, Peterson Karen E, Evans Charles R, Karnovsky Alla

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA.

Department of Medicine, Geisel School of Medicine, Dartmouth College, Hanover, NH 03755, USA.

出版信息

Metabolites. 2024 Feb 15;14(2):125. doi: 10.3390/metabo14020125.

DOI:10.3390/metabo14020125

PMID:38393017

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10891690/

Abstract

Liquid chromatography-high-resolution mass spectrometry (LC-HRMS), as applied to untargeted metabolomics, enables the simultaneous detection of thousands of small molecules, generating complex datasets. Alignment is a crucial step in data processing pipelines, whereby LC-MS features derived from common ions are assembled into a unified matrix amenable to further analysis. Variability in the analytical factors that influence liquid chromatography separations complicates data alignment. This is prominent when aligning data acquired in different laboratories, generated using non-identical instruments, or between batches from large-scale studies. Previously, we developed metabCombiner for aligning disparately acquired LC-MS metabolomics datasets. Here, we report significant upgrades to metabCombiner that enable the stepwise alignment of multiple untargeted LC-MS metabolomics datasets, facilitating inter-laboratory reproducibility studies. To accomplish this, a "primary" feature list is used as a template for matching compounds in "target" feature lists. We demonstrate this workflow by aligning four lipidomics datasets from core laboratories generated using each institution's in-house LC-MS instrumentation and methods. We also introduce batchCombine, an application of the metabCombiner framework for aligning experiments composed of multiple batches. metabCombiner is available as an R package on Github and Bioconductor, along with a new online version implemented as an R Shiny App.

摘要

液相色谱-高分辨率质谱（LC-HRMS）应用于非靶向代谢组学时，能够同时检测数千种小分子，生成复杂的数据集。数据对齐是数据处理流程中的关键步骤，通过该步骤，源自共同离子的LC-MS特征被组装成一个统一的矩阵，便于进一步分析。影响液相色谱分离的分析因素的变异性使数据对齐变得复杂。在对齐不同实验室获取的数据、使用不同仪器生成的数据或大规模研究中不同批次的数据时，这一问题尤为突出。此前，我们开发了metabCombiner来对齐不同获取方式的LC-MS代谢组学数据集。在此，我们报告了对metabCombiner的重大升级，使其能够对多个非靶向LC-MS代谢组学数据集进行逐步对齐，促进实验室间的可重复性研究。为此，一个“主要”特征列表被用作匹配“目标”特征列表中化合物的模板。我们通过对齐来自核心实验室的四个脂质组学数据集来展示这一工作流程，这些数据集是使用每个机构内部的LC-MS仪器和方法生成的。我们还引入了batchCombine，这是metabCombiner框架的一个应用，用于对齐由多个批次组成的实验。metabCombiner可作为一个R包在Github和Bioconductor上获取，同时还有一个作为R Shiny应用实现的新在线版本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a363/10891690/e90333ab7e16/metabolites-14-00125-g001.jpg

相似文献

metabCombiner 2.0: Disparate Multi-Dataset Feature Alignment for LC-MS Metabolomics.

Metabolites. 2024 Feb 15;14(2):125. doi: 10.3390/metabo14020125.

: Paired Untargeted LC-HRMS Metabolomics Feature Matching and Concatenation of Disparately Acquired Data Sets.

Anal Chem. 2021 Mar 30;93(12):5028-5036. doi: 10.1021/acs.analchem.0c03693. Epub 2021 Mar 16.

Alignment and Analysis of a Disparately Acquired Multibatch Metabolomics Study of Maternal Pregnancy Samples.

J Proteome Res. 2022 Dec 2;21(12):2936-2946. doi: 10.1021/acs.jproteome.2c00371. Epub 2022 Nov 11.

Data Processing for GC-MS- and LC-MS-Based Untargeted Metabolomics.

Methods Mol Biol. 2019;1978:287-299. doi: 10.1007/978-1-4939-9236-2_18.

G-Aligner: a graph-based feature alignment method for untargeted LC-MS-based metabolomics.

BMC Bioinformatics. 2023 Nov 14;24(1):431. doi: 10.1186/s12859-023-05525-4.

Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography-Mass Spectrometry Metabolomics Datasets.

Anal Chem. 2022 Apr 12;94(14):5493-5503. doi: 10.1021/acs.analchem.1c03592. Epub 2022 Mar 31.

Combining peak- and chromatogram-based retention time alignment algorithms for multiple chromatography-mass spectrometry datasets.

BMC Bioinformatics. 2012 Aug 27;13:214. doi: 10.1186/1471-2105-13-214.

Large-scale untargeted LC-MS metabolomics data correction using between-batch feature alignment and cluster-based within-batch signal intensity drift correction.

Metabolomics. 2016;12(11):173. doi: 10.1007/s11306-016-1124-4. Epub 2016 Sep 22.

Batch alignment via retention orders for preprocessing large-scale multi-batch LC-MS experiments.

Bioinformatics. 2022 Aug 2;38(15):3759-3767. doi: 10.1093/bioinformatics/btac407.

Automated Annotation of Untargeted All-Ion Fragmentation LC-MS Metabolomics Data with MetaboAnnotatoR.

Anal Chem. 2022 Mar 1;94(8):3446-3455. doi: 10.1021/acs.analchem.1c03032. Epub 2022 Feb 18.

引用本文的文献

Constructing a consensus serum metabolome.

bioRxiv. 2025 May 11:2025.05.07.652782. doi: 10.1101/2025.05.07.652782.

Eclipse: a Python package for alignment of two or more nontargeted LC-MS metabolomics datasets.

Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf290.

Challenges in Lipidomics Biomarker Identification: Avoiding the Pitfalls and Improving Reproducibility.

Metabolites. 2024 Aug 19;14(8):461. doi: 10.3390/metabo14080461.

本文引用的文献

Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography-Mass Spectrometry Metabolomics Datasets.

Anal Chem. 2022 Apr 12;94(14):5493-5503. doi: 10.1021/acs.analchem.1c03592. Epub 2022 Mar 31.

Metabolomics reveals sex-specific pathways associated with changes in adiposity and muscle mass in a cohort of Mexican adolescents.

Pediatr Obes. 2022 Jun;17(6):e12887. doi: 10.1111/ijpo.12887. Epub 2022 Jan 12.

: Paired Untargeted LC-HRMS Metabolomics Feature Matching and Concatenation of Disparately Acquired Data Sets.

Anal Chem. 2021 Mar 30;93(12):5028-5036. doi: 10.1021/acs.analchem.0c03693. Epub 2021 Mar 16.

Interlaboratory Comparison of Untargeted Mass Spectrometry Data Uncovers Underlying Causes for Variability.

J Nat Prod. 2021 Mar 26;84(3):824-835. doi: 10.1021/acs.jnatprod.0c01376. Epub 2021 Mar 5.

Addressing the batch effect issue for LC/MS metabolomics data in data preprocessing.

Sci Rep. 2020 Aug 17;10(1):13856. doi: 10.1038/s41598-020-70850-0.

Disparate Metabolomics Data Reassembler: A Novel Algorithm for Agglomerating Incongruent LC-MS Metabolomics Datasets.

Anal Chem. 2020 Apr 7;92(7):5231-5239. doi: 10.1021/acs.analchem.9b05763. Epub 2020 Mar 10.

Targeted realignment of LC-MS profiles by neighbor-wise compound-specific graphical time warping with misalignment detection.

Bioinformatics. 2020 May 1;36(9):2862-2871. doi: 10.1093/bioinformatics/btaa037.

Comparison of Software Tools for Liquid Chromatography-High-Resolution Mass Spectrometry Data Processing in Nontarget Screening of Environmental Samples.

Anal Chem. 2020 Jan 21;92(2):1898-1907. doi: 10.1021/acs.analchem.9b04095. Epub 2019 Dec 27.

Early Life Exposure in Mexico to ENvironmental Toxicants (ELEMENT) Project.

BMJ Open. 2019 Aug 26;9(8):e030427. doi: 10.1136/bmjopen-2019-030427.

PAIRUP-MS: Pathway analysis and imputation to relate unknowns in profiles from mass spectrometry-based metabolite data.

PLoS Comput Biol. 2019 Jan 14;15(1):e1006734. doi: 10.1371/journal.pcbi.1006734. eCollection 2019 Jan.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

代谢组合器2.0：用于液相色谱-质谱代谢组学的多数据集特征对齐

metabCombiner 2.0: Disparate Multi-Dataset Feature Alignment for LC-MS Metabolomics.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献