Peak Annotation and Verification Engine for Untargeted LC-MS Metabolomics.

Affiliations

Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, United States.

Department of Chemistry, Princeton University, Princeton, New Jersey 08544, United States.

Publication information

Anal Chem. 2019 Feb 5;91(3):1838-1846. doi: 10.1021/acs.analchem.8b03132. Epub 2019 Jan 10.

DOI: 10.1021/acs.analchem.8b03132

PMID: 30586294

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6501219/

Abstract

Untargeted metabolomics can detect more than 10 000 peaks in a single LC-MS run. The correspondence between these peaks and metabolites, however, remains unclear. Here, we introduce a Peak Annotation and Verification Engine (PAVE) for annotating untargeted microbial metabolomics data. The workflow involves growing cells in ¹³C and ¹⁵N isotope-labeled media to identify peaks from biological compounds and their carbon and nitrogen atom counts. Improved deisotoping and deadducting are enabled by algorithms that integrate positive mode, negative mode, and labeling data. To distinguish metabolites and their fragments, PAVE experimentally measures the response of each peak to weak in-source collision-induced dissociation, which increases the peak intensity for fragments while decreasing it for their parent ions. The molecular formulas of the putative metabolites are then assigned based on database searching using both m/z and C/N atom counts. Application of this procedure to Saccharomyces cerevisiae and Escherichia coli revealed that more than 80% of peaks do not label, i.e., are environmental contaminants. More than 70% of the biological peaks are isotopic variants, adducts, fragments, or mass spectrometry artifacts, yielding ∼2000 apparent metabolites across the two organisms. About 650 match a known metabolite formula based on m/z and C/N atom counts, with 220 assigned structures based on MS/MS and/or retention-time matches to authenticated standards. Thus, PAVE enables systematic annotation of LC-MS metabolomics data, with only ∼4% of peaks annotated as apparent metabolites.
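The formula-assignment step described in the abstract can be illustrated with a minimal Python sketch. It is not the PAVE implementation. It assumes singly charged ions, fully labeled peaks, and a 5 ppm mass tolerance, none of which are stated in the abstract. The function names, the two-entry candidate list, and the "hypothetical isobar" entry are invented for illustration. Carbon and nitrogen counts are estimated from the m/z shift between the unlabeled peak and its labeled counterparts, and candidates are kept only if both the mass and the atom counts agree.

```python
# Mass differences between heavy and light isotopes, in Da.
C13_SHIFT = 1.0033548  # 13C minus 12C
N15_SHIFT = 0.9970349  # 15N minus 14N


def atom_count(mz_unlabeled, mz_labeled, shift_per_atom, charge=1):
    """Estimate the number of labeled atoms from the m/z shift of a fully labeled peak."""
    return round((mz_labeled - mz_unlabeled) * charge / shift_per_atom)


def matches(candidate, mz_observed, n_carbon, n_nitrogen, ppm=5.0):
    """Keep a candidate only if its m/z is within tolerance AND its C/N counts agree."""
    ppm_error = abs(candidate["mz"] - mz_observed) / candidate["mz"] * 1e6
    return (
        ppm_error <= ppm
        and candidate["C"] == n_carbon
        and candidate["N"] == n_nitrogen
    )


# Illustrative peak: glutamate [M-H]- (C5H8NO4-), with its 13C- and 15N-labeled forms.
mz_unlabeled = 146.0459
mz_13c = 151.0627  # all five carbons labeled
mz_15n = 147.0429  # the single nitrogen labeled

n_c = atom_count(mz_unlabeled, mz_13c, C13_SHIFT)
n_n = atom_count(mz_unlabeled, mz_15n, N15_SHIFT)

# Hypothetical candidate list; the second entry is made up to show the C/N filter.
candidates = [
    {"name": "glutamate [M-H]-", "mz": 146.04588, "C": 5, "N": 1},
    {"name": "hypothetical isobar", "mz": 146.04590, "C": 4, "N": 3},
]

hits = [c["name"] for c in candidates if matches(c, mz_unlabeled, n_c, n_n)]
print(n_c, n_n, hits)
```

In this sketch the mass window alone would accept both candidates, and the atom counts from labeling are what separate them. That is why combining m/z with C/N counts narrows the database search.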
