群体水平无靶向代谢组学数据中光谱漂移动力学的可视化、量化和对准。

Visualization, Quantification, and Alignment of Spectral Drift in Population Scale Untargeted Metabolomics Data.

机构信息

Departments of Medicine and Pharmacology, University of California San Diego , La Jolla, California 92093, United States.

Cardiovascular Division, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School , Boston, Massachusetts 02115, United States.

出版信息

Anal Chem. 2017 Feb 7;89(3):1399-1404. doi: 10.1021/acs.analchem.6b04337. Epub 2017 Jan 26.

DOI:10.1021/acs.analchem.6b04337

PMID:28208263

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5455767/

Abstract

Untargeted liquid-chromatography-mass spectrometry (LC-MS)-based metabolomics analysis of human biospecimens has become among the most promising strategies for probing the underpinnings of human health and disease. Analysis of spectral data across population scale cohorts, however, is precluded by day-to-day nonlinear signal drifts in LC retention time or batch effects that complicate comparison of thousands of untargeted peaks. To date, there exists no efficient means of visualization and quantitative assessment of signal drift, correction of drift when present, and automated filtering of unstable spectral features, particularly across thousands of data files in population scale experiments. Herein, we report the development of a set of R-based scripts that allow for pre- and postprocessing of raw LC-MS data. These methods can be integrated with existing data analysis workflows by providing initial preprocessing bulk nonlinear retention time correction at the raw data level. Further, this approach provides postprocessing visualization and quantification of peak alignment accuracy, as well as peak-reliability-based parsing of processed data through hierarchical clustering of signal profiles. In a metabolomics data set derived from ∼3000 human plasma samples, we find that application of our alignment tools resulted in substantial improvement in peak alignment accuracy, automated data filtering, and ultimately statistical power for detection of metabolite correlates of clinical measures. These tools will enable metabolomics studies of population scale cohorts.

摘要

基于非靶向液相色谱-质谱（LC-MS）的人生物样本代谢组学分析已成为探索人类健康和疾病基础的最有前途的策略之一。然而，由于 LC 保留时间的日常非线性信号漂移或批处理效应，使得对数千个非靶向峰进行比较变得复杂，因此无法在人群规模队列中分析光谱数据。迄今为止，还没有有效的方法来可视化和定量评估信号漂移、纠正存在的漂移以及自动过滤不稳定的光谱特征，特别是在人群规模实验中的数千个数据文件中。在此，我们报告了一组基于 R 的脚本的开发，这些脚本允许对原始 LC-MS 数据进行预处理和后处理。这些方法可以通过在原始数据级别提供初始预处理批量非线性保留时间校正来集成到现有的数据分析工作流程中。此外，该方法提供了峰对齐准确性的后处理可视化和量化，以及基于峰可靠性的处理后数据解析，通过信号谱图的层次聚类。在来自约 3000 个人血浆样本的代谢组学数据集，我们发现应用我们的对齐工具可显著提高峰对齐准确性、自动数据过滤，最终提高检测与临床测量相关代谢物的统计能力。这些工具将使人群规模队列的代谢组学研究成为可能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/31ef/5455767/802ac3cf90e1/nihms853802f1.jpg

相似文献

Visualization, Quantification, and Alignment of Spectral Drift in Population Scale Untargeted Metabolomics Data.群体水平无靶向代谢组学数据中光谱漂移动力学的可视化、量化和对准。

Anal Chem. 2017 Feb 7;89(3):1399-1404. doi: 10.1021/acs.analchem.6b04337. Epub 2017 Jan 26.

Comparison of peak-picking workflows for untargeted liquid chromatography/high-resolution mass spectrometry metabolomics data analysis.非靶向液相色谱/高分辨率质谱代谢组学数据分析中峰挑选工作流程的比较

Rapid Commun Mass Spectrom. 2015 Jan 15;29(1):119-27. doi: 10.1002/rcm.7094.

Large-scale untargeted LC-MS metabolomics data correction using between-batch feature alignment and cluster-based within-batch signal intensity drift correction.使用批次间特征比对和基于聚类的批次内信号强度漂移校正对大规模非靶向液相色谱-质谱代谢组学数据进行校正。

Metabolomics. 2016;12(11):173. doi: 10.1007/s11306-016-1124-4. Epub 2016 Sep 22.

Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学：基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍

IP4M: an integrated platform for mass spectrometry-based metabolomics data mining.IP4M：基于质谱的代谢组学数据挖掘的集成平台。

BMC Bioinformatics. 2020 Oct 7;21(1):444. doi: 10.1186/s12859-020-03786-x.

Combined LC-MS/MS feature grouping, statistical prioritization, and interactive networking in msFeaST.msFeaST 中结合了 LC-MS/MS 特征分组、统计优先级排序和交互式网络。

Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae584.

LC-MS untargeted metabolomics assesses the delayed response of glufosinate treatment of transgenic glufosinate resistant (GR) buffalo grasses (Stenotaphrum secundatum L.).液相色谱-质谱联用非靶向代谢组学评估了草铵膦处理转基因抗草铵膦（GR）水牛草（钝叶草）后的延迟反应。

Metabolomics. 2021 Feb 20;17(3):28. doi: 10.1007/s11306-021-01776-5.

A Python-Based Pipeline for Preprocessing LC-MS Data for Untargeted Metabolomics Workflows.用于非靶向代谢组学工作流程的基于Python的液相色谱-质谱数据预处理管道。

Metabolites. 2020 Oct 16;10(10):416. doi: 10.3390/metabo10100416.

Evaluation of intensity drift correction strategies using MetaboDrift, a normalization tool for multi-batch metabolomics data.使用MetaboDrift（一种用于多批次代谢组学数据的归一化工具）评估强度漂移校正策略。

J Chromatogr A. 2017 Nov 10;1523:265-274. doi: 10.1016/j.chroma.2017.09.023. Epub 2017 Sep 9.

Filtering procedures for untargeted LC-MS metabolomics data.非靶向 LC-MS 代谢组学数据的过滤程序。

BMC Bioinformatics. 2019 Jun 14;20(1):334. doi: 10.1186/s12859-019-2871-9.

引用本文的文献

[Alignment method for metabolite chromatographic peaks using an -acyl glycine retention index system].[基于N-酰基甘氨酸保留指数系统的代谢物色谱峰对齐方法]

Se Pu. 2024 Feb;42(2):159-163. doi: 10.3724/SP.J.1123.2023.07015.

Pulmonary primary oxysterol and bile acid synthesis as a predictor of outcomes in pulmonary arterial hypertension.肺原发性氧化甾醇和胆汁酸合成作为肺动脉高压预后的预测指标

bioRxiv. 2024 Jan 23:2024.01.20.576474. doi: 10.1101/2024.01.20.576474.

Eicosanoid and eicosanoid-related inflammatory mediators and exercise intolerance in heart failure with preserved ejection fraction.在射血分数保留的心力衰竭中，二十烷类和与二十烷类相关的炎症介质与运动不耐受有关。

Nat Commun. 2023 Nov 20;14(1):7557. doi: 10.1038/s41467-023-43363-3.

Alignment of multiple metabolomics LC-MS datasets from disparate diseases to reveal fever-associated metabolites.将来自不同疾病的多个代谢组学 LC-MS 数据集进行对齐，以揭示与发热相关的代谢物。

PLoS Negl Trop Dis. 2023 Jul 24;17(7):e0011133. doi: 10.1371/journal.pntd.0011133. eCollection 2023 Jul.

An epidemiological introduction to human metabolomic investigations.人类代谢组学研究的流行病学概论。

Trends Endocrinol Metab. 2023 Sep;34(9):505-525. doi: 10.1016/j.tem.2023.06.006. Epub 2023 Jul 17.

Spectral binning as an approach to post-acquisition processing of high resolution FIE-MS metabolome fingerprinting data.光谱-bin 作为一种后获取处理高分辨率 FIE-MS 代谢组指纹图谱数据的方法。

Metabolomics. 2022 Aug 2;18(8):64. doi: 10.1007/s11306-022-01923-6.

Quantitative Comparison of Statistical Methods for Analyzing Human Metabolomics Data.分析人类代谢组学数据的统计方法的定量比较

Metabolites. 2022 Jun 4;12(6):519. doi: 10.3390/metabo12060519.

DEIMoS: An Open-Source Tool for Processing High-Dimensional Mass Spectrometry Data.DEIMoS：用于处理高维质谱数据的开源工具。

Anal Chem. 2022 Apr 26;94(16):6130-6138. doi: 10.1021/acs.analchem.1c05017. Epub 2022 Apr 17.

Metabolomics for personalized medicine: the input of analytical chemistry from biomarker discovery to point-of-care tests.代谢组学在个性化医疗中的应用：分析化学在从生物标志物发现到即时检测的贡献。

Anal Bioanal Chem. 2022 Jan;414(2):759-789. doi: 10.1007/s00216-021-03586-z. Epub 2021 Aug 25.

Nontargeted mass spectrometry of dried blood spots for interrogation of the human circulating metabolome.用于检测人体循环代谢组的干血斑非靶向质谱分析

J Mass Spectrom. 2021 May 27;56(8):e4772. doi: 10.1002/jms.4772.

本文引用的文献

Metabolomics enables precision medicine: "A White Paper, Community Perspective".代谢组学助力精准医学：“白皮书，社区视角”

Metabolomics. 2016;12(10):149. doi: 10.1007/s11306-016-1094-6. Epub 2016 Sep 2.

SMART: Statistical Metabolomics Analysis-An R Tool.SMART：统计代谢组学分析——R 工具。

Anal Chem. 2016 Jun 21;88(12):6334-41. doi: 10.1021/acs.analchem.6b00603. Epub 2016 Jun 1.

Biomarker Discovery and Translation in Metabolomics.代谢组学中的生物标志物发现与转化

Curr Metabolomics. 2013;1(3):227-240. doi: 10.2174/2213235X113019990005.

Improved batch correction in untargeted MS-based metabolomics.非靶向质谱代谢组学中改进的批次校正

Metabolomics. 2016;12:88. doi: 10.1007/s11306-016-1015-8. Epub 2016 Mar 18.

Metabolomics: beyond biomarkers and towards mechanisms.代谢组学：超越生物标志物，迈向作用机制研究

Nat Rev Mol Cell Biol. 2016 Jul;17(7):451-9. doi: 10.1038/nrm.2016.25. Epub 2016 Mar 16.

Intra-batch effect correction in liquid chromatography-mass spectrometry using quality control samples and support vector regression (QC-SVRC).使用质量控制样品和支持向量回归（QC-SVRC）进行液相色谱-质谱联用中的批内效应校正。

Analyst. 2015 Nov 21;140(22):7810-7. doi: 10.1039/c5an01638j.

Thermal Degradation of Small Molecules: A Global Metabolomic Investigation.小分子的热降解：一项全球代谢组学研究。

Anal Chem. 2015 Nov 3;87(21):10935-41. doi: 10.1021/acs.analchem.5b03003. Epub 2015 Oct 14.

Defining the metabolome: size, flux, and regulation.定义代谢组：规模、通量与调控

Mol Cell. 2015 May 21;58(4):699-706. doi: 10.1016/j.molcel.2015.04.021.

Distinct metabolomic signatures are associated with longevity in humans.独特的代谢组学特征与人类长寿相关。

Nat Commun. 2015 Apr 13;6:6791. doi: 10.1038/ncomms7791.

Analytical methods in untargeted metabolomics: state of the art in 2015.非靶向代谢组学中的分析方法：2015 年的最新进展。

Front Bioeng Biotechnol. 2015 Mar 5;3:23. doi: 10.3389/fbioe.2015.00023. eCollection 2015.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验