DBnorm作为一个R包，用于代谢组学研究中批次效应校正的合适统计方法的比较与选择。

DBnorm as an R package for the comparison and selection of appropriate statistical methods for batch effect correction in metabolomic studies.

作者信息

Bararpour Nasim, Gilardi Federica, Carmeli Cristian, Sidibe Jonathan, Ivanisevic Julijana, Caputo Tiziana, Augsburger Marc, Grabherr Silke, Desvergne Béatrice, Guex Nicolas, Bochud Murielle, Thomas Aurelien

机构信息

Unit of Forensic Toxicology and Chemistry, CURML, Lausanne University Hospital-Geneva University Hospitals, Lausanne-Geneva, Switzerland.

Faculty Unit of Toxicology, CURML, Lausanne University Hospital, Faculty of Biology and Medicine, University of Lausanne, Lausanne, Switzerland.

出版信息

Sci Rep. 2021 Mar 11;11(1):5657. doi: 10.1038/s41598-021-84824-3.

DOI:10.1038/s41598-021-84824-3

PMID:33707505

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7952378/

Abstract

As a powerful phenotyping technology, metabolomics provides new opportunities in biomarker discovery through metabolome-wide association studies (MWAS) and the identification of metabolites having a regulatory effect in various biological processes. While mass spectrometry-based (MS) metabolomics assays are endowed with high throughput and sensitivity, MWAS are doomed to long-term data acquisition generating an overtime-analytical signal drift that can hinder the uncovering of real biologically relevant changes. We developed "dbnorm", a package in the R environment, which allows for an easy comparison of the model performance of advanced statistical tools commonly used in metabolomics to remove batch effects from large metabolomics datasets. "dbnorm" integrates advanced statistical tools to inspect the dataset structure not only at the macroscopic (sample batches) scale, but also at the microscopic (metabolic features) level. To compare the model performance on data correction, "dbnorm" assigns a score that help users identify the best fitting model for each dataset. In this study, we applied "dbnorm" to two large-scale metabolomics datasets as a proof of concept. We demonstrate that "dbnorm" allows for the accurate selection of the most appropriate statistical tool to efficiently remove the overtime signal drift and to focus on the relevant biological components of complex datasets.

摘要

作为一种强大的表型分析技术，代谢组学通过全代谢组关联研究（MWAS）以及鉴定在各种生物过程中具有调节作用的代谢物，为生物标志物的发现提供了新的机遇。虽然基于质谱（MS）的代谢组学分析具有高通量和高灵敏度，但MWAS注定要进行长期的数据采集，这会产生随时间变化的分析信号漂移，从而可能阻碍发现真正具有生物学相关性的变化。我们开发了“dbnorm”，这是R环境中的一个软件包，它可以轻松比较代谢组学中常用的先进统计工具的模型性能，以便从大型代谢组学数据集中消除批次效应。“dbnorm”整合了先进的统计工具，不仅可以在宏观（样本批次）尺度上检查数据集结构，还可以在微观（代谢特征）层面进行检查。为了比较数据校正方面的模型性能，“dbnorm”会给出一个分数，帮助用户为每个数据集确定最合适的模型。在本研究中，我们将“dbnorm”应用于两个大规模代谢组学数据集作为概念验证。我们证明，“dbnorm”能够准确选择最合适的统计工具，以有效消除随时间变化的信号漂移，并专注于复杂数据集的相关生物学成分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f67/7952378/395700bac829/41598_2021_84824_Fig1_HTML.jpg

相似文献

DBnorm as an R package for the comparison and selection of appropriate statistical methods for batch effect correction in metabolomic studies.DBnorm作为一个R包，用于代谢组学研究中批次效应校正的合适统计方法的比较与选择。

Sci Rep. 2021 Mar 11;11(1):5657. doi: 10.1038/s41598-021-84824-3.

Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学：基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍

Evaluation of intensity drift correction strategies using MetaboDrift, a normalization tool for multi-batch metabolomics data.使用MetaboDrift（一种用于多批次代谢组学数据的归一化工具）评估强度漂移校正策略。

J Chromatogr A. 2017 Nov 10;1523:265-274. doi: 10.1016/j.chroma.2017.09.023. Epub 2017 Sep 9.

Large-scale untargeted LC-MS metabolomics data correction using between-batch feature alignment and cluster-based within-batch signal intensity drift correction.使用批次间特征比对和基于聚类的批次内信号强度漂移校正对大规模非靶向液相色谱-质谱代谢组学数据进行校正。

Metabolomics. 2016;12(11):173. doi: 10.1007/s11306-016-1124-4. Epub 2016 Sep 22.

Concordance-Based Batch Effect Correction for Large-Scale Metabolomics.基于一致性的大规模代谢组学批次效应校正

Anal Chem. 2023 May 9;95(18):7220-7228. doi: 10.1021/acs.analchem.2c05748. Epub 2023 Apr 28.

Feature Selection Methods for Early Predictive Biomarker Discovery Using Untargeted Metabolomic Data.基于非靶向代谢组学数据的早期预测生物标志物发现的特征选择方法。

Front Mol Biosci. 2016 Jul 8;3:30. doi: 10.3389/fmolb.2016.00030. eCollection 2016.

DBNorm: normalizing high-density oligonucleotide microarray data based on distributions.DBNorm：基于分布对高密度寡核苷酸微阵列数据进行归一化处理。

BMC Bioinformatics. 2017 Nov 29;18(1):527. doi: 10.1186/s12859-017-1912-5.

statTarget: A streamlined tool for signal drift correction and interpretations of quantitative mass spectrometry-based omics data.statTarget：一种用于信号漂移校正和基于定量质谱组学数据解释的简化工具。

Anal Chim Acta. 2018 Dec 7;1036:66-72. doi: 10.1016/j.aca.2018.08.002. Epub 2018 Aug 6.

Normalizing and Correcting Variable and Complex LC-MS Metabolomic Data with the R Package pseudoDrift.使用R包pseudoDrift对可变且复杂的液相色谱-质谱代谢组学数据进行归一化和校正

Metabolites. 2022 May 12;12(5):435. doi: 10.3390/metabo12050435.

LargeMetabo: an out-of-the-box tool for processing and analyzing large-scale metabolomic data.LargeMetabo：一款用于处理和分析大规模代谢组学数据的即用型工具。

Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac455.

引用本文的文献

Serum levels of C-terminal peptides of alpha-1 antitrypsin as potential biomarkers in non-small cell lung cancer.α-1抗胰蛋白酶C末端肽的血清水平作为非小细胞肺癌的潜在生物标志物

Transl Lung Cancer Res. 2025 Jun 30;14(6):2113-2124. doi: 10.21037/tlcr-2025-178. Epub 2025 Jun 23.

Multiomic analysis of familial adenomatous polyposis reveals molecular pathways associated with early tumorigenesis.家族性腺瘤性息肉病的多组学分析揭示了与早期肿瘤发生相关的分子途径。

Nat Cancer. 2024 Nov;5(11):1737-1753. doi: 10.1038/s43018-024-00831-z. Epub 2024 Oct 30.

Association of Ultraprocessed Foods Intake with Untargeted Metabolomics Profiles in Adolescents and Young Adults in the DONALD Cohort Study.超加工食品摄入与 DONALD 队列研究中青少年和年轻成年人非靶向代谢组学特征的关联。

J Nutr. 2024 Nov;154(11):3255-3265. doi: 10.1016/j.tjnut.2024.09.023. Epub 2024 Sep 25.

Metabolomics signatures of sweetened beverages and added sugar are related to anthropometric measures of adiposity in young individuals: results from a cohort study.甜味饮料和添加糖的代谢组学特征与年轻人肥胖的人体测量指标有关：一项队列研究的结果。

Am J Clin Nutr. 2024 Oct;120(4):879-890. doi: 10.1016/j.ajcnut.2024.07.021. Epub 2024 Jul 24.

A phase IIb randomized placebo-controlled trial testing the effect of MAG-EPA long-chain omega-3 fatty acid dietary supplement on prostate cancer proliferation.一项IIb期随机安慰剂对照试验，测试MAG-EPA长链omega-3脂肪酸膳食补充剂对前列腺癌增殖的影响。

Commun Med (Lond). 2024 Mar 22;4(1):56. doi: 10.1038/s43856-024-00456-4.

High-Resolution Mass Spectrometry-Based Metabolomics for Increased Grape Juice Metabolite Coverage.基于高分辨率质谱的代谢组学用于增加葡萄汁代谢物覆盖范围

Foods. 2023 Dec 22;13(1):54. doi: 10.3390/foods13010054.

Metabolic diversity of human macrophages: potential influence on intracellular survival.人类巨噬细胞的代谢多样性：对细胞内生存的潜在影响。

Infect Immun. 2024 Feb 13;92(2):e0047423. doi: 10.1128/iai.00474-23. Epub 2024 Jan 5.

Dynamic lipidome alterations associated with human health, disease and ageing.与人类健康、疾病和衰老相关的动态脂质组变化。

Nat Metab. 2023 Sep;5(9):1578-1594. doi: 10.1038/s42255-023-00880-1. Epub 2023 Sep 11.

Biofilms on Indwelling Artificial Urinary Sphincter Devices Harbor Complex Microbe-Metabolite Interaction Networks and Reconstitute Differentially In Vitro by Material Type.留置人工尿道括约肌装置上的生物膜具有复杂的微生物-代谢物相互作用网络，并根据材料类型在体外进行不同程度的重构。

Biomedicines. 2023 Jan 14;11(1):215. doi: 10.3390/biomedicines11010215.

Multi-omics microsampling for the profiling of lifestyle-associated changes in health.多组学生物标志物微采样分析与生活方式相关的健康变化特征。

Nat Biomed Eng. 2024 Jan;8(1):11-29. doi: 10.1038/s41551-022-00999-8. Epub 2023 Jan 19.

本文引用的文献

Evaluating and minimizing batch effects in metabolomics.评估和最小化代谢组学中的批次效应。

Mass Spectrom Rev. 2022 May;41(3):421-442. doi: 10.1002/mas.21672. Epub 2020 Nov 25.

Anti-adipogenic signals at the onset of obesity-related inflammation in white adipose tissue.肥胖相关炎症起始时白色脂肪组织中的抗脂肪生成信号。

Cell Mol Life Sci. 2021 Jan;78(1):227-247. doi: 10.1007/s00018-020-03485-z. Epub 2020 Mar 11.

WaveICA: A novel algorithm to remove batch effects for large-scale untargeted metabolomics data based on wavelet analysis.WaveICA：一种基于小波分析的新型算法，用于去除大规模无靶向代谢组学数据中的批次效应。

Anal Chim Acta. 2019 Jul 11;1061:60-69. doi: 10.1016/j.aca.2019.02.010. Epub 2019 Feb 19.

A large-scale metabolomics study to harness chemical diversity and explore biochemical mechanisms in ryegrass.一项大规模代谢组学研究，旨在利用黑麦草的化学多样性并探索其生化机制。

Commun Biol. 2019 Mar 4;2:87. doi: 10.1038/s42003-019-0289-6. eCollection 2019.

NormalizeMets: assessing, selecting and implementing statistical methods for normalizing metabolomics data.NormalizeMets：评估、选择和实施代谢组学数据标准化的统计方法。

Metabolomics. 2018 Mar 20;14(5):54. doi: 10.1007/s11306-018-1347-7.

Systematic Error Removal Using Random Forest for Normalizing Large-Scale Untargeted Lipidomics Data.使用随机森林消除系统误差以实现大规模非靶向脂质组学数据的标准化。

Anal Chem. 2019 Mar 5;91(5):3590-3596. doi: 10.1021/acs.analchem.8b05592. Epub 2019 Feb 19.

Anal Chim Acta. 2018 Dec 7;1036:66-72. doi: 10.1016/j.aca.2018.08.002. Epub 2018 Aug 6.

Separation of blood microsamples by exploiting sedimentation at the microscale.利用微尺度沉降分离血液微样本。

Sci Rep. 2018 Sep 20;8(1):14101. doi: 10.1038/s41598-018-32314-4.

Metabolomics as a Tool to Understand Pathophysiological Processes.代谢组学作为理解病理生理过程的一种工具。

Methods Mol Biol. 2018;1730:3-28. doi: 10.1007/978-1-4939-7592-1_1.

Proteome and Metabolome of Subretinal Fluid in Central Serous Chorioretinopathy and Rhegmatogenous Retinal Detachment: A Pilot Case Study.中心性浆液性脉络膜视网膜病变和孔源性视网膜脱离患者视网膜下液的蛋白质组学和代谢组学：一项初步病例研究

Transl Vis Sci Technol. 2018 Jan 18;7(1):3. doi: 10.1167/tvst.7.1.3. eCollection 2018 Jan.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

DBnorm作为一个R包，用于代谢组学研究中批次效应校正的合适统计方法的比较与选择。

DBnorm as an R package for the comparison and selection of appropriate statistical methods for batch effect correction in metabolomic studies.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献