• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于质谱的代谢组学分析中的统计学与机器学习

Statistics and Machine Learning in Mass Spectrometry-Based Metabolomics Analysis.

作者信息

Fan Sili, Wilson Christopher M, Fridley Brooke L, Li Qian

机构信息

Graduate Group of Biostatistics, University of California, Davis, CA, USA.

Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA.

出版信息

Methods Mol Biol. 2023;2629:247-269. doi: 10.1007/978-1-0716-2986-4_12.

DOI:10.1007/978-1-0716-2986-4_12
PMID:36929081
Abstract

In this chapter, we review the cutting-edge statistical and machine learning methods for missing value imputation, normalization, and downstream analyses in mass spectrometry metabolomics studies, with illustration by example datasets. The missing peak recovery includes simple imputation by zero or limit of detection, regression-based or distribution-based imputation, and prediction by random forest. The batch effect can be removed by data-driven methods, internal standard-based, and quality control sample-based normalization. We also summarize different types of statistical analysis for metabolomics and clinical outcomes, such as inference on metabolic biomarkers, clustering of metabolomic profiles, metabolite module building, and integrative analysis with transcriptome.

摘要

在本章中,我们回顾了质谱代谢组学研究中用于缺失值插补、归一化及下游分析的前沿统计和机器学习方法,并通过示例数据集进行说明。缺失峰恢复方法包括以零值或检测限进行简单插补、基于回归或基于分布的插补以及随机森林预测。批次效应可通过数据驱动方法、基于内标的归一化和基于质量控制样本的归一化来消除。我们还总结了代谢组学与临床结果的不同类型统计分析,例如代谢生物标志物推断、代谢组学谱聚类、代谢物模块构建以及与转录组的整合分析。

相似文献

1
Statistics and Machine Learning in Mass Spectrometry-Based Metabolomics Analysis.基于质谱的代谢组学分析中的统计学与机器学习
Methods Mol Biol. 2023;2629:247-269. doi: 10.1007/978-1-0716-2986-4_12.
2
Missing Value Imputation Approach for Mass Spectrometry-based Metabolomics Data.基于质谱的代谢组学数据的缺失值插补方法。
Sci Rep. 2018 Jan 12;8(1):663. doi: 10.1038/s41598-017-19120-0.
3
Mechanism-aware imputation: a two-step approach in handling missing values in metabolomics.基于机制的插补:代谢组学中处理缺失值的两步法。
BMC Bioinformatics. 2022 May 16;23(1):179. doi: 10.1186/s12859-022-04659-1.
4
Random forest-based imputation outperforms other methods for imputing LC-MS metabolomics data: a comparative study.基于随机森林的插补方法在 LC-MS 代谢组学数据插补方面优于其他方法:一项比较研究。
BMC Bioinformatics. 2019 Oct 11;20(1):492. doi: 10.1186/s12859-019-3110-0.
5
NMF-Based Approach for Missing Values Imputation of Mass Spectrometry Metabolomics Data.基于 NMF 的质谱代谢组学数据缺失值插补方法。
Molecules. 2021 Sep 24;26(19):5787. doi: 10.3390/molecules26195787.
6
metaX: a flexible and comprehensive software for processing metabolomics data.MetaX:一款用于处理代谢组学数据的灵活且全面的软件。
BMC Bioinformatics. 2017 Mar 21;18(1):183. doi: 10.1186/s12859-017-1579-y.
7
Comparison of imputation and imputation-free methods for statistical analysis of mass spectrometry data with missing data.比较缺失数据下质谱数据分析的插补和非插补方法。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab353.
8
Machine Learning Applications for Mass Spectrometry-Based Metabolomics.基于质谱的代谢组学的机器学习应用
Metabolites. 2020 Jun 13;10(6):243. doi: 10.3390/metabo10060243.
9
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
10
Evaluating Machine Learning Methods of Analyzing Multiclass Metabolomics.评估分析多类代谢组学的机器学习方法。
J Chem Inf Model. 2023 Dec 25;63(24):7628-7641. doi: 10.1021/acs.jcim.3c01525. Epub 2023 Dec 11.

引用本文的文献

1
Machine Learning-Driven Insights in Cancer Metabolomics: From Subtyping to Biomarker Discovery and Prognostic Modeling.机器学习驱动的癌症代谢组学见解:从亚型分类到生物标志物发现与预后建模
Metabolites. 2025 Aug 1;15(8):514. doi: 10.3390/metabo15080514.

本文引用的文献

1
Ontogeny Related Changes in the Pediatric Liver Metabolome.儿童肝脏代谢组中与个体发育相关的变化
Front Pediatr. 2020 Sep 29;8:549. doi: 10.3389/fped.2020.00549. eCollection 2020.
2
Plasma Metabolome and Circulating Vitamins Stratified Onset Age of an Initial Islet Autoantibody and Progression to Type 1 Diabetes: The TEDDY Study.血浆代谢组学和循环维生素根据胰岛自身抗体初发年龄分层与 1 型糖尿病的进展:TEDDY 研究。
Diabetes. 2021 Jan;70(1):282-292. doi: 10.2337/db20-0696. Epub 2020 Oct 26.
3
GMSimpute: a generalized two-step Lasso approach to impute missing values in label-free mass spectrum analysis.
GMSimpute:一种用于在无标记质谱分析中插补缺失值的广义两步套索方法。
Bioinformatics. 2020 Jan 1;36(1):257-263. doi: 10.1093/bioinformatics/btz488.
4
Systematic Error Removal Using Random Forest for Normalizing Large-Scale Untargeted Lipidomics Data.使用随机森林消除系统误差以实现大规模非靶向脂质组学数据的标准化。
Anal Chem. 2019 Mar 5;91(5):3590-3596. doi: 10.1021/acs.analchem.8b05592. Epub 2019 Feb 19.
5
Ontogeny-related pharmacogene changes in the pediatric liver transcriptome.儿科肝脏转录组中与个体发育相关的药物基因变化。
Pharmacogenet Genomics. 2018 Mar;28(3):86-94. doi: 10.1097/FPC.0000000000000326.
6
Best-Matched Internal Standard Normalization in Liquid Chromatography-Mass Spectrometry Metabolomics Applied to Environmental Samples.最佳匹配内标归一化在环境样品液质联用代谢组学中的应用。
Anal Chem. 2018 Jan 16;90(2):1363-1369. doi: 10.1021/acs.analchem.7b04400. Epub 2018 Jan 3.
7
NOREVA: normalization and evaluation of MS-based metabolomics data.NOREVA:基于 MS 的代谢组学数据的归一化和评估。
Nucleic Acids Res. 2017 Jul 3;45(W1):W162-W170. doi: 10.1093/nar/gkx449.
8
Metabolomic Profiling of 13 Diatom Cultures and Their Adaptation to Nitrate-Limited Growth Conditions.13种硅藻培养物的代谢组学分析及其对硝酸盐限制生长条件的适应性
PLoS One. 2015 Oct 6;10(10):e0138965. doi: 10.1371/journal.pone.0138965. eCollection 2015.
9
Phospholipids and insulin resistance in psychosis: a lipidomics study of twin pairs discordant for schizophrenia.精神分裂症中磷脂与胰岛素抵抗:精神分裂症双生子对磷脂组学研究。
Genome Med. 2012 Jan 18;4(1):1. doi: 10.1186/gm300.
10
Streamlined pentafluorophenylpropyl column liquid chromatography-tandem quadrupole mass spectrometry and global (13)C-labeled internal standards improve performance for quantitative metabolomics in bacteria.简化的五氟苯基丙基柱液相色谱-串联四极杆质谱联用和全(13)C 标记内标法提高了细菌定量代谢组学的性能。
J Chromatogr A. 2010 Nov 19;1217(47):7401-10. doi: 10.1016/j.chroma.2010.09.055. Epub 2010 Sep 29.