• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类代谢组学数据特征选择的统计工作流程

Statistical Workflow for Feature Selection in Human Metabolomics Data.

作者信息

Antonelli Joseph, Claggett Brian L, Henglin Mir, Kim Andy, Ovsak Gavin, Kim Nicole, Deng Katherine, Rao Kevin, Tyagi Octavia, Watrous Jeramie D, Lagerborg Kim A, Hushcha Pavel V, Demler Olga V, Mora Samia, Niiranen Teemu J, Pereira Alexandre C, Jain Mohit, Cheng Susan

机构信息

Department of Statistics, University of Florida, Gainesville, FL 32611, USA.

Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA.

出版信息

Metabolites. 2019 Jul 12;9(7):143. doi: 10.3390/metabo9070143.

DOI:10.3390/metabo9070143
PMID:31336989
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6680705/
Abstract

High-throughput metabolomics investigations, when conducted in large human cohorts, represent a potentially powerful tool for elucidating the biochemical diversity underlying human health and disease. Large-scale metabolomics data sources, generated using either targeted or nontargeted platforms, are becoming more common. Appropriate statistical analysis of these complex high-dimensional data will be critical for extracting meaningful results from such large-scale human metabolomics studies. Therefore, we consider the statistical analytical approaches that have been employed in prior human metabolomics studies. Based on the lessons learned and collective experience to date in the field, we offer a step-by-step framework for pursuing statistical analyses of cohort-based human metabolomics data, with a focus on feature selection. We discuss the range of options and approaches that may be employed at each stage of data management, analysis, and interpretation and offer guidance on the analytical decisions that need to be considered over the course of implementing a data analysis workflow. Certain pervasive analytical challenges facing the field warrant ongoing focused research. Addressing these challenges, particularly those related to analyzing human metabolomics data, will allow for more standardization of as well as advances in how research in the field is practiced. In turn, such major analytical advances will lead to substantial improvements in the overall contributions of human metabolomics investigations.

摘要

高通量代谢组学研究在大规模人类队列中进行时,是阐明人类健康和疾病背后生化多样性的一种潜在有力工具。使用靶向或非靶向平台生成的大规模代谢组学数据源正变得越来越普遍。对这些复杂的高维数据进行适当的统计分析,对于从此类大规模人类代谢组学研究中提取有意义的结果至关重要。因此,我们考虑了先前人类代谢组学研究中采用的统计分析方法。基于该领域迄今吸取的经验教训和集体经验,我们提供了一个逐步框架,用于对基于队列的人类代谢组学数据进行统计分析,重点是特征选择。我们讨论了在数据管理、分析和解释的每个阶段可能采用的选项和方法范围,并就实施数据分析工作流程过程中需要考虑的分析决策提供指导。该领域面临的某些普遍分析挑战需要持续的重点研究。应对这些挑战,尤其是与分析人类代谢组学数据相关的挑战,将使该领域的研究实践更加标准化,并取得进展。反过来,这种重大的分析进展将导致人类代谢组学研究的总体贡献大幅提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c4/6680705/ef35ce48ec33/metabolites-09-00143-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c4/6680705/a819fa8e3f87/metabolites-09-00143-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c4/6680705/3ee823cdb852/metabolites-09-00143-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c4/6680705/ef35ce48ec33/metabolites-09-00143-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c4/6680705/a819fa8e3f87/metabolites-09-00143-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c4/6680705/3ee823cdb852/metabolites-09-00143-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c4/6680705/ef35ce48ec33/metabolites-09-00143-g003.jpg

相似文献

1
Statistical Workflow for Feature Selection in Human Metabolomics Data.人类代谢组学数据特征选择的统计工作流程
Metabolites. 2019 Jul 12;9(7):143. doi: 10.3390/metabo9070143.
2
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
3
Metabolomics Analytics Workflow for Epidemiological Research: Perspectives from the Consortium of Metabolomics Studies (COMETS).用于流行病学研究的代谢组学分析工作流程:代谢组学研究联盟(COMETS)的观点
Metabolites. 2019 Jul 17;9(7):145. doi: 10.3390/metabo9070145.
4
Statistical methods and resources for biomarker discovery using metabolomics.基于代谢组学的生物标志物发现的统计方法和资源。
BMC Bioinformatics. 2023 Jun 15;24(1):250. doi: 10.1186/s12859-023-05383-0.
5
Chronic Kidney Disease Cohort Studies: A Guide to Metabolome Analyses.慢性肾脏病队列研究:代谢组学分析指南
Metabolites. 2021 Jul 16;11(7):460. doi: 10.3390/metabo11070460.
6
Feature Selection Methods for Early Predictive Biomarker Discovery Using Untargeted Metabolomic Data.基于非靶向代谢组学数据的早期预测生物标志物发现的特征选择方法。
Front Mol Biosci. 2016 Jul 8;3:30. doi: 10.3389/fmolb.2016.00030. eCollection 2016.
7
Quantitative Comparison of Statistical Methods for Analyzing Human Metabolomics Data.分析人类代谢组学数据的统计方法的定量比较
Metabolites. 2022 Jun 4;12(6):519. doi: 10.3390/metabo12060519.
8
MRS-based Metabolomics in Cancer Research.基于磁共振波谱的代谢组学在癌症研究中的应用
Magn Reson Insights. 2014 Feb 13;7:1-14. doi: 10.4137/MRI.S13755. eCollection 2014.
9
Tools and resources for metabolomics research community: A 2017-2018 update.代谢组学研究社区的工具和资源:2017-2018 年更新。
Electrophoresis. 2019 Jan;40(2):227-246. doi: 10.1002/elps.201800428. Epub 2018 Nov 30.
10
Metabolomics in clinical and forensic toxicology, sports anti-doping and veterinary residues.临床与法医毒理学、运动反兴奋剂及兽药残留中的代谢组学。
Drug Test Anal. 2022 May;14(5):794-807. doi: 10.1002/dta.3245. Epub 2022 Mar 8.

引用本文的文献

1
Integrating multi-omics and machine learning for disease resistance prediction in legumes.整合多组学和机器学习用于豆类抗病性预测
Theor Appl Genet. 2025 Jun 27;138(7):163. doi: 10.1007/s00122-025-04948-2.
2
Stimulant Use, HIV, and Plasma Metabolites Among Men.男性中的兴奋剂使用、艾滋病毒与血浆代谢物
J Neuroimmune Pharmacol. 2025 Jun 19;20(1):68. doi: 10.1007/s11481-025-10223-4.
3
On the analysis of metabolite quantitative trait loci: Impact of different data transformations and study designs.代谢物数量性状位点分析:不同数据转换和研究设计的影响

本文引用的文献

1
Directed Non-targeted Mass Spectrometry and Chemical Networking for Discovery of Eicosanoids and Related Oxylipins.靶向非靶向质谱和化学网络用于发现类二十烷酸和相关的氧化脂类。
Cell Chem Biol. 2019 Mar 21;26(3):433-442.e4. doi: 10.1016/j.chembiol.2018.11.015. Epub 2019 Jan 17.
2
The Differential Effects of Eicosapentaenoic Acid and Docosahexaenoic Acid on Cardiometabolic Risk Factors: A Systematic Review.二十碳五烯酸和二十二碳六烯酸对代谢相关心血管风险因素的差异化影响:系统综述。
Int J Mol Sci. 2018 Feb 9;19(2):532. doi: 10.3390/ijms19020532.
3
Missing Value Imputation Approach for Mass Spectrometry-based Metabolomics Data.
Sci Adv. 2025 Apr 11;11(15):eadp4532. doi: 10.1126/sciadv.adp4532.
4
A Metabolomics Approach to Identify Metabolites Associated With Mortality in Patients Receiving Maintenance Hemodialysis.一种代谢组学方法用于识别维持性血液透析患者中与死亡率相关的代谢物。
Kidney Int Rep. 2024 Jun 29;9(9):2718-2726. doi: 10.1016/j.ekir.2024.06.039. eCollection 2024 Sep.
5
Analysis of plasma metabolomes from 11 309 subjects in five population-based cohorts.分析来自五个基于人群的队列中 11309 名个体的血浆代谢组学数据。
Sci Rep. 2024 Apr 18;14(1):8933. doi: 10.1038/s41598-024-59388-7.
6
Metabolites Associated With Uremic Symptoms in Patients With CKD: Findings From the Chronic Renal Insufficiency Cohort (CRIC) Study.与慢性肾脏病患者尿毒症症状相关的代谢物:来自慢性肾功能不全队列(CRIC)研究的结果。
Am J Kidney Dis. 2024 Jul;84(1):49-61.e1. doi: 10.1053/j.ajkd.2023.11.013. Epub 2024 Jan 23.
7
A new statistical workflow (R-packages based) to investigate associations between one variable of interest and the metabolome.一种新的统计工作流程(基于 R 包),用于研究感兴趣的一个变量与代谢组之间的关联。
Metabolomics. 2023 Nov 30;20(1):2. doi: 10.1007/s11306-023-02065-z.
8
Applications of multi-omics analysis in human diseases.多组学分析在人类疾病中的应用。
MedComm (2020). 2023 Jul 31;4(4):e315. doi: 10.1002/mco2.315. eCollection 2023 Aug.
9
Deriving Convergent and Divergent Metabolomic Correlates of Pulmonary Arterial Hypertension.推导肺动脉高压的趋同和趋异代谢组学关联
Metabolites. 2023 Jun 28;13(7):802. doi: 10.3390/metabo13070802.
10
Statistical methods and resources for biomarker discovery using metabolomics.基于代谢组学的生物标志物发现的统计方法和资源。
BMC Bioinformatics. 2023 Jun 15;24(1):250. doi: 10.1186/s12859-023-05383-0.
基于质谱的代谢组学数据的缺失值插补方法。
Sci Rep. 2018 Jan 12;8(1):663. doi: 10.1038/s41598-017-19120-0.
4
Metabolomics biomarkers to predict acamprosate treatment response in alcohol-dependent subjects.代谢组学生物标志物预测酒精依赖患者对安非他酮治疗的反应。
Sci Rep. 2017 May 31;7(1):2496. doi: 10.1038/s41598-017-02442-4.
5
Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data.非靶向气相色谱/质谱代谢组学数据的混合模型归一化
BMC Bioinformatics. 2017 Feb 2;18(1):84. doi: 10.1186/s12859-017-1501-7.
6
Large-scale untargeted LC-MS metabolomics data correction using between-batch feature alignment and cluster-based within-batch signal intensity drift correction.使用批次间特征比对和基于聚类的批次内信号强度漂移校正对大规模非靶向液相色谱-质谱代谢组学数据进行校正。
Metabolomics. 2016;12(11):173. doi: 10.1007/s11306-016-1124-4. Epub 2016 Sep 22.
7
Using MetaboAnalyst 3.0 for Comprehensive Metabolomics Data Analysis.使用MetaboAnalyst 3.0进行综合代谢组学数据分析。
Curr Protoc Bioinformatics. 2016 Sep 7;55:14.10.1-14.10.91. doi: 10.1002/cpbi.11.
8
Metabolic Networks and Metabolites Underlie Associations Between Maternal Glucose During Pregnancy and Newborn Size at Birth.代谢网络和代谢物是孕期母体血糖与出生时新生儿大小之间关联的基础。
Diabetes. 2016 Jul;65(7):2039-50. doi: 10.2337/db15-1748. Epub 2016 Apr 5.
9
Improved batch correction in untargeted MS-based metabolomics.非靶向质谱代谢组学中改进的批次校正
Metabolomics. 2016;12:88. doi: 10.1007/s11306-016-1015-8. Epub 2016 Mar 18.
10
A batch correction method for liquid chromatography-mass spectrometry data that does not depend on quality control samples.一种不依赖质量控制样品的液相色谱-质谱数据批校正方法。
Metabolomics. 2016;12(3):56. doi: 10.1007/s11306-016-0972-2. Epub 2016 Feb 18.