• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高分辨率气相色谱-质谱联用(HR GC-MS)代谢组学数据插补策略的优化

Optimization of Imputation Strategies for High-Resolution Gas Chromatography-Mass Spectrometry (HR GC-MS) Metabolomics Data.

作者信息

Ampong Isaac, Zimmerman Kip D, Nathanielsz Peter W, Cox Laura A, Olivier Michael

机构信息

Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest University, Winston-Salem, NC 27157, USA.

Center for the Study of Fetal Programming, University of Wyoming, Laramie, WY 82071, USA.

出版信息

Metabolites. 2022 May 11;12(5):429. doi: 10.3390/metabo12050429.

DOI:10.3390/metabo12050429
PMID:35629933
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9144635/
Abstract

Gas chromatography-coupled mass spectrometry (GC-MS) has been used in biomedical research to analyze volatile, non-polar, and polar metabolites in a wide array of sample types. Despite advances in technology, missing values are still common in metabolomics datasets and must be properly handled. We evaluated the performance of ten commonly used missing value imputation methods with metabolites analyzed on an HR GC-MS instrument. By introducing missing values into the complete (i.e., data without any missing values) National Institute of Standards and Technology (NIST) plasma dataset, we demonstrate that random forest (RF), glmnet ridge regression (GRR), and Bayesian principal component analysis (BPCA) shared the lowest root mean squared error (RMSE) in technical replicate data. Further examination of these three methods in data from baboon plasma and liver samples demonstrated they all maintained high accuracy. Overall, our analysis suggests that any of the three imputation methods can be applied effectively to untargeted metabolomics datasets with high accuracy. However, it is important to note that imputation will alter the correlation structure of the dataset and bias downstream regression coefficients and -values.

摘要

气相色谱-质谱联用(GC-MS)已用于生物医学研究,以分析多种样本类型中的挥发性、非极性和极性代谢物。尽管技术有所进步,但代谢组学数据集中缺失值仍然常见,必须妥善处理。我们评估了十种常用的缺失值插补方法在高分辨率气相色谱-质谱仪上分析代谢物时的性能。通过将缺失值引入完整的(即无任何缺失值的)美国国家标准与技术研究院(NIST)血浆数据集,我们证明随机森林(RF)、广义线性模型套索回归(GRR)和贝叶斯主成分分析(BPCA)在技术重复数据中具有最低的均方根误差(RMSE)。对狒狒血浆和肝脏样本数据中这三种方法进一步研究表明,它们都保持了较高的准确性。总体而言,我们的分析表明,这三种插补方法中的任何一种都可以有效地高精度应用于非靶向代谢组学数据集。然而,需要注意的是,插补会改变数据集的相关结构,并使下游回归系数和p值产生偏差。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5af/9144635/fc5b515022ac/metabolites-12-00429-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5af/9144635/fc5b515022ac/metabolites-12-00429-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5af/9144635/fc5b515022ac/metabolites-12-00429-g003.jpg

相似文献

1
Optimization of Imputation Strategies for High-Resolution Gas Chromatography-Mass Spectrometry (HR GC-MS) Metabolomics Data.高分辨率气相色谱-质谱联用(HR GC-MS)代谢组学数据插补策略的优化
Metabolites. 2022 May 11;12(5):429. doi: 10.3390/metabo12050429.
2
Random forest-based imputation outperforms other methods for imputing LC-MS metabolomics data: a comparative study.基于随机森林的插补方法在 LC-MS 代谢组学数据插补方面优于其他方法:一项比较研究。
BMC Bioinformatics. 2019 Oct 11;20(1):492. doi: 10.1186/s12859-019-3110-0.
3
Missing Value Imputation Approach for Mass Spectrometry-based Metabolomics Data.基于质谱的代谢组学数据的缺失值插补方法。
Sci Rep. 2018 Jan 12;8(1):663. doi: 10.1038/s41598-017-19120-0.
4
Addressing Missing Data in GC × GC Metabolomics: Identifying Missingness Type and Evaluating the Impact of Imputation Methods on Experimental Replication.解决 GC × GC 代谢组学中的缺失数据问题:确定缺失类型,并评估插补方法对实验重复的影响。
Anal Chem. 2022 Aug 9;94(31):10912-10920. doi: 10.1021/acs.analchem.1c04093. Epub 2022 Jul 26.
5
Characterization of missing values in untargeted MS-based metabolomics data and evaluation of missing data handling strategies.基于非靶向 MS 的代谢组学数据中缺失值的特征描述及缺失数据处理策略的评价。
Metabolomics. 2018 Sep 20;14(10):128. doi: 10.1007/s11306-018-1420-2.
6
[Development of a widely-targeted metabolomics method based on gas chromatography-mass spectrometry].基于气相色谱-质谱联用技术的广泛靶向代谢组学方法的开发
Se Pu. 2023 Jun 8;41(6):520-526. doi: 10.3724/SP.J.1123.2022.10003.
7
High-resolution gas chromatography/mass spectrometry metabolomics of non-human primate serum.非人灵长类动物血清的高分辨率气相色谱/质谱代谢组学
Rapid Commun Mass Spectrom. 2018 Sep 15;32(17):1497-1506. doi: 10.1002/rcm.8197.
8
NMF-Based Approach for Missing Values Imputation of Mass Spectrometry Metabolomics Data.基于 NMF 的质谱代谢组学数据缺失值插补方法。
Molecules. 2021 Sep 24;26(19):5787. doi: 10.3390/molecules26195787.
9
Mechanism-aware imputation: a two-step approach in handling missing values in metabolomics.基于机制的插补:代谢组学中处理缺失值的两步法。
BMC Bioinformatics. 2022 May 16;23(1):179. doi: 10.1186/s12859-022-04659-1.
10
Advances in high resolution GC-MS technology: a focus on the application of GC-Orbitrap-MS in metabolomics and exposomics for FAIR practices.高分辨率 GC-MS 技术的进展:聚焦 GC-Orbitrap-MS 在代谢组学和暴露组学中的应用,以实现 FAIR 实践。
Anal Methods. 2021 May 27;13(20):2265-2282. doi: 10.1039/d1ay00173f.

引用本文的文献

1
Conducting Polymers-Based Gas Sensors: Principles, Materials, and Applications.基于导电聚合物的气体传感器:原理、材料及应用
Sensors (Basel). 2025 Apr 25;25(9):2724. doi: 10.3390/s25092724.
2
Integrated multi-omics analysis of brain aging in female nonhuman primates reveals altered signaling pathways relevant to age-related disorders.对雌性非人类灵长类动物大脑衰老的综合多组学分析揭示了与年龄相关疾病相关的信号通路改变。
Neurobiol Aging. 2023 Dec;132:109-119. doi: 10.1016/j.neurobiolaging.2023.08.009. Epub 2023 Aug 29.
3
Maternal obesity alters offspring liver and skeletal muscle metabolism in early post-puberty despite maintaining a normal post-weaning dietary lifestyle.

本文引用的文献

1
The Application of Metabolomics in Forensic Science with Focus on Forensic Toxicology and Time-of-Death Estimation.代谢组学在法医学中的应用,重点关注法医毒理学和死亡时间推断
Metabolites. 2021 Nov 26;11(12):801. doi: 10.3390/metabo11120801.
2
Binary Simplification as an Effective Tool in Metabolomics Data Analysis.二元简化作为代谢组学数据分析的有效工具
Metabolites. 2021 Nov 18;11(11):788. doi: 10.3390/metabo11110788.
3
A survey on missing data in machine learning.关于机器学习中缺失数据的一项调查。
母体肥胖会改变后代青春期后早期的肝脏和骨骼肌代谢,尽管其在断奶后仍保持正常的饮食生活方式。
FASEB J. 2022 Dec;36(12):e22644. doi: 10.1096/fj.202201473R.
4
Imputation of Missing Values for Multi-Biospecimen Metabolomics Studies: Bias and Effects on Statistical Validity.多生物样本代谢组学研究中缺失值的插补:偏差及其对统计有效性的影响
Metabolites. 2022 Jul 21;12(7):671. doi: 10.3390/metabo12070671.
J Big Data. 2021;8(1):140. doi: 10.1186/s40537-021-00516-9. Epub 2021 Oct 27.
4
Metabolomics unveils the influence of dietary phytochemicals on residual pesticide concentrations in honey bees.代谢组学揭示了膳食植物化学物质对蜂蜜中残留农药浓度的影响。
Environ Int. 2021 Jul;152:106503. doi: 10.1016/j.envint.2021.106503. Epub 2021 Mar 20.
5
A Workflow for Missing Values Imputation of Untargeted Metabolomics Data.非靶向代谢组学数据缺失值插补的工作流程
Metabolites. 2020 Nov 26;10(12):486. doi: 10.3390/metabo10120486.
6
Accuracy of random-forest-based imputation of missing data in the presence of non-normality, non-linearity, and interaction.基于随机森林的缺失数据插补在非正态性、非线性和交互作用存在下的准确性。
BMC Med Res Methodol. 2020 Jul 25;20(1):199. doi: 10.1186/s12874-020-01080-1.
7
High Resolution GC-Orbitrap-MS Metabolomics Using Both Electron Ionization and Chemical Ionization for Analysis of Human Plasma.使用电子电离和化学电离的高分辨率气相色谱-轨道阱质谱代谢组学分析人血浆
J Proteome Res. 2020 Jul 2;19(7):2717-2731. doi: 10.1021/acs.jproteome.9b00774. Epub 2020 Feb 10.
8
BayesMetab: treatment of missing values in metabolomic studies using a Bayesian modeling approach.BayesMetab:基于贝叶斯建模方法处理代谢组学研究中的缺失值。
BMC Bioinformatics. 2019 Dec 20;20(Suppl 24):673. doi: 10.1186/s12859-019-3250-2.
9
Analytical techniques for metabolomic studies: a review.代谢组学研究的分析技术:综述
Bioanalysis. 2019 Dec;11(24):2297-2318. doi: 10.4155/bio-2019-0014.
10
Random forest-based imputation outperforms other methods for imputing LC-MS metabolomics data: a comparative study.基于随机森林的插补方法在 LC-MS 代谢组学数据插补方面优于其他方法:一项比较研究。
BMC Bioinformatics. 2019 Oct 11;20(1):492. doi: 10.1186/s12859-019-3110-0.