• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

解决基于质谱的代谢组学中的大数据挑战。

Addressing big data challenges in mass spectrometry-based metabolomics.

机构信息

Department of Chemistry, University of British Columbia, 2036 Main Mall, Vancouver, BC Canada, V6T 1Z1, Canada.

出版信息

Chem Commun (Camb). 2022 Sep 8;58(72):9979-9990. doi: 10.1039/d2cc03598g.

DOI:10.1039/d2cc03598g
PMID:35997016
Abstract

Advancements in computer science and software engineering have greatly facilitated mass spectrometry (MS)-based untargeted metabolomics. Nowadays, gigabytes of metabolomics data are routinely generated from MS platforms, containing condensed structural and quantitative information from thousands of metabolites. Manual data processing is almost impossible due to the large data size. Therefore, in the "omics" era, we are faced with new challenges, the big data challenges of how to accurately and efficiently process the raw data, extract the biological information, and visualize the results from the gigantic amount of collected data. Although important, proposing solutions to address these big data challenges requires broad interdisciplinary knowledge, which can be challenging for many metabolomics practitioners. Our laboratory in the Department of Chemistry at the University of British Columbia is committed to combining analytical chemistry, computer science, and statistics to develop bioinformatics tools that address these big data challenges. In this Feature Article, we elaborate on the major big data challenges in metabolomics, including data acquisition, feature extraction, quantitative measurements, statistical analysis, and metabolite annotation. We also introduce our recently developed bioinformatics solutions for these challenges. Notably, all of the bioinformatics tools and source codes are freely available on GitHub (https://www.github.com/HuanLab), along with revised and regularly updated content.

摘要

计算机科学和软件工程的进步极大地促进了基于质谱(MS)的非靶向代谢组学。如今,从 MS 平台上通常可以生成千兆字节的代谢组学数据,其中包含来自数千种代谢物的浓缩结构和定量信息。由于数据量庞大,手动数据处理几乎是不可能的。因此,在“组学”时代,我们面临着新的挑战,即如何准确有效地处理原始数据、提取生物信息以及从大量采集的数据中可视化结果的大数据挑战。虽然提出解决这些大数据挑战的解决方案很重要,但这需要广泛的跨学科知识,这对于许多代谢组学从业者来说可能具有挑战性。我们在不列颠哥伦比亚大学化学系的实验室致力于将分析化学、计算机科学和统计学结合起来,开发生物信息学工具来应对这些大数据挑战。在这篇专题文章中,我们详细阐述了代谢组学中的主要大数据挑战,包括数据采集、特征提取、定量测量、统计分析和代谢物注释。我们还介绍了我们最近针对这些挑战开发的生物信息学解决方案。值得注意的是,所有的生物信息学工具和源代码都可以在 GitHub(https://www.github.com/HuanLab)上免费获取,并且内容经过修订和定期更新。

相似文献

1
Addressing big data challenges in mass spectrometry-based metabolomics.解决基于质谱的代谢组学中的大数据挑战。
Chem Commun (Camb). 2022 Sep 8;58(72):9979-9990. doi: 10.1039/d2cc03598g.
2
Data Processing for GC-MS- and LC-MS-Based Untargeted Metabolomics.基于气相色谱-质谱联用和液相色谱-质谱联用的非靶向代谢组学的数据处理
Methods Mol Biol. 2019;1978:287-299. doi: 10.1007/978-1-4939-9236-2_18.
3
MARS: A Multipurpose Software for Untargeted LC-MS-Based Metabolomics and Exposomics.MARS:一种基于非靶向 LC-MS 的代谢组学和暴露组学的多功能软件。
Anal Chem. 2024 Jan 30;96(4):1468-1477. doi: 10.1021/acs.analchem.3c03620. Epub 2024 Jan 18.
4
ChemDistiller: an engine for metabolite annotation in mass spectrometry.ChemDistiller:用于质谱代谢物注释的引擎。
Bioinformatics. 2018 Jun 15;34(12):2096-2102. doi: 10.1093/bioinformatics/bty080.
5
Metandem: An online software tool for mass spectrometry-based isobaric labeling metabolomics.Metandem:基于质谱的同位素质谱标记代谢组学的在线软件工具。
Anal Chim Acta. 2019 Dec 11;1088:99-106. doi: 10.1016/j.aca.2019.08.046. Epub 2019 Aug 21.
6
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
7
Paramounter: Direct Measurement of Universal Parameters To Process Metabolomics Data in a "White Box".至上主义者:在“白盒”中直接测量通用参数以处理代谢组学数据。
Anal Chem. 2022 Mar 15;94(10):4260-4268. doi: 10.1021/acs.analchem.1c04758. Epub 2022 Mar 4.
8
MetCirc: navigating mass spectral similarity in high-resolution MS/MS metabolomics data.MetCirc:在高分辨率 MS/MS 代谢组学数据中导航质谱相似性。
Bioinformatics. 2017 Aug 1;33(15):2419-2420. doi: 10.1093/bioinformatics/btx159.
9
ipaPy2: Integrated Probabilistic Annotation (IPA) 2.0-an improved Bayesian-based method for the annotation of LC-MS/MS untargeted metabolomics data.ipaPy2:集成概率标注(IPA)2.0——一种改进的基于贝叶斯的 LC-MS/MS 非靶向代谢组学数据标注方法。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad455.
10
Combined LC-MS/MS feature grouping, statistical prioritization, and interactive networking in msFeaST.msFeaST 中结合了 LC-MS/MS 特征分组、统计优先级排序和交互式网络。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae584.

引用本文的文献

1
LC-MS-based metabolomics for detecting adulteration in -derived dietary supplements.基于液相色谱-质谱联用技术的代谢组学用于检测植物源膳食补充剂中的掺假情况。
Food Chem X. 2025 Apr 18;27:102476. doi: 10.1016/j.fochx.2025.102476. eCollection 2025 Apr.
2
Untargeted Lipidomic Reveals Potential Biomarkers in Plasma Samples for the Discrimination of Patients Affected by Parkinson's Disease.非靶向脂质组学揭示血浆样本中用于鉴别帕金森病患者的潜在生物标志物。
Molecules. 2025 Feb 12;30(4):850. doi: 10.3390/molecules30040850.
3
Application of machine learning for mass spectrometry-based multi-omics in thyroid diseases.
机器学习在基于质谱的甲状腺疾病多组学中的应用。
Front Mol Biosci. 2024 Dec 17;11:1483326. doi: 10.3389/fmolb.2024.1483326. eCollection 2024.
4
Development of a metabolomic risk score for exposure to traffic-related air pollution: A multi-cohort study.交通相关空气污染暴露的代谢组学风险评分的开发:一项多队列研究。
Environ Res. 2024 Dec 15;263(Pt 3):120172. doi: 10.1016/j.envres.2024.120172. Epub 2024 Oct 16.
5
Comprehensive immune cell spectral library for large-scale human primary T, B, and NK cell proteomics.用于大规模人类原发性 T、B 和 NK 细胞蛋白质组学的全面免疫细胞光谱文库。
Sci Data. 2024 Aug 10;11(1):871. doi: 10.1038/s41597-024-03721-2.
6
High-Resolution Mass Spectrometry for Human Exposomics: Expanding Chemical Space Coverage.高分辨率质谱技术在人类暴露组学中的应用:拓展化学空间覆盖范围。
Environ Sci Technol. 2024 Jul 23;58(29):12784-12822. doi: 10.1021/acs.est.4c01156. Epub 2024 Jul 10.
7
Metabolite identification of salvianolic acid A in rat using post collision-induced dissociation energy-resolved mass spectrometry.采用碰撞诱导解离能量分辨质谱法对大鼠体内丹酚酸A进行代谢物鉴定。
Chin Med. 2024 Apr 26;19(1):64. doi: 10.1186/s13020-024-00931-z.
8
LC-MS/DIA-based strategy for comprehensive flavonoid profiling: an spp. applicability case.基于液相色谱-质谱联用/数据独立采集的全面黄酮类化合物分析策略:一个 spp. 适用性案例。
RSC Adv. 2024 Apr 2;14(15):10481-10498. doi: 10.1039/d4ra01384k. eCollection 2024 Mar 26.
9
Metabolomics: A Tool to Envisage Biomarkers in Clinical Interpretation of Cancer.代谢组学:临床解读癌症中生物标志物的工具。
Curr Drug Res Rev. 2024;16(3):333-348. doi: 10.2174/2589977516666230912120412.
10
SMetaS: A Sample Metadata Standardizer for Metabolomics.SMetaS:一种用于代谢组学的样本元数据标准化工具
Metabolites. 2023 Aug 12;13(8):941. doi: 10.3390/metabo13080941.