Optimization of metabolomic data processing using NOREVA.

Affiliations

College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.

Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, China.

Publication information

Nat Protoc. 2022 Jan;17(1):129-151. doi: 10.1038/s41596-021-00636-9. Epub 2021 Dec 24.

DOI:10.1038/s41596-021-00636-9

PMID:34952956

Abstract

A typical output of a metabolomic experiment is a peak table corresponding to the intensity of measured signals. Peak table processing, an essential procedure in metabolomics, is characterized by its study dependency and combinatorial diversity. While various methods and tools have been developed to facilitate metabolomic data processing, it is challenging to determine which processing workflow will give good performance for a specific metabolomic study. NOREVA, an out-of-the-box protocol, was therefore developed to meet this challenge. First, the peak table is subjected to many processing workflows that consist of three to five defined calculations in combinatorially determined sequences. Second, the results of each workflow are judged against objective performance criteria. Third, various benchmarks are analyzed to highlight the uniqueness of this newly developed protocol in (1) evaluating the processing performance based on multiple criteria, (2) optimizing data processing by scanning thousands of workflows, and (3) allowing data processing for time-course and multiclass metabolomics. This protocol is implemented in an R package for convenient accessibility and to protect users' data privacy. Preliminary experience in R language would facilitate the usage of this protocol, and the execution time may vary from several minutes to a couple of hours depending on the size of the analyzed data.
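The scan-and-rank idea described in the abstract can be illustrated with a minimal sketch. It does not use the NOREVA package or reproduce its methods: the toy peak table, the three stages, the step functions, and the single separation score are all hypothetical stand-ins chosen for illustration. As a simplifying assumption, the stage order is fixed and only the choice of method per stage varies, whereas the protocol combines three to five calculations and judges them against multiple criteria.

```python
import itertools
import math
import statistics

# Toy peak table (hypothetical data): rows are samples, columns are features.
# None marks a missing peak.
PEAKS = [
    [120.0, 30.0, None, 900.0],
    [150.0, 28.0, 55.0, 950.0],
    [135.0, None, 60.0, 870.0],
    [300.0, 31.0, 20.0, 400.0],
    [280.0, 27.0, 25.0, 450.0],
    [310.0, 29.0, None, 420.0],
]
GROUPS = ["A", "A", "A", "B", "B", "B"]


def columns(table):
    """Return the table as a list of feature columns."""
    return [list(col) for col in zip(*table)]


def fill_missing(table, fills):
    """Replace None in each column with that column's fill value."""
    return [[fills[j] if v is None else v for j, v in enumerate(row)] for row in table]


# Illustrative step functions only; these are not NOREVA's methods or names.
def impute_half_min(table):
    fills = [min(v for v in col if v is not None) / 2 for col in columns(table)]
    return fill_missing(table, fills)


def impute_mean(table):
    fills = [statistics.mean([v for v in col if v is not None]) for col in columns(table)]
    return fill_missing(table, fills)


def log2_transform(table):
    return [[math.log2(v) for v in row] for row in table]


def cube_root(table):
    return [[v ** (1 / 3) for v in row] for row in table]


def normalize_sum(table):
    return [[v / sum(row) for v in row] for row in table]


def normalize_median(table):
    return [[v / statistics.median(row) for v in row] for row in table]


# One method is picked per stage; stages run in this fixed order (an assumption).
STAGES = {
    "imputation": [impute_half_min, impute_mean],
    "transformation": [log2_transform, cube_root],
    "normalization": [normalize_sum, normalize_median],
}


def separation_score(table, groups):
    """Hypothetical criterion: mean standardized difference between two groups."""
    first, second = sorted(set(groups))
    scores = []
    for col in columns(table):
        a = [v for v, g in zip(col, groups) if g == first]
        b = [v for v, g in zip(col, groups) if g == second]
        pooled = math.sqrt((statistics.variance(a) + statistics.variance(b)) / 2)
        if pooled > 0:
            scores.append(abs(statistics.mean(a) - statistics.mean(b)) / pooled)
    return statistics.mean(scores)


def run_all(table, groups):
    """Apply every combination of steps and rank workflows by score, best first."""
    results = []
    for steps in itertools.product(*STAGES.values()):
        processed = table
        for step in steps:
            processed = step(processed)
        name = " -> ".join(step.__name__ for step in steps)
        results.append((separation_score(processed, groups), name))
    return sorted(results, reverse=True)


RANKED = run_all(PEAKS, GROUPS)
for score, name in RANKED:
    print(f"{score:8.3f}  {name}")
```

With two methods per stage the sketch scans 2 × 2 × 2 = 8 workflows. Adding methods or stages grows the count multiplicatively, which is consistent with the abstract's mention of scanning thousands of workflows.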
