• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于质谱的多组学数据集标准化策略的评估

Evaluation of normalization strategies for mass spectrometry-based multi-omics datasets.

作者信息

Tseng Chi Yen, Salguero Jessica A, Breidenbach Joshua D, Solomon Emilia, Sanders Claire K, Harvey Tara, Thornhill M Grace, Palmisano Salvator J, Sasiene Zachary J, Blackwell Brett R, McBride Ethan M, Luchini Kes A, LeBrun Erick S, Alvarez Marc, Mach Phillip M, Rivera Emilio S, Glaros Trevor G

机构信息

Biochemistry and Biotechnology Group, Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, 84545, USA.

Microbial and Biome Sciences Group, Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.

出版信息

Metabolomics. 2025 Jul 1;21(4):98. doi: 10.1007/s11306-025-02297-1.

DOI:10.1007/s11306-025-02297-1
PMID:40593232
Abstract

INTRODUCTION

Data normalization is crucial for multi-omics integration, reducing systematic errors and maximizing the likelihood of discovering true biological variation. Most studies assess normalization for a single omics type or use datasets from separate experiments. Few address time-course data, where normalization might bias temporal differentiation. In this study, we compared common normalization methods and a machine learning approach, Systematical Error Removal using Random Forest (SERRF), using multi-omics datasets generated from the same experiment-even from the same cell lysate.

OBJECTIVES

To develop a straightforward process to assess normalization effects and identify the most robust methods across multi-omics datasets.

METHODS

We analyzed metabolomics, lipidomics, and proteomics datasets from primary human cardiomyocytes and motor neurons exposed to acetylcholine-active compounds over time. Normalization effectiveness was evaluated based on improvement in QC features consistency and observing the change in treatment and time-related variance.

RESULTS

Probabilistic Quotient Normalization (PQN) and Locally Estimated Scatterplot Smoothing (LOESS) QC were identified as optimal for metabolomics and lipidomics, while PQN, Median, and LOESS normalization excelled for proteomics. These methods consistently enhanced QC feature consistency in metabolomics and lipidomics, and preserved time-related variance or treatment-related variance in proteomics, demonstrating their effectiveness and robustness. SERRF normalization, applied only to metabolomics in this study, outperformed other methods in some datasets but inadvertently masked treatment-related variance in others.

CONCLUSION

Our evaluation identified PQN and LoessQC as the top methods for metabolomics and lipidomics, and PQN, Median, and Loess normalization for proteomics, in multi-omics integration in a temporal study.

摘要

引言

数据归一化对于多组学整合至关重要,它可减少系统误差并最大化发现真实生物学变异的可能性。大多数研究评估单一组学类型的归一化,或使用来自单独实验的数据集。很少有研究涉及时间进程数据,而归一化可能会使时间差异产生偏差。在本研究中,我们使用来自同一实验甚至同一细胞裂解物生成的多组学数据集,比较了常见的归一化方法和一种机器学习方法——使用随机森林去除系统误差(SERRF)。

目的

开发一个简单的流程来评估归一化效果,并在多组学数据集中识别最稳健的方法。

方法

我们分析了原代人心肌细胞和运动神经元随时间暴露于乙酰胆碱活性化合物后的代谢组学、脂质组学和蛋白质组学数据集。基于质量控制(QC)特征一致性的改善以及观察处理和时间相关方差的变化来评估归一化效果。

结果

概率商归一化(PQN)和局部估计散点图平滑(LOESS)质量控制被确定为代谢组学和脂质组学的最佳方法,而PQN、中位数和LOESS归一化在蛋白质组学方面表现出色。这些方法持续增强了代谢组学和脂质组学中QC特征的一致性,并在蛋白质组学中保留了时间相关方差或处理相关方差,证明了它们的有效性和稳健性。在本研究中仅应用于代谢组学的SERRF归一化在某些数据集中优于其他方法,但在其他数据集中无意中掩盖了处理相关方差。

结论

我们的评估确定了在时间研究的多组学整合中,PQN和LoessQC是代谢组学和脂质组学的顶级方法,而PQN、中位数和Loess归一化是蛋白质组学的顶级方法。

相似文献

1
Evaluation of normalization strategies for mass spectrometry-based multi-omics datasets.基于质谱的多组学数据集标准化策略的评估
Metabolomics. 2025 Jul 1;21(4):98. doi: 10.1007/s11306-025-02297-1.
2
Supervised Parametric Learning in the Identification of Composite Biomarker Signatures of Type 1 Diabetes in Integrated Parallel Multi-Omics Datasets.在整合的平行多组学数据集中识别1型糖尿病复合生物标志物特征的监督参数学习
Biomedicines. 2024 Feb 22;12(3):492. doi: 10.3390/biomedicines12030492.
3
A Comprehensive Protocol and Step-by-Step Guide for Multi-Omics Integration in Biological Research.生物研究中多组学整合的综合方案与分步指南
J Vis Exp. 2025 Aug 8(222). doi: 10.3791/66995.
4
Precision Neuro-Oncology in Glioblastoma: AI-Guided CRISPR Editing and Real-Time Multi-Omics for Genomic Brain Surgery.胶质母细胞瘤中的精准神经肿瘤学:用于基因组脑手术的人工智能引导的CRISPR编辑和实时多组学技术
Int J Mol Sci. 2025 Jul 30;26(15):7364. doi: 10.3390/ijms26157364.
5
Integrated Multi-Omics Analysis of Cerebrospinal Fluid in Postoperative Delirium.术后谵妄患者脑脊液的综合多组学分析。
Biomolecules. 2024 Jul 30;14(8):924. doi: 10.3390/biom14080924.
6
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
7
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
8
Local neighbor Normalization: Reconciling accurate normalization and heterogeneity recovery in large-scale metabolomics.局部邻域归一化:在大规模代谢组学中协调精确归一化与异质性恢复
Anal Chim Acta. 2025 Oct 22;1372:344440. doi: 10.1016/j.aca.2025.344440. Epub 2025 Jul 16.
9
Characterizing the omics landscape based on 10,000+ datasets.基于一万多个数据集描绘组学全景。
Sci Rep. 2025 Jan 25;15(1):3189. doi: 10.1038/s41598-025-87256-5.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

本文引用的文献

1
Characterizing the omics landscape based on 10,000+ datasets.基于一万多个数据集描绘组学全景。
Sci Rep. 2025 Jan 25;15(1):3189. doi: 10.1038/s41598-025-87256-5.
2
The Addition of Transcriptomics to the Bead-Enabled Accelerated Monophasic Multi-Omics Method: A Step toward Universal Sample Preparation.转录组学在珠加速单相多组学方法中的应用:迈向通用样品制备的一步。
Anal Chem. 2024 Nov 19;96(46):18343-18348. doi: 10.1021/acs.analchem.4c02835. Epub 2024 Oct 9.
3
Feature-agnostic metabolomics for determining effective subcytotoxic doses of common pesticides in human cells.
基于特征的代谢组学方法用于确定常见农药对人细胞的亚细胞毒性有效剂量。
Toxicol Sci. 2024 Nov 1;202(1):85-95. doi: 10.1093/toxsci/kfae101.
4
MetaboAnalyst 6.0: towards a unified platform for metabolomics data processing, analysis and interpretation.MetaboAnalyst 6.0:迈向代谢组学数据处理、分析和解释的统一平台。
Nucleic Acids Res. 2024 Jul 5;52(W1):W398-W406. doi: 10.1093/nar/gkae253.
5
Toward a More Comprehensive Approach for Dissolved Organic Matter Chemical Characterization Using an Orbitrap Fusion Tribrid Mass Spectrometer Coupled with Ion and Liquid Chromatography Techniques.采用与离子色谱和液相色谱技术联用的轨道阱融合型三合一质谱仪,实现对溶解有机物化学特性更全面的分析方法。
Anal Chem. 2024 Mar 5;96(9):3744-3753. doi: 10.1021/acs.analchem.3c02599. Epub 2024 Feb 19.
6
Workflow for Evaluating Normalization Tools for Omics Data Using Supervised and Unsupervised Machine Learning.使用监督式和非监督式机器学习评估组学数据标准化工具的工作流程。
J Am Soc Mass Spectrom. 2023 Dec 6;34(12):2775-2784. doi: 10.1021/jasms.3c00295. Epub 2023 Oct 28.
7
Denoising Autoencoder Normalization for Large-Scale Untargeted Metabolomics by Gas Chromatography-Mass Spectrometry.基于气相色谱-质谱联用技术的大规模非靶向代谢组学去噪自动编码器归一化方法
Metabolites. 2023 Aug 13;13(8):944. doi: 10.3390/metabo13080944.
8
Normalization methods in mass spectrometry-based analytical proteomics: A case study based on renal cell carcinoma datasets.基于质谱的分析蛋白质组学中的归一化方法:基于肾细胞癌数据集的案例研究。
Talanta. 2024 Jan 1;266(Pt 1):124953. doi: 10.1016/j.talanta.2023.124953. Epub 2023 Jul 17.
9
Missing data in multi-omics integration: Recent advances through artificial intelligence.多组学整合中的缺失数据:通过人工智能取得的最新进展
Front Artif Intell. 2023 Feb 9;6:1098308. doi: 10.3389/frai.2023.1098308. eCollection 2023.
10
Dealing with missing values in proteomics data.处理蛋白质组学数据中的缺失值。
Proteomics. 2022 Dec;22(23-24):e2200092. doi: 10.1002/pmic.202200092. Epub 2022 Nov 17.