WaveICA: A novel algorithm to remove batch effects for large-scale untargeted metabolomics data based on wavelet analysis.

Affiliations

Department of Epidemiology and Biostatistics, School of Public Health, Harbin Medical University, Harbin, 150086, China.

Laboratory of Hematology Center, First Affiliated Hospital of Harbin Medical University, Harbin, 150086, China.

Publication information

Anal Chim Acta. 2019 Jul 11;1061:60-69. doi: 10.1016/j.aca.2019.02.010. Epub 2019 Feb 19.

DOI: 10.1016/j.aca.2019.02.010
PMID: 30926040

Abstract

Metabolomics provides new insights into disease pathogenesis and biomarker discovery. Samples from large-scale untargeted metabolomics studies are typically analyzed using a liquid chromatography-mass spectrometry platform in several batches. Batch effects that are caused by non-biological systematic biases are unavoidable in large-scale metabolomics studies, even with properly designed experiments. The statistical analysis of large-scale metabolomics data without managing batch effects will yield misleading results. In this study, we propose a novel algorithm, called WaveICA, which is based on the wavelet transform method with independent component analysis, as the threshold processing method to capture and remove batch effects for large-scale metabolomics data. The WaveICA method uses the time trend of samples over the injection order, decomposes the original data into multi-scale data with different features, extracts and removes the batch effect information in multi-scale data, and obtains clean data. The WaveICA method was tested on real metabolomics data. After applying the WaveICA method, scattered quality control samples (QCS) and subject samples in a PCA score plot of the original data were closely clustered, respectively. The average Pearson correlation coefficients for all peaks of the QCS increased from 0.872 to 0.972. Additionally, WaveICA significantly improved the classification accuracy for metabolomics data. The method was compared with three representative methods, and outperformed all of them. To conclude, WaveICA can efficiently remove batch effects while revealing more biological information. This method can be used in large-scale untargeted metabolomics studies to preprocess raw metabolomics data.
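
The multi-scale idea in the abstract can be illustrated with a toy sketch. The Python below is not the authors' WaveICA implementation: it uses a plain Haar wavelet, and it replaces the independent component analysis step with a much cruder stand-in (flattening the coarsest-scale coefficients to their mean), purely to show how a batch offset along the injection order can separate from fast sample-to-sample variation. The function names, the eight-injection intensity series, and the two-batch layout are all hypothetical.

```python
import math

SQRT2 = math.sqrt(2.0)

def haar_decompose(signal, levels):
    """Orthonormal Haar decomposition.

    Returns (approximation, details), where details[0] is the finest scale.
    """
    approx = [float(v) for v in signal]
    details = []
    for _ in range(levels):
        if len(approx) < 2 or len(approx) % 2:
            raise ValueError("signal length must be divisible by 2**levels")
        evens, odds = approx[0::2], approx[1::2]
        details.append([(a - b) / SQRT2 for a, b in zip(evens, odds)])
        approx = [(a + b) / SQRT2 for a, b in zip(evens, odds)]
    return approx, details

def haar_reconstruct(approx, details):
    """Invert haar_decompose, rebuilding from the coarsest scale to the finest."""
    current = list(approx)
    for detail in reversed(details):
        rebuilt = []
        for s, d in zip(current, detail):
            rebuilt.append((s + d) / SQRT2)
            rebuilt.append((s - d) / SQRT2)
        current = rebuilt
    return current

# Hypothetical intensities of one peak, listed in injection order.
# Injections 1-4 form batch 1; injections 5-8 form batch 2, shifted by +4.
intensity = [10, 12, 10, 12, 14, 16, 14, 16]

approx, details = haar_decompose(intensity, levels=2)

# Stand-in for the ICA step (an assumption, not the published method):
# the batch shift lives entirely in the coarse approximation here,
# so flatten that scale to its mean and keep the detail scales unchanged.
mean_approx = sum(approx) / len(approx)
flattened = [mean_approx] * len(approx)

corrected = haar_reconstruct(flattened, details)
print([round(v, 6) for v in corrected])
```

In this toy series the corrected values alternate between about 12 and 14 in both batches, so the step between batches disappears while the within-batch pattern is kept. Flattening a whole scale would also erase any genuine slow biological trend, which is one reason a real method needs a more selective step, such as the component analysis the abstract describes, to decide what counts as batch effect.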

Similar articles

1
WaveICA: A novel algorithm to remove batch effects for large-scale untargeted metabolomics data based on wavelet analysis.
Anal Chim Acta. 2019 Jul 11;1061:60-69. doi: 10.1016/j.aca.2019.02.010. Epub 2019 Feb 19.
2
WaveICA 2.0: a novel batch effect removal method for untargeted metabolomics data without using batch information.
Metabolomics. 2021 Sep 20;17(10):87. doi: 10.1007/s11306-021-01839-7.
3
Batch Normalizer: a fast total abundance regression calibration method to simultaneously adjust batch and injection order effects in liquid chromatography/time-of-flight mass spectrometry-based metabolomics data and comparison with current calibration methods.
Anal Chem. 2013 Jan 15;85(2):1037-46. doi: 10.1021/ac302877x. Epub 2012 Dec 26.
4
NormAE: Deep Adversarial Learning Model to Remove Batch Effects in Liquid Chromatography Mass Spectrometry-Based Metabolomics Data.
Anal Chem. 2020 Apr 7;92(7):5082-5090. doi: 10.1021/acs.analchem.9b05460. Epub 2020 Mar 24.
5
Improved batch correction in untargeted MS-based metabolomics.
Metabolomics. 2016;12:88. doi: 10.1007/s11306-016-1015-8. Epub 2016 Mar 18.
6
Characterising and correcting batch variation in an automated direct infusion mass spectrometry (DIMS) metabolomics workflow.
Anal Bioanal Chem. 2013 Jun;405(15):5147-57. doi: 10.1007/s00216-013-6856-7. Epub 2013 Mar 1.
7
A Novel Strategy for Large-Scale Metabolomics Study by Calibrating Gross and Systematic Errors in Gas Chromatography-Mass Spectrometry.
Anal Chem. 2016 Feb 16;88(4):2234-42. doi: 10.1021/acs.analchem.5b03912. Epub 2016 Jan 27.
8
statTarget: A streamlined tool for signal drift correction and interpretations of quantitative mass spectrometry-based omics data.
Anal Chim Acta. 2018 Dec 7;1036:66-72. doi: 10.1016/j.aca.2018.08.002. Epub 2018 Aug 6.
9
Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data.
BMC Bioinformatics. 2017 Feb 2;18(1):84. doi: 10.1186/s12859-017-1501-7.
10
Concordance-Based Batch Effect Correction for Large-Scale Metabolomics.
Anal Chem. 2023 May 9;95(18):7220-7228. doi: 10.1021/acs.analchem.2c05748. Epub 2023 Apr 28.

Cited by

1
Machine Learning-Driven Insights in Cancer Metabolomics: From Subtyping to Biomarker Discovery and Prognostic Modeling.
Metabolites. 2025 Aug 1;15(8):514. doi: 10.3390/metabo15080514.
2
Quality Control Standards for Batch Effect Evaluation and Correction in Mass Spectrometry Imaging.
Anal Chem. 2025 May 27;97(20):10919-10928. doi: 10.1021/acs.analchem.5c02020. Epub 2025 May 12.
3
Decoding immune cell interactions during cardiac allograft vasculopathy: insights derived from bioinformatic strategies.
Front Cardiovasc Med. 2025 Apr 24;12:1568528. doi: 10.3389/fcvm.2025.1568528. eCollection 2025.
4
An untargeted metabolome-wide association study of maternal perinatal tobacco smoking in newborn blood spots.
Metabolomics. 2025 Feb 20;21(2):30. doi: 10.1007/s11306-025-02225-3.
5
Advancements in Mass Spectrometry-Based Targeted Metabolomics and Lipidomics: Implications for Clinical Research.
Molecules. 2024 Dec 16;29(24):5934. doi: 10.3390/molecules29245934.
6
Statistical analysis of feature-based molecular networking results from non-targeted metabolomics data.
Nat Protoc. 2025 Jan;20(1):92-162. doi: 10.1038/s41596-024-01046-3. Epub 2024 Sep 20.
7
Unraveling the metabolomic architecture of autism in a large Danish population-based cohort.
BMC Med. 2024 Jul 19;22(1):302. doi: 10.1186/s12916-024-03516-7.
8
BERNN: Enhancing classification of Liquid Chromatography Mass Spectrometry data with batch effect removal neural networks.
Nat Commun. 2024 May 6;15(1):3777. doi: 10.1038/s41467-024-48177-5.
9
The neonatal blood spot metabolome in retinoblastoma.
EJC Paediatr Oncol. 2023 Dec;2. doi: 10.1016/j.ejcped.2023.100123. Epub 2023 Nov 10.
10
Workflow for Evaluating Normalization Tools for Omics Data Using Supervised and Unsupervised Machine Learning.
J Am Soc Mass Spectrom. 2023 Dec 6;34(12):2775-2784. doi: 10.1021/jasms.3c00295. Epub 2023 Oct 28.