• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多组学数据整合方法的技术综述:从经典统计方法到深度生成方法

A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches.

作者信息

Baião Ana R, Cai Zhaoxiang, Poulos Rebecca C, Robinson Phillip J, Reddel Roger R, Zhong Qing, Vinga Susana, Gonçalves Emanuel

机构信息

INESC-ID, Rua Alves Redol 9, 1000-029 Lisboa, Portugal.

Instituto Superior Técnico (IST), Universidade de Lisboa, Av. Rovisco Pais, 1049-001 Lisboa, Portugal.

出版信息

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf355.

DOI:10.1093/bib/bbaf355
PMID:40748323
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12315550/
Abstract

The rapid advancement of high-throughput sequencing and other assay technologies has resulted in the generation of large and complex multi-omics datasets, offering unprecedented opportunities for advancing precision medicine. However, multi-omics data integration remains challenging due to the high-dimensionality, heterogeneity, and frequency of missing values across data types. Computational methods leveraging statistical and machine learning approaches have been developed to address these issues and uncover complex biological patterns, improving our understanding of disease mechanisms. Here, we comprehensively review state-of-the-art multi-omics integration methods with a focus on deep generative models, particularly variational autoencoders (VAEs) that have been widely used for data imputation, augmentation, and batch effect correction. We explore the technical aspects of VAE loss functions and regularisation techniques, including adversarial training, disentanglement, and contrastive learning. Moreover, we highlight recent advancements in foundation models and multimodal data integration, outlining future directions in precision medicine research.

摘要

高通量测序和其他检测技术的快速发展,已产生了大量复杂的多组学数据集,为推进精准医学提供了前所未有的机遇。然而,由于数据类型之间存在高维度、异质性和缺失值频率等问题,多组学数据整合仍然具有挑战性。利用统计和机器学习方法的计算方法已被开发出来,以解决这些问题并揭示复杂的生物学模式,增进我们对疾病机制的理解。在此,我们全面综述了最先进的多组学整合方法,重点关注深度生成模型,特别是变分自编码器(VAE),它已被广泛用于数据插补、增强和批次效应校正。我们探讨了VAE损失函数和正则化技术的技术方面,包括对抗训练、解缠和对比学习。此外,我们强调了基础模型和多模态数据整合方面的最新进展,概述了精准医学研究的未来方向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c5b/12315550/6b1dc724ef22/bbaf355f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c5b/12315550/d0fc75198b84/bbaf355f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c5b/12315550/61425fea613d/bbaf355f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c5b/12315550/414c2dc1b8a7/bbaf355f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c5b/12315550/6b1dc724ef22/bbaf355f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c5b/12315550/d0fc75198b84/bbaf355f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c5b/12315550/61425fea613d/bbaf355f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c5b/12315550/414c2dc1b8a7/bbaf355f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c5b/12315550/6b1dc724ef22/bbaf355f4.jpg

相似文献

1
A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches.多组学数据整合方法的技术综述:从经典统计方法到深度生成方法
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf355.
2
Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping.新型多组学去混淆变分自动编码器可获得有意义的疾病亚型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae512.
3
Deep learning applications in single-cell genomics and transcriptomics data analysis.深度学习在单细胞基因组学和转录组学数据分析中的应用。
Biomed Pharmacother. 2023 Sep;165:115077. doi: 10.1016/j.biopha.2023.115077. Epub 2023 Jul 1.
4
Graph neural networks for single-cell omics data: a review of approaches and applications.用于单细胞组学数据的图神经网络:方法与应用综述
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf109.
5
The Use of AI for Phenotype-Genotype Mapping.人工智能在表型-基因型映射中的应用。
Methods Mol Biol. 2025;2952:369-410. doi: 10.1007/978-1-0716-4690-8_21.
6
Comparative analysis of statistical and deep learning-based multi-omics integration for breast cancer subtype classification.基于统计和深度学习的多组学整合用于乳腺癌亚型分类的比较分析
J Transl Med. 2025 Jul 1;23(1):709. doi: 10.1186/s12967-025-06662-5.
7
Deep learning tackles single-cell analysis-a survey of deep learning for scRNA-seq analysis.深度学习应对单细胞分析——深度学习在 scRNA-seq 分析中的应用综述。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab531.
8
Gene regulatory network integration with multi-omics data enhances survival predictions in cancer.基因调控网络与多组学数据的整合提高了癌症生存预测能力。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf315.
9
Multi-omics data integration and analysis pipeline for precision medicine: Systematic review.用于精准医学的多组学数据整合与分析流程:系统综述
Comput Biol Chem. 2024 Dec;113:108254. doi: 10.1016/j.compbiolchem.2024.108254. Epub 2024 Oct 16.
10
Integrating multi-omics and machine learning for disease resistance prediction in legumes.整合多组学和机器学习用于豆类抗病性预测
Theor Appl Genet. 2025 Jun 27;138(7):163. doi: 10.1007/s00122-025-04948-2.

引用本文的文献

1
Deep learning-driven multi-omics analysis: enhancing cancer diagnostics and therapeutics.深度学习驱动的多组学分析:增强癌症诊断与治疗
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf440.

本文引用的文献

1
Foundation models in bioinformatics.生物信息学中的基础模型。
Natl Sci Rev. 2025 Jan 25;12(4):nwaf028. doi: 10.1093/nsr/nwaf028. eCollection 2025 Apr.
2
A foundation model of transcription across human cell types.一种跨人类细胞类型的转录基础模型。
Nature. 2025 Jan;637(8047):965-973. doi: 10.1038/s41586-024-08391-z. Epub 2025 Jan 8.
3
Synthetic augmentation of cancer cell line multi-omic datasets using unsupervised deep learning.使用无监督深度学习对癌细胞系多组学数据集进行人工合成扩充。
Nat Commun. 2024 Nov 29;15(1):10390. doi: 10.1038/s41467-024-54771-4.
4
MolE: a foundation model for molecular graphs using disentangled attention.MolE:一种基于解缠注意力的分子图基础模型。
Nat Commun. 2024 Nov 12;15(1):9431. doi: 10.1038/s41467-024-53751-y.
5
DeePathNet: A Transformer-Based Deep Learning Model Integrating Multiomic Data with Cancer Pathways.DeePathNet:一种基于Transformer的深度学习模型,整合多组学数据与癌症通路。
Cancer Res Commun. 2024 Dec 1;4(12):3151-3164. doi: 10.1158/2767-9764.CRC-24-0285.
6
Assessing and mitigating batch effects in large-scale omics studies.评估和减轻大规模组学研究中的批次效应。
Genome Biol. 2024 Oct 3;25(1):254. doi: 10.1186/s13059-024-03401-9.
7
Deep learning-based approaches for multi-omics data integration and analysis.基于深度学习的多组学数据整合与分析方法。
BioData Min. 2024 Oct 2;17(1):38. doi: 10.1186/s13040-024-00391-z.
8
Benchmarking algorithms for single-cell multi-omics prediction and integration.单细胞多组学预测和整合算法的基准测试。
Nat Methods. 2024 Nov;21(11):2182-2194. doi: 10.1038/s41592-024-02429-w. Epub 2024 Sep 25.
9
Joint variational autoencoders for multimodal imputation and embedding.用于多模态插补和嵌入的联合变分自编码器
Nat Mach Intell. 2023 Jun;5(6):631-642. doi: 10.1038/s42256-023-00663-z. Epub 2023 May 29.
10
Transformers in single-cell omics: a review and new perspectives.单细胞组学中的转换器:综述与新视角。
Nat Methods. 2024 Aug;21(8):1430-1443. doi: 10.1038/s41592-024-02353-z. Epub 2024 Aug 9.