• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度学习的癌症多组学数据融合方法的基准研究。

A benchmark study of deep learning-based multi-omics data fusion methods for cancer.

机构信息

Institute of Health Service and Transfusion Medicine, Beijing, People's Republic of China.

School of Informatics, Xiamen University, Xiamen, People's Republic of China.

出版信息

Genome Biol. 2022 Aug 9;23(1):171. doi: 10.1186/s13059-022-02739-2.

DOI:10.1186/s13059-022-02739-2
PMID:35945544
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9361561/
Abstract

BACKGROUND

A fused method using a combination of multi-omics data enables a comprehensive study of complex biological processes and highlights the interrelationship of relevant biomolecules and their functions. Driven by high-throughput sequencing technologies, several promising deep learning methods have been proposed for fusing multi-omics data generated from a large number of samples.

RESULTS

In this study, 16 representative deep learning methods are comprehensively evaluated on simulated, single-cell, and cancer multi-omics datasets. For each of the datasets, two tasks are designed: classification and clustering. The classification performance is evaluated by using three benchmarking metrics including accuracy, F1 macro, and F1 weighted. Meanwhile, the clustering performance is evaluated by using four benchmarking metrics including the Jaccard index (JI), C-index, silhouette score, and Davies Bouldin score. For the cancer multi-omics datasets, the methods' strength in capturing the association of multi-omics dimensionality reduction results with survival and clinical annotations is further evaluated. The benchmarking results indicate that moGAT achieves the best classification performance. Meanwhile, efmmdVAE, efVAE, and lfmmdVAE show the most promising performance across all complementary contexts in clustering tasks.

CONCLUSIONS

Our benchmarking results not only provide a reference for biomedical researchers to choose appropriate deep learning-based multi-omics data fusion methods, but also suggest the future directions for the development of more effective multi-omics data fusion methods. The deep learning frameworks are available at https://github.com/zhenglinyi/DL-mo .

摘要

背景

融合使用多种组学数据的方法能够全面研究复杂的生物过程,并突出相关生物分子及其功能的相互关系。受高通量测序技术的推动,已经提出了几种有前途的深度学习方法,用于融合来自大量样本的多种组学数据。

结果

在这项研究中,16 种有代表性的深度学习方法在模拟、单细胞和癌症多组学数据集中进行了全面评估。对于每个数据集,设计了两个任务:分类和聚类。使用准确性、F1 宏和 F1 加权三种基准指标评估分类性能。同时,使用 Jaccard 指数(JI)、C 指数、轮廓得分和 Davies Bouldin 得分四种基准指标评估聚类性能。对于癌症多组学数据集,进一步评估了方法在捕捉多维组学降维结果与生存和临床注释的关联方面的优势。基准测试结果表明,moGAT 实现了最佳的分类性能。同时,efmmdVAE、efVAE 和 lfmmdVAE 在聚类任务的所有互补上下文中表现出最有前途的性能。

结论

我们的基准测试结果不仅为生物医学研究人员选择合适的基于深度学习的多组学数据融合方法提供了参考,也为开发更有效的多组学数据融合方法提供了未来的方向。深度学习框架可在 https://github.com/zhenglinyi/DL-mo 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/19f9bd850376/13059_2022_2739_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/6db5dcd3704c/13059_2022_2739_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/d4562e565acb/13059_2022_2739_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/1570414727ae/13059_2022_2739_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/4d60c8dc3cc4/13059_2022_2739_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/dcecb7ba8d8c/13059_2022_2739_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/a4098bffd286/13059_2022_2739_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/bda7daa14235/13059_2022_2739_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/5c70ce4fefce/13059_2022_2739_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/19f9bd850376/13059_2022_2739_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/6db5dcd3704c/13059_2022_2739_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/d4562e565acb/13059_2022_2739_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/1570414727ae/13059_2022_2739_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/4d60c8dc3cc4/13059_2022_2739_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/dcecb7ba8d8c/13059_2022_2739_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/a4098bffd286/13059_2022_2739_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/bda7daa14235/13059_2022_2739_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/5c70ce4fefce/13059_2022_2739_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12ea/9361561/19f9bd850376/13059_2022_2739_Fig9_HTML.jpg

相似文献

1
A benchmark study of deep learning-based multi-omics data fusion methods for cancer.基于深度学习的癌症多组学数据融合方法的基准研究。
Genome Biol. 2022 Aug 9;23(1):171. doi: 10.1186/s13059-022-02739-2.
2
Benchmarking joint multi-omics dimensionality reduction approaches for the study of cancer.基于癌症研究的联合多组学降维方法的基准测试。
Nat Commun. 2021 Jan 5;12(1):124. doi: 10.1038/s41467-020-20430-7.
3
Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping.新型多组学去混淆变分自动编码器可获得有意义的疾病亚型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae512.
4
Multi-omics clustering for cancer subtyping based on latent subspace learning.基于潜在子空间学习的癌症亚型多组学聚类。
Comput Biol Med. 2023 Sep;164:107223. doi: 10.1016/j.compbiomed.2023.107223. Epub 2023 Jul 3.
5
Multi-omics integration method based on attention deep learning network for biomedical data classification.基于注意力深度学习网络的多组学整合方法用于生物医学数据分类
Comput Methods Programs Biomed. 2023 Apr;231:107377. doi: 10.1016/j.cmpb.2023.107377. Epub 2023 Jan 27.
6
Large-scale benchmark study of survival prediction methods using multi-omics data.大规模基于多组学数据的生存预测方法基准研究。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa167.
7
AVBAE-MODFR: A novel deep learning framework of embedding and feature selection on multi-omics data for pan-cancer classification.AVBAE-MODFR:一种基于多组学数据的嵌入和特征选择的深度学习框架,用于泛癌分类。
Comput Biol Med. 2024 Jul;177:108614. doi: 10.1016/j.compbiomed.2024.108614. Epub 2024 May 14.
8
MCluster-VAEs: An end-to-end variational deep learning-based clustering method for subtype discovery using multi-omics data.MCluster-VAEs:一种基于变分深度学习的端到端聚类方法,用于利用多组学数据进行亚型发现。
Comput Biol Med. 2022 Nov;150:106085. doi: 10.1016/j.compbiomed.2022.106085. Epub 2022 Sep 6.
9
scMIC: A Deep Multi-Level Information Fusion Framework for Clustering Single-Cell Multi-Omics Data.scMIC:一种用于聚类单细胞多组学数据的深度多层信息融合框架。
IEEE J Biomed Health Inform. 2023 Dec;27(12):6121-6132. doi: 10.1109/JBHI.2023.3317272. Epub 2023 Dec 5.
10
Incomplete multi-view gene clustering with data regeneration using Shape Boltzmann Machine.基于形状玻尔兹曼机的数据再生的不完全多视图基因聚类
Comput Biol Med. 2020 Oct;125:103965. doi: 10.1016/j.compbiomed.2020.103965. Epub 2020 Sep 8.

引用本文的文献

1
Multi-omics analysis of thermogenic lipolysis in brown adipocytes.棕色脂肪细胞产热脂肪分解的多组学分析
iScience. 2025 Aug 16;28(9):113382. doi: 10.1016/j.isci.2025.113382. eCollection 2025 Sep 19.
2
Flexynesis: A deep learning toolkit for bulk multi-omics data integration for precision oncology and beyond.Flexynesis:用于精准肿瘤学及其他领域的批量多组学数据整合的深度学习工具包。
Nat Commun. 2025 Sep 12;16(1):8261. doi: 10.1038/s41467-025-63688-5.
3
Deep learning-driven multi-omics analysis: enhancing cancer diagnostics and therapeutics.

本文引用的文献

1
Deep learning identified glioblastoma subtypes based on internal genomic expression ranks.深度学习基于内部基因组表达排名识别胶质母细胞瘤亚型。
BMC Cancer. 2022 Jan 20;22(1):86. doi: 10.1186/s12885-022-09191-2.
2
Harnessing multimodal data integration to advance precision oncology.利用多模态数据整合推进精准肿瘤学。
Nat Rev Cancer. 2022 Feb;22(2):114-126. doi: 10.1038/s41568-021-00408-3. Epub 2021 Oct 18.
3
Multi-omics integration in the age of million single-cell data.多组学整合在百万单细胞数据时代。
深度学习驱动的多组学分析:增强癌症诊断与治疗
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf440.
4
Mapping the future: bibliometric analysis of omics research trends in non-small cell lung cancer.绘制未来蓝图:非小细胞肺癌组学研究趋势的文献计量分析
Discov Oncol. 2025 Aug 12;16(1):1536. doi: 10.1007/s12672-025-03140-8.
5
Interpretable and integrative analysis of single-cell multiomics with scMKL.使用scMKL对单细胞多组学进行可解释的综合分析。
Commun Biol. 2025 Aug 6;8(1):1160. doi: 10.1038/s42003-025-08533-7.
6
Quality-of-life scale machine learning approach to predict immunotherapy response in patients with advanced non-small cell lung cancer.采用生活质量量表机器学习方法预测晚期非小细胞肺癌患者的免疫治疗反应。
Front Immunol. 2025 Jul 18;16:1600265. doi: 10.3389/fimmu.2025.1600265. eCollection 2025.
7
Visible neural networks for multi-omics integration: a critical review.用于多组学整合的可视化神经网络:批判性综述
Front Artif Intell. 2025 Jul 17;8:1595291. doi: 10.3389/frai.2025.1595291. eCollection 2025.
8
Identification of a 10-species microbial signature of inflammatory bowel disease by machine learning and external validation.通过机器学习和外部验证识别炎症性肠病的10种微生物特征
Cell Regen. 2025 Jul 14;14(1):32. doi: 10.1186/s13619-025-00246-w.
9
Comparative analysis of statistical and deep learning-based multi-omics integration for breast cancer subtype classification.基于统计和深度学习的多组学整合用于乳腺癌亚型分类的比较分析
J Transl Med. 2025 Jul 1;23(1):709. doi: 10.1186/s12967-025-06662-5.
10
3Mont: A multi-omics integrative tool for breast cancer subtype stratification.3Mont:一种用于乳腺癌亚型分层的多组学整合工具。
PLoS One. 2025 Jun 27;20(6):e0326154. doi: 10.1371/journal.pone.0326154. eCollection 2025.
Nat Rev Nephrol. 2021 Nov;17(11):710-724. doi: 10.1038/s41581-021-00463-x. Epub 2021 Aug 20.
4
Multi-omics data integration by generative adversarial network.基于生成对抗网络的多组学数据整合。
Bioinformatics. 2021 Dec 22;38(1):179-186. doi: 10.1093/bioinformatics/btab608.
5
Integrative Multi-Omics Approaches in Cancer Research: From Biological Networks to Clinical Subtypes.癌症研究中的整合多组学方法:从生物网络到临床亚型
Mol Cells. 2021 Jul 31;44(7):433-443. doi: 10.14348/molcells.2021.0042.
6
Bioactivity descriptors for uncharacterized chemical compounds.生物活性描述符用于未表征的化学化合物。
Nat Commun. 2021 Jun 24;12(1):3932. doi: 10.1038/s41467-021-24150-4.
7
MOGONET integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification.MOGONET 通过使用图卷积网络整合多组学数据,从而实现患者分类和生物标志物识别。
Nat Commun. 2021 Jun 8;12(1):3445. doi: 10.1038/s41467-021-23774-w.
8
MatchMaker: A Deep Learning Framework for Drug Synergy Prediction.MatchMaker:一种用于药物协同作用预测的深度学习框架。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2334-2344. doi: 10.1109/TCBB.2021.3086702. Epub 2022 Aug 8.
9
Prediction of Synthetic Lethal Interactions in Human Cancers Using Multi-View Graph Auto-Encoder.利用多视图图自动编码器预测人类癌症中的合成致死相互作用。
IEEE J Biomed Health Inform. 2021 Oct;25(10):4041-4051. doi: 10.1109/JBHI.2021.3079302. Epub 2021 Oct 5.
10
Multi-view Multichannel Attention Graph Convolutional Network for miRNA-disease association prediction.多视图多通道注意力图卷积网络用于 miRNA-疾病关联预测。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab174.