• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多组学和多视角聚类算法:综述和癌症基准测试。

Multi-omic and multi-view clustering algorithms: review and cancer benchmark.

机构信息

Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel.

出版信息

Nucleic Acids Res. 2018 Nov 16;46(20):10546-10562. doi: 10.1093/nar/gky889.

DOI:10.1093/nar/gky889
PMID:30295871
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6237755/
Abstract

Recent high throughput experimental methods have been used to collect large biomedical omics datasets. Clustering of single omic datasets has proven invaluable for biological and medical research. The decreasing cost and development of additional high throughput methods now enable measurement of multi-omic data. Clustering multi-omic data has the potential to reveal further systems-level insights, but raises computational and biological challenges. Here, we review algorithms for multi-omics clustering, and discuss key issues in applying these algorithms. Our review covers methods developed specifically for omic data as well as generic multi-view methods developed in the machine learning community for joint clustering of multiple data types. In addition, using cancer data from TCGA, we perform an extensive benchmark spanning ten different cancer types, providing the first systematic comparison of leading multi-omics and multi-view clustering algorithms. The results highlight key issues regarding the use of single- versus multi-omics, the choice of clustering strategy, the power of generic multi-view methods and the use of approximated p-values for gauging solution quality. Due to the growing use of multi-omics data, we expect these issues to be important for future progress in the field.

摘要

最近的高通量实验方法被用于收集大量的生物医学组学数据集。对单个组学数据集进行聚类已被证明对生物和医学研究非常有价值。随着高通量方法成本的降低和进一步的发展,现在可以测量多组学数据。对多组学数据进行聚类有可能揭示出进一步的系统级见解,但也带来了计算和生物学方面的挑战。在这里,我们回顾了多组学聚类的算法,并讨论了应用这些算法的关键问题。我们的综述涵盖了专门为组学数据开发的方法,以及机器学习社区中为联合聚类多种数据类型而开发的通用多视图方法。此外,我们使用 TCGA 的癌症数据进行了广泛的基准测试,涵盖了十种不同的癌症类型,首次对领先的多组学和多视图聚类算法进行了系统比较。结果突出了使用单组学与多组学、聚类策略的选择、通用多视图方法的有效性以及使用近似 p 值来衡量解决方案质量等方面的关键问题。由于多组学数据的使用越来越多,我们预计这些问题将对该领域的未来发展非常重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5253/6237755/03c11e91a343/gky889fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5253/6237755/1ec0565a5511/gky889fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5253/6237755/425905b1a8ac/gky889fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5253/6237755/17aa1de74308/gky889fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5253/6237755/03c11e91a343/gky889fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5253/6237755/1ec0565a5511/gky889fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5253/6237755/425905b1a8ac/gky889fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5253/6237755/17aa1de74308/gky889fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5253/6237755/03c11e91a343/gky889fig4.jpg

相似文献

1
Multi-omic and multi-view clustering algorithms: review and cancer benchmark.多组学和多视角聚类算法:综述和癌症基准测试。
Nucleic Acids Res. 2018 Nov 16;46(20):10546-10562. doi: 10.1093/nar/gky889.
2
MONET: Multi-omic module discovery by omic selection.MONET:通过组学选择进行多组学模块发现。
PLoS Comput Biol. 2020 Sep 15;16(9):e1008182. doi: 10.1371/journal.pcbi.1008182. eCollection 2020 Sep.
3
Convex Multi-View Clustering Via Robust Low Rank Approximation With Application to Multi-Omic Data.通过稳健低秩逼近的凸多视图聚类及其在多组学数据中的应用
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3340-3352. doi: 10.1109/TCBB.2021.3122961. Epub 2022 Dec 8.
4
A multiobjective multi-view cluster ensemble technique: Application in patient subclassification.一种多目标多视图聚类集成技术:在患者分类中的应用。
PLoS One. 2019 May 23;14(5):e0216904. doi: 10.1371/journal.pone.0216904. eCollection 2019.
5
Fast dimension reduction and integrative clustering of multi-omics data using low-rank approximation: application to cancer molecular classification.使用低秩近似的多组学数据快速降维和整合聚类:在癌症分子分类中的应用
BMC Genomics. 2015 Dec 1;16:1022. doi: 10.1186/s12864-015-2223-8.
6
COPS: A novel platform for multi-omic disease subtype discovery via robust multi-objective evaluation of clustering algorithms.COPS:一种通过稳健的聚类算法多目标评估发现多组学疾病亚型的新平台。
PLoS Comput Biol. 2024 Aug 5;20(8):e1012275. doi: 10.1371/journal.pcbi.1012275. eCollection 2024 Aug.
7
Subtype identification from heterogeneous TCGA datasets on a genomic scale by multi-view clustering with enhanced consensus.通过具有增强一致性的多视图聚类,从基因组规模的异质TCGA数据集中进行亚型识别。
BMC Med Genomics. 2017 Dec 21;10(Suppl 4):75. doi: 10.1186/s12920-017-0306-x.
8
Evaluation of integrative clustering methods for the analysis of multi-omics data.评估整合聚类方法在多组学数据分析中的应用。
Brief Bioinform. 2020 Mar 23;21(2):541-552. doi: 10.1093/bib/bbz015.
9
Survey and comparative assessments of computational multi-omics integrative methods with multiple regulatory networks identifying distinct tumor compositions across pan-cancer data sets.对具有多个调控网络的计算多组学综合方法进行调查和比较评估,以识别泛癌数据集之间不同的肿瘤组成。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa102.
10
NEMO: cancer subtyping by integration of partial multi-omic data.NEMO:通过整合部分多组学数据进行癌症亚型分类。
Bioinformatics. 2019 Sep 15;35(18):3348-3356. doi: 10.1093/bioinformatics/btz058.

引用本文的文献

1
Product Centred Dirichlet Processes for Bayesian Multiview Clustering.用于贝叶斯多视图聚类的以产品为中心的狄利克雷过程
J R Stat Soc Series B Stat Methodol. 2025 Apr 30. doi: 10.1093/jrsssb/qkaf021.
2
Refining treatment strategies for non-small cell lung cancer lacking actionable mutations: insights from multi-omics studies.完善缺乏可靶向突变的非小细胞肺癌的治疗策略:多组学研究的见解
Br J Cancer. 2025 Aug 23. doi: 10.1038/s41416-025-03139-6.
3
Increasing pathogenic germline variant diagnosis rates in precision medicine: current best practices and future opportunities.

本文引用的文献

1
Robust clustering of noisy high-dimensional gene expression data for patients subtyping.对噪声高维基因表达数据进行稳健聚类,以对患者进行亚型划分。
Bioinformatics. 2018 Dec 1;34(23):4064-4072. doi: 10.1093/bioinformatics/bty502.
2
Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer.起源细胞模式主导了 33 种癌症类型的 10000 个肿瘤的分子分类。
Cell. 2018 Apr 5;173(2):291-304.e6. doi: 10.1016/j.cell.2018.03.022.
3
Opportunities and obstacles for deep learning in biology and medicine.
提高精准医学中致病种系变异的诊断率:当前最佳实践与未来机遇
Hum Genomics. 2025 Aug 22;19(1):97. doi: 10.1186/s40246-025-00811-z.
4
A review on multi-omics integration for aiding study design of large scale TCGA cancer datasets.关于多组学整合以辅助大规模TCGA癌症数据集研究设计的综述。
BMC Genomics. 2025 Aug 22;26(1):769. doi: 10.1186/s12864-025-11925-y.
5
Autoencoders with shared and specific embeddings for multi-omics data integration.用于多组学数据整合的具有共享和特定嵌入的自动编码器。
BMC Bioinformatics. 2025 Aug 19;26(1):214. doi: 10.1186/s12859-025-06245-7.
6
An efficient and interactive feature selection approach based on copula entropy for high-dimensional genetic data.一种基于Copula熵的高效交互式高维遗传数据特征选择方法。
Sci Rep. 2025 Aug 17;15(1):30100. doi: 10.1038/s41598-025-15068-8.
7
A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches.多组学数据整合方法的技术综述:从经典统计方法到深度生成方法
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf355.
8
MOTL: enhancing multi-omics matrix factorization with transfer learning.MOTL:通过迁移学习增强多组学矩阵分解
Genome Biol. 2025 Jul 25;26(1):224. doi: 10.1186/s13059-025-03675-7.
9
Targeting Lipid Transport Pathways: A New Frontier in Colorectal Cancer Therapy.靶向脂质转运途径:结直肠癌治疗的新前沿
Dig Dis Sci. 2025 Jul 24. doi: 10.1007/s10620-025-09214-5.
10
GAUDI: interpretable multi-omics integration with UMAP embeddings and density-based clustering.GAUDI:通过UMAP嵌入和基于密度的聚类实现可解释的多组学整合。
Nat Commun. 2025 Jul 1;16(1):5771. doi: 10.1038/s41467-025-60822-1.
深度学习在生物学和医学中的机遇与挑战。
J R Soc Interface. 2018 Apr;15(141). doi: 10.1098/rsif.2017.0387.
4
Multi-omics integration-a comparison of unsupervised clustering methodologies.多组学整合——无监督聚类方法比较。
Brief Bioinform. 2019 Jul 19;20(4):1269-1279. doi: 10.1093/bib/bbx167.
5
mixOmics: An R package for 'omics feature selection and multiple data integration.mixOmics:一个用于“组学”特征选择和多数据整合的R包。
PLoS Comput Biol. 2017 Nov 3;13(11):e1005752. doi: 10.1371/journal.pcbi.1005752. eCollection 2017 Nov.
6
A novel approach for data integration and disease subtyping.一种用于数据集成和疾病分型的新方法。
Genome Res. 2017 Dec;27(12):2025-2039. doi: 10.1101/gr.215129.116. Epub 2017 Oct 24.
7
Clusternomics: Integrative context-dependent clustering for heterogeneous datasets.聚类组学:针对异构数据集的整合上下文相关聚类
PLoS Comput Biol. 2017 Oct 16;13(10):e1005781. doi: 10.1371/journal.pcbi.1005781. eCollection 2017 Oct.
8
Deep Learning-Based Multi-Omics Integration Robustly Predicts Survival in Liver Cancer.基于深度学习的多组学整合可稳健预测肝癌患者的生存情况。
Clin Cancer Res. 2018 Mar 15;24(6):1248-1259. doi: 10.1158/1078-0432.CCR-17-0853. Epub 2017 Oct 5.
9
Towards clinically more relevant dissection of patient heterogeneity via survival-based Bayesian clustering.基于生存的贝叶斯聚类,探索更具临床相关性的患者异质性解剖。
Bioinformatics. 2017 Nov 15;33(22):3558-3566. doi: 10.1093/bioinformatics/btx464.
10
More Is Better: Recent Progress in Multi-Omics Data Integration Methods.越多越好:多组学数据整合方法的最新进展
Front Genet. 2017 Jun 16;8:84. doi: 10.3389/fgene.2017.00084. eCollection 2017.