Representation Learning for the Clustering of Multi-Omics Data.

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):135-145. doi: 10.1109/TCBB.2021.3060340. Epub 2022 Feb 3.

DOI:10.1109/TCBB.2021.3060340
PMID:33600320
Abstract

The integration of several sources of data for the identification of subtypes of diseases has gained attention over the past few years. The heterogeneity and the high dimensions of the data sets call for an adequate representation of the data. We summarize the field of representation learning for the multi-omics clustering problem and we investigate several techniques to learn relevant combined representations, using methods from group factor analysis (PCA, MFA and extensions) and from machine learning with autoencoders. We highlight the importance of appropriately designing and training the latter, notably with a novel combination of a disjointed deep autoencoder (DDAE) architecture and a layer-wise reconstruction loss. These different representations can then be clustered to identify biologically meaningful clusters of patients. We provide a unifying framework for model comparison between statistical and deep learning approaches with the introduction of a new weighted internal clustering index that evaluates how well the clustering information is retained from each source, favoring contributions from all data sets. We apply our methodology to two case studies for which prior integrative clustering work exists, TCGA Breast Cancer and TARGET Neuroblastoma, and show how our method can yield good and well-balanced clusters across the different data sources.
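
The abstract does not give the formula of the weighted internal clustering index, so the following is only a minimal sketch of the general idea under stated assumptions: one set of cluster labels is scored separately in each data source with a silhouette width, and the per-source scores are combined with a weighted mean. The function names `silhouette` and `weighted_index`, the choice of the silhouette as the internal index, the equal default weights, and the toy data are all hypothetical and are not taken from the paper.

```python
import math


def silhouette(points, labels):
    """Mean silhouette width of `labels` within one data source (Euclidean distance)."""
    n = len(points)
    clusters = sorted(set(labels))
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")
    total = 0.0
    for i in range(n):
        # Mean distance from point i to the members of each cluster, excluding i itself.
        mean_dist = {}
        for c in clusters:
            members = [j for j in range(n) if labels[j] == c and j != i]
            if members:
                dists = [math.dist(points[i], points[j]) for j in members]
                mean_dist[c] = sum(dists) / len(dists)
        own = labels[i]
        if own not in mean_dist:
            continue  # singleton cluster: its silhouette is taken as 0
        a = mean_dist[own]
        b = min(d for c, d in mean_dist.items() if c != own)
        total += (b - a) / max(a, b) if max(a, b) > 0 else 0.0
    return total / n


def weighted_index(sources, labels, weights=None):
    """Weighted mean of per-source silhouettes (an assumed stand-in for the paper's index)."""
    names = list(sources)
    if weights is None:
        weights = {name: 1.0 for name in names}  # assumption: every source counts equally
    per_source = {name: silhouette(sources[name], labels) for name in names}
    total_weight = sum(weights[name] for name in names)
    score = sum(weights[name] * per_source[name] for name in names) / total_weight
    return score, per_source


# Toy data: four patients measured in two hypothetical sources.
sources = {
    "expression": [[0.0, 0.0], [0.0, 1.0], [10.0, 0.0], [10.0, 1.0]],
    "methylation": [[0.0], [1.0], [0.0], [1.0]],
}
labels = [0, 0, 1, 1]  # a clustering that follows "expression" only

score, per_source = weighted_index(sources, labels)
print(per_source, score)
```

In this toy case the labels separate the expression source well but cut across the methylation source, so the combined score is pulled down. That is the behavior the abstract describes as favoring contributions from all data sets: a clustering driven by a single source is penalized.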

Similar articles

1
Representation Learning for the Clustering of Multi-Omics Data.
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):135-145. doi: 10.1109/TCBB.2021.3060340. Epub 2022 Feb 3.
2
Capturing the latent space of an Autoencoder for multi-omics integration and cancer subtyping.
Comput Biol Med. 2022 Sep;148:105832. doi: 10.1016/j.compbiomed.2022.105832. Epub 2022 Jul 5.
3
MCluster-VAEs: An end-to-end variational deep learning-based clustering method for subtype discovery using multi-omics data.
Comput Biol Med. 2022 Nov;150:106085. doi: 10.1016/j.compbiomed.2022.106085. Epub 2022 Sep 6.
4
Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping.
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae512.
5
PathME: pathway based multi-modal sparse autoencoders for clustering of patient-level multi-omics data.
BMC Bioinformatics. 2020 Apr 16;21(1):146. doi: 10.1186/s12859-020-3465-2.
6
Autoencoder-assisted latent representation learning for survival prediction and multi-view clustering on multi-omics cancer subtyping.
Math Biosci Eng. 2023 Nov 27;20(12):21098-21119. doi: 10.3934/mbe.2023933.
7
Multi-view spectral clustering with latent representation learning for applications on multi-omics cancer subtyping.
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac500.
8
Sparsely Connected Autoencoders: A Multi-Purpose Tool for Single Cell omics Analysis.
Int J Mol Sci. 2021 Nov 25;22(23):12755. doi: 10.3390/ijms222312755.
9
Achieving deep clustering through the use of variational autoencoders and similarity-based loss.
Math Biosci Eng. 2022 Jul 22;19(10):10344-10360. doi: 10.3934/mbe.2022484.
10
Representation learning via Dual-Autoencoder for recommendation.
Neural Netw. 2017 Jun;90:83-89. doi: 10.1016/j.neunet.2017.03.009. Epub 2017 Mar 27.

Cited by

1
A machine learning and deep learning-based integrated multi-omics technique for leukemia prediction.
Heliyon. 2024 Feb 1;10(3):e25369. doi: 10.1016/j.heliyon.2024.e25369. eCollection 2024 Feb 15.
2
The childhood arthritis radiographic score of the hip: the proposal cut-off value using cluster analysis.
Clin Rheumatol. 2024 Jan;43(1):465-472. doi: 10.1007/s10067-023-06749-8. Epub 2023 Aug 28.
3
Rise of Deep Learning Clinical Applications and Challenges in Omics Data: A Systematic Review.
Diagnostics (Basel). 2023 Feb 10;13(4):664. doi: 10.3390/diagnostics13040664.