• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

推断:基于多组学数据的多头注意力解耦对比学习发现癌症亚型。

DEDUCE: Multi-head attention decoupled contrastive learning to discover cancer subtypes based on multi-omics data.

机构信息

College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410083, Hunan, China.

Department of Thoracic Surgery, The Second Xiangya Hospital, Central South University, Changsha, 410083, Hunan, China.

出版信息

Comput Methods Programs Biomed. 2024 Dec;257:108478. doi: 10.1016/j.cmpb.2024.108478. Epub 2024 Oct 30.

DOI:10.1016/j.cmpb.2024.108478
PMID:39504713
Abstract

BACKGROUND AND OBJECTIVE

Given the high heterogeneity and clinical diversity of cancer, substantial variations exist in multi-omics data and clinical features across different cancer subtypes.

METHODS

We propose a model, named DEDUCE, based on a symmetric multi-head attention encoders (SMAE), for unsupervised contrastive learning to analyze multi-omics cancer data, with the aim of identifying and characterizing cancer subtypes. This model adopts a unsupervised SMAE that can deeply extract contextual features and long-range dependencies from multi-omics data, thereby mitigating the impact of noise. Importantly, DEDUCE introduces a subtype decoupled contrastive learning method based on a multi-head attention mechanism to simultaneously learn features from multi-omics data and perform clustering for identifying cancer subtypes. Subtypes are clustered by calculating the similarity between samples in both the feature space and sample space of multi-omics data. The fundamental concept involves decoupling various attributes of multi-omics data features and learning them as contrasting terms. A contrastive loss function is constructed to quantify the disparity between positive and negative examples, and the model minimizes this difference, thereby promoting the acquisition of enhanced feature representation.

RESULTS

The DEDUCE model undergoes extensive experiments on simulated multi-omics datasets, single-cell multi-omics datasets, and cancer multi-omics datasets, outperforming 10 deep learning models. The DEDUCE model outperforms state-of-the-art methods, and ablation experiments demonstrate the effectiveness of each module in the DEDUCE model. Finally, we applied the DEDUCE model to identify six cancer subtypes of AML.

CONCLUSION

In this paper, we proposed DEDUCE model learns features from multi-omics data through SMAE, and the subtype decoupled contrastive learning consistently optimizes the model for clustering and identifying cancer subtypes. The DEDUCE model demonstrates a significant capability in discovering new cancer subtypes. We applied the DEDUCE model to identify six subtypes of AML. Through the analysis of GO function enrichment, subtype-specific biological functions, and GSEA of AML using the DEDUCE model, the interpretability of the DEDUCE model in identifying cancer subtypes is further enhanced.

摘要

背景与目的

由于癌症的高度异质性和临床多样性,不同癌症亚型的多组学数据和临床特征存在很大差异。

方法

我们提出了一种基于对称多头注意力编码器(SMAE)的模型 DEDUCE,用于无监督对比学习分析多组学癌症数据,旨在识别和表征癌症亚型。该模型采用无监督 SMAE,可以从多组学数据中深度提取上下文特征和长程依赖关系,从而减轻噪声的影响。重要的是,DEDUCE 引入了一种基于多头注意力机制的亚型解耦对比学习方法,用于同时从多组学数据中学习特征,并进行聚类以识别癌症亚型。通过计算多组学数据的特征空间和样本空间中样本之间的相似度来对亚型进行聚类。基本思想是解耦多组学数据特征的各种属性,并将它们学习为对比项。构建对比损失函数来量化正例和负例之间的差异,模型最小化这个差异,从而促进获取增强的特征表示。

结果

DEDUCE 模型在模拟多组学数据集、单细胞多组学数据集和癌症多组学数据集中进行了广泛的实验,优于 10 种深度学习模型。DEDUCE 模型优于最新方法,消融实验证明了 DEDUCE 模型中每个模块的有效性。最后,我们应用 DEDUCE 模型来识别 AML 的六种癌症亚型。

结论

本文提出的 DEDUCE 模型通过 SMAE 从多组学数据中学习特征,亚型解耦对比学习一致优化模型进行聚类和识别癌症亚型。DEDUCE 模型在发现新的癌症亚型方面表现出显著的能力。我们应用 DEDUCE 模型来识别 AML 的六种亚型。通过使用 DEDUCE 模型对 AML 进行 GO 功能富集分析、亚型特异性生物学功能分析和 GSEA 分析,进一步增强了 DEDUCE 模型在识别癌症亚型方面的可解释性。

相似文献

1
DEDUCE: Multi-head attention decoupled contrastive learning to discover cancer subtypes based on multi-omics data.推断:基于多组学数据的多头注意力解耦对比学习发现癌症亚型。
Comput Methods Programs Biomed. 2024 Dec;257:108478. doi: 10.1016/j.cmpb.2024.108478. Epub 2024 Oct 30.
2
Subtype-DCC: decoupled contrastive clustering method for cancer subtype identification based on multi-omics data.Subtype-DCC:基于多组学数据的用于癌症亚型识别的解耦对比聚类方法。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad025.
3
Cancer subtype identification by multi-omics clustering based on interpretable feature and latent subspace learning.基于可解释特征和潜在子空间学习的多组学聚类进行癌症亚型识别。
Methods. 2024 Nov;231:144-153. doi: 10.1016/j.ymeth.2024.09.014. Epub 2024 Sep 24.
4
A Contrastive-Learning-Based Deep Neural Network for Cancer Subtyping by Integrating Multi-Omics Data.基于对比学习的深度学习神经网络,通过整合多组学数据进行癌症亚型分类。
Interdiscip Sci. 2024 Dec;16(4):966-975. doi: 10.1007/s12539-024-00641-y. Epub 2024 Sep 4.
5
Subtype-MGTP: a cancer subtype identification framework based on multi-omics translation.基于多组学翻译的癌症亚型识别框架 Subtype-MGTP
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae360.
6
WMLRR: A Weighted Multi-View Low Rank Representation to Identify Cancer Subtypes From Multiple Types of Omics Data.WMLRR:一种加权多视图低秩表示方法,用于从多种类型的组学数据中识别癌症亚型。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2891-2897. doi: 10.1109/TCBB.2021.3063284. Epub 2021 Dec 8.
7
MCluster-VAEs: An end-to-end variational deep learning-based clustering method for subtype discovery using multi-omics data.MCluster-VAEs:一种基于变分深度学习的端到端聚类方法,用于利用多组学数据进行亚型发现。
Comput Biol Med. 2022 Nov;150:106085. doi: 10.1016/j.compbiomed.2022.106085. Epub 2022 Sep 6.
8
Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping.新型多组学去混淆变分自动编码器可获得有意义的疾病亚型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae512.
9
Deep multi-view contrastive learning for cancer subtype identification.用于癌症亚型识别的深度多视图对比学习
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad282.
10
CLCLSA: Cross-omics linked embedding with contrastive learning and self attention for integration with incomplete multi-omics data.CLCLSA:基于对比学习和自注意力机制的跨组学关联嵌入方法,用于整合不完整的多组学数据。
Comput Biol Med. 2024 Mar;170:108058. doi: 10.1016/j.compbiomed.2024.108058. Epub 2024 Jan 28.