Zhao Jing, Zhao Bowen, Song Xiaotong, Lyu Chujun, Chen Weizhi, Xiong Yi, Wei Dong-Qing
State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai, 200240, China.
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad025.
Due to the high heterogeneity and complexity of cancers, patients with different cancer subtypes often have distinct groups of genomic and clinical characteristics. Therefore, the discovery and identification of cancer subtypes are crucial to cancer diagnosis, prognosis and treatment. Recent technological advances have accelerated the increasing availability of multi-omics data for cancer subtyping. To take advantage of the complementary information from multi-omics data, it is necessary to develop computational models that can represent and integrate different layers of data into a single framework. Here, we propose a decoupled contrastive clustering method (Subtype-DCC) based on multi-omics data integration for clustering to identify cancer subtypes. The idea of contrastive learning is introduced into deep clustering based on deep neural networks to learn clustering-friendly representations. Experimental results demonstrate the superior performance of the proposed Subtype-DCC model in identifying cancer subtypes over the currently available state-of-the-art clustering methods. The strength of Subtype-DCC is also supported by the survival and clinical analysis.
由于癌症具有高度的异质性和复杂性,不同癌症亚型的患者通常具有不同的基因组和临床特征组。因此,癌症亚型的发现和鉴定对于癌症的诊断、预后和治疗至关重要。最近的技术进步加速了用于癌症亚型分类的多组学数据的日益可用性。为了利用多组学数据中的互补信息,有必要开发能够将不同数据层表示并整合到单个框架中的计算模型。在此,我们提出了一种基于多组学数据整合的解耦对比聚类方法(Subtype-DCC)用于聚类以识别癌症亚型。将对比学习的思想引入基于深度神经网络的深度聚类中,以学习对聚类友好的表示。实验结果表明,与目前可用的最先进聚类方法相比,所提出的Subtype-DCC模型在识别癌症亚型方面具有卓越的性能。生存和临床分析也支持了Subtype-DCC的优势。