Chen Yuxin, Wen Yuqi, Xie Chenyang, Chen Xinjian, He Song, Bo Xiaochen, Zhang Zhongnan
School of Informatics, Xiamen University, Xiamen 361005, China.
Department of Bioinformatics, Institute of Health Service and Transfusion Medicine, Beijing 100850, China.
iScience. 2023 Jul 13;26(8):107378. doi: 10.1016/j.isci.2023.107378. eCollection 2023 Aug 18.
Cancer is an extremely complex disease and each type of cancer usually has several different subtypes. Multi-omics data can provide more comprehensive biological information for identifying and discovering cancer subtypes. However, existing unsupervised cancer subtyping methods cannot effectively learn comprehensive shared and specific information of multi-omics data. Therefore, a novel method is proposed based on shared and specific representation learning. For each omics data, two autoencoders are applied to extract shared and specific information, respectively. To reduce redundancy and mutual interference, orthogonality constraint is introduced to separate shared and specific information. In addition, contrastive learning is applied to align the shared information and strengthen their consistency. Finally, the obtained shared and specific information for all samples are used for clustering tasks to achieve cancer subtyping. Experimental results demonstrate that the proposed method can effectively capture shared and specific information of multi-omics data and outperform other state-of-the-art methods on cancer subtyping.
癌症是一种极其复杂的疾病,每种癌症通常都有几种不同的亚型。多组学数据可以为识别和发现癌症亚型提供更全面的生物学信息。然而,现有的无监督癌症亚型分类方法无法有效地学习多组学数据的全面共享和特定信息。因此,提出了一种基于共享和特定表示学习的新方法。对于每组组学数据,分别应用两个自动编码器来提取共享和特定信息。为了减少冗余和相互干扰,引入正交性约束来分离共享和特定信息。此外,应用对比学习来对齐共享信息并增强其一致性。最后,将所有样本获得的共享和特定信息用于聚类任务以实现癌症亚型分类。实验结果表明,所提出的方法可以有效地捕获多组学数据的共享和特定信息,并且在癌症亚型分类方面优于其他现有最先进的方法。