School of Artificial Intelligence and Computer Science, Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence, Jiangnan University, Wuxi 214122, China.
School of Intelligent Manufacturing, Nanjing University of Science and Technology, Nanjing 210094, China.
Math Biosci Eng. 2023 Nov 27;20(12):21098-21119. doi: 10.3934/mbe.2023933.
Cancer subtyping (or cancer subtypes identification) based on multi-omics data has played an important role in advancing diagnosis, prognosis and treatment, which triggers the development of advanced multi-view clustering algorithms. However, the high-dimension and heterogeneity of multi-omics data make great effects on the performance of these methods. In this paper, we propose to learn the informative latent representation based on autoencoder (AE) to naturally capture nonlinear omic features in lower dimensions, which is helpful for identifying the similarity of patients. Moreover, to take advantage of survival information or clinical information, a multi-omic survival analysis approach is embedded when integrating the similarity graph of heterogeneous data at the multi-omics level. Then, the clustering method is performed on the integrated similarity to generate subtype groups. In the experimental part, the effectiveness of the proposed framework is confirmed by evaluating five different multi-omics datasets, taken from The Cancer Genome Atlas. The results show that AE-assisted multi-omics clustering method can identify clinically significant cancer subtypes.
基于多组学数据的癌症亚型分类(或癌症亚型鉴定)在推进诊断、预后和治疗方面发挥了重要作用,这引发了先进的多视图聚类算法的发展。然而,多组学数据的高维性和异质性对这些方法的性能有很大的影响。在本文中,我们提出基于自动编码器(AE)学习信息潜在表示,以在较低维度自然地捕获非线性组学特征,这有助于识别患者的相似性。此外,为了利用生存信息或临床信息,在多组学水平上整合异质数据的相似性图时,嵌入了多组学生存分析方法。然后,在整合的相似性上执行聚类方法,以生成亚型组。在实验部分,通过评估来自癌症基因组图谱的五个不同的多组学数据集,验证了所提出框架的有效性。结果表明,AE 辅助的多组学聚类方法可以识别具有临床意义的癌症亚型。