Department of Medicine and Surgery, University of Milano-Bicocca, Milano, Italy.
Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milano, Italy.
Comput Biol Med. 2023 Aug;162:107064. doi: 10.1016/j.compbiomed.2023.107064. Epub 2023 May 28.
Cancer patients show heterogeneous phenotypes and very different outcomes and responses even to common treatments, such as standard chemotherapy. This state-of-affairs has motivated the need for the comprehensive characterization of cancer phenotypes and fueled the generation of large omics datasets, comprising multiple omics data reported for the same patients, which might now allow us to start deciphering cancer heterogeneity and implement personalized therapeutic strategies. In this work, we performed the analysis of four cancer types obtained from the latest efforts by The Cancer Genome Atlas, for which seven distinct omics data were available for each patient, in addition to curated clinical outcomes. We performed a uniform pipeline for raw data preprocessing and adopted the Cancer Integration via MultIkernel LeaRning (CIMLR) integrative clustering method to extract cancer subtypes. We then systematically review the discovered clusters for the considered cancer types, highlighting novel associations between the different omics and prognosis.
癌症患者表现出异质性的表型,即使接受常见的治疗,如标准化疗,其结果和反应也有很大的差异。这种情况促使我们需要全面描述癌症表型,并生成了大量的组学数据集,这些数据集包含了为同一患者报告的多种组学数据,现在可能使我们开始破解癌症异质性并实施个性化的治疗策略。在这项工作中,我们分析了来自癌症基因组图谱的最新研究中四种癌症类型的数据,每个患者除了经过精心整理的临床结果外,还有七种不同的组学数据。我们对原始数据进行了统一的预处理,并采用 Cancer Integration via MultIkernel LeaRning (CIMLR) 集成聚类方法提取癌症亚型。然后,我们系统地回顾了所考虑的癌症类型中发现的聚类,强调了不同组学与预后之间的新关联。