Seoul National University Biomedical Informatics (SNUBI), Div. of Biomedical Informatics, Seoul National University College of Medicine, Seoul 110-799, Republic of Korea.
J Biomed Inform. 2012 Dec;45(6):1191-8. doi: 10.1016/j.jbi.2012.07.008. Epub 2012 Aug 15.
There have been many attempts in cancer clinical-type classification by using a dataset from a number of molecular layers of biological system. Despite these efforts, however, it still remains difficult to elucidate the cancer phenotypes because the cancer genome is neither simple nor independent but rather complicated and dysregulated by multiple molecular mechanisms. Recently, heterogeneous types of data, generated from all molecular levels of 'omic' dimensions from genome to phenome, for instance, copy number variants at the genome level, DNA methylation at the epigenome level, and gene expression and microRNA at the transcriptome level, have become available. In this paper, we propose an integrated framework that uses multi-level genomic data for prediction of clinical outcomes in brain cancer (glioblastoma multiforme, GBM) and ovarian cancer (serous cystadenocarcinoma, OV). From empirical comparison results on individual genomic data, we provide some preliminary insights about which level of data is more informative to a given clinical-type classification problem and justify these perceptions with the corresponding biological implications for each type of cancer. For GBM, all clinical outcomes had a better the area under the curve (AUC) of receiver operating characteristic when integrating multi-layers of genomic data, 0.876 for survival to 0.832 for recurrence. Moreover, the better AUCs were achieved from the integration approach for all clinical outcomes in OV as well, ranging from 0.787 to 0.893. We found that the opportunity for success in prediction of clinical outcomes in cancer was increased when the prediction was based on the integration of multi-layers of genomic data. This study is expecting to improve comprehension of the molecular pathogenesis and underlying biology of both cancer types.
已经有许多尝试使用来自生物系统多个分子层面的数据集对癌症临床类型进行分类。然而,尽管做出了这些努力,要阐明癌症表型仍然很困难,因为癌症基因组既不简单也不独立,而是由多种分子机制复杂地失调。最近,从基因组到表型的“组学”维度的所有分子层面生成了异质类型的数据,例如基因组水平的拷贝数变异、表观基因组水平的 DNA 甲基化以及转录组水平的基因表达和 microRNA。在本文中,我们提出了一个集成框架,该框架使用多层次基因组数据来预测脑癌(多形性胶质母细胞瘤,GBM)和卵巢癌(浆液性囊腺癌,OV)的临床结局。通过对个体基因组数据的经验比较结果,我们提供了一些初步的见解,了解哪些层次的数据对给定的临床类型分类问题更具信息量,并为每种类型的癌症提供相应的生物学意义来证明这些看法。对于 GBM,当整合多层次基因组数据时,所有临床结局的生存和复发的曲线下面积(AUC)都更好,分别为 0.876 和 0.832。此外,在 OV 中,所有临床结局的 AUC 也都通过整合方法得到了改善,范围从 0.787 到 0.893。我们发现,当基于整合多层次基因组数据进行预测时,癌症临床结局预测的成功机会增加。这项研究有望提高对这两种癌症类型的分子发病机制和潜在生物学的理解。