1] National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China [2] MOE Key Laboratory of Bioinformatics and Bioinformatics Division, TNLIST/Department of Automation, Tsinghua University, Beijing 100084, China.
MOE Key Laboratory of Bioinformatics and Bioinformatics Division, TNLIST/Department of Automation, Tsinghua University, Beijing 100084, China.
Sci Rep. 2013 Dec 18;3:3538. doi: 10.1038/srep03538.
Cancer is a genomic disease associated with a plethora of gene mutations resulting in a loss of control over vital cellular functions. Among these mutated genes, driver genes are defined as being causally linked to oncogenesis, while passenger genes are thought to be irrelevant for cancer development. With increasing numbers of large-scale genomic datasets available, integrating these genomic data to identify driver genes from aberration regions of cancer genomes becomes an important goal of cancer genome analysis and investigations into mechanisms responsible for cancer development. A computational method, MAXDRIVER, is proposed here to identify potential driver genes on the basis of copy number aberration (CNA) regions of cancer genomes, by integrating publicly available human genomic data. MAXDRIVER employs several optimization strategies to construct a heterogeneous network, by means of combining a fused gene functional similarity network, gene-disease associations and a disease phenotypic similarity network. MAXDRIVER was validated to effectively recall known associations among genes and cancers. Previously identified as well as novel driver genes were detected by scanning CNAs of breast cancer, melanoma and liver carcinoma. Three predicted driver genes (CDKN2A, AKT1, RNF139) were found common in these three cancers by comparative analysis.
癌症是一种与大量基因突变相关的基因组疾病,这些基因突变导致对重要细胞功能的失控。在这些突变基因中,驱动基因被定义为与肿瘤发生有因果关系,而乘客基因则被认为与癌症的发展无关。随着越来越多的大规模基因组数据集的出现,将这些基因组数据整合起来,从癌症基因组的异常区域中识别驱动基因,成为癌症基因组分析和研究癌症发展机制的重要目标。本文提出了一种计算方法 MAXDRIVER,该方法基于癌症基因组的拷贝数异常(CNA)区域,整合公共可用的人类基因组数据,来识别潜在的驱动基因。MAXDRIVER 采用了几种优化策略来构建一个异构网络,方法是结合融合基因功能相似性网络、基因-疾病关联和疾病表型相似性网络。通过扫描乳腺癌、黑色素瘤和肝癌的 CNA,验证了 MAXDRIVER 有效地召回了基因和癌症之间的已知关联。通过扫描 CNA,检测到了先前确定的以及新的驱动基因。通过比较分析,在这三种癌症中发现了三个预测的驱动基因(CDKN2A、AKT1、RNF139)是共同的。