Gao Bo, Li Guojun, Liu Juntao, Li Yang, Huang Xiuzhen
School of Mathematics, Shandong University, Jinan, Shandong, 250100, China.
Department of Computer Science, Arkansas State University, Jonesboro, Arkansas, 72401, USA.
Oncotarget. 2017 May 30;8(22):36115-36126. doi: 10.18632/oncotarget.16433.
It is widely accepted that cancer is driven by accumulated somatic mutations during the lifetime of an individual. Cancer mutations may target relatively small number of cell functional modules. The heterogeneity in different cancer patients makes it difficult to identify driver mutations or functional modules related to cancer. It is biologically desired to be capable of identifying cancer pathway modules through coordination between coverage and exclusivity. There have been a few approaches developed for this purpose, but they all have limitations in practice due to their computational complexity and prediction accuracy. We present a network based approach, CovEx, to predict the specific patient oriented modules by 1) discovering candidate modules for each considered gene, 2) extracting significant candidates by harmonizing coverage and exclusivity and, 3) further selecting the patient oriented modules based on a set cover model. Applying CovEx to pan-cancer datasets spanning 12 cancer types collecting from public database TCGA, it demonstrates significant superiority over the current leading competitors in performance. It is published under GNU GENERAL PUBLIC LICENSE and the source code is available at: https://sourceforge.net/projects/cancer-pathway/files/.
人们普遍认为,癌症是由个体一生中积累的体细胞突变驱动的。癌症突变可能针对相对少数的细胞功能模块。不同癌症患者的异质性使得难以识别与癌症相关的驱动突变或功能模块。从生物学角度来看,希望能够通过覆盖度和排他性之间的协调来识别癌症通路模块。为此已经开发了一些方法,但由于它们的计算复杂性和预测准确性,在实践中都存在局限性。我们提出了一种基于网络的方法CovEx,通过以下步骤预测特定患者导向的模块:1)为每个考虑的基因发现候选模块;2)通过协调覆盖度和排他性提取显著候选模块;3)基于集合覆盖模型进一步选择患者导向的模块。将CovEx应用于从公共数据库TCGA收集的涵盖12种癌症类型的泛癌数据集,结果表明它在性能上明显优于当前领先的竞争对手。它是根据GNU通用公共许可证发布的,源代码可在以下网址获取:https://sourceforge.net/projects/cancer-pathway/files/ 。