Department of Computer Engineering, Antalya Bilim University, Antalya, Turkey.
Electrical and Computer Engineering Graduate Program, Antalya Bilim University, Antalya, Turkey.
BMC Bioinformatics. 2021 Feb 10;22(1):62. doi: 10.1186/s12859-021-03989-w.
Recent cancer genomic studies have generated detailed molecular data on a large number of cancer patients. A key remaining problem in cancer genomics is the identification of driver genes.
We propose BetweenNet, a computational approach that integrates genomic data with a protein-protein interaction network to identify cancer driver genes. BetweenNet utilizes a measure based on betweenness centrality on patient specific networks to identify the so-called outlier genes that correspond to dysregulated genes for each patient. Setting up the relationship between the mutated genes and the outliers through a bipartite graph, it employs a random-walk process on the graph, which provides the final prioritization of the mutated genes. We compare BetweenNet against state-of-the art cancer gene prioritization methods on lung, breast, and pan-cancer datasets.
Our evaluations show that BetweenNet is better at recovering known cancer genes based on multiple reference databases. Additionally, we show that the GO terms and the reference pathways enriched in BetweenNet ranked genes and those that are enriched in known cancer genes overlap significantly when compared to the overlaps achieved by the rankings of the alternative methods.
最近的癌症基因组研究已经为大量癌症患者生成了详细的分子数据。癌症基因组学中一个关键的遗留问题是确定驱动基因。
我们提出了 BetweenNet,这是一种计算方法,它将基因组数据与蛋白质-蛋白质相互作用网络集成在一起,以识别癌症驱动基因。BetweenNet 利用基于患者特定网络上的中间中心性的度量来识别所谓的异常基因,这些基因对应于每个患者失调的基因。通过二分图在突变基因和异常基因之间建立关系,它在图上使用随机游走过程,为突变基因提供最终的优先级排序。我们将 BetweenNet 与肺癌、乳腺癌和泛癌数据集上的最先进的癌症基因优先级排序方法进行了比较。
我们的评估表明,基于多个参考数据库,BetweenNet 更善于恢复已知的癌症基因。此外,我们还表明,与其他方法的排名所实现的重叠相比,在 BetweenNet 排名的基因中富集的 GO 术语和参考途径以及在已知癌症基因中富集的基因重叠显著。