Baur Brittany, Bozdag Serdar
Department of Mathematics, Statistics and Computer Science, Marquette University, Milwaukee, WI, USA.
Department of Mathematics, Statistics and Computer Science, Marquette University, Milwaukee, WI, USA.
Genomics. 2017 Jul;109(3-4):233-240. doi: 10.1016/j.ygeno.2017.04.004. Epub 2017 Apr 21.
Copy number amplifications and deletions that are recurrent in cancer samples harbor genes that confer a fitness advantage to cancer tumor proliferation and survival. One important challenge in computational biology is to separate the causal (i.e., driver) genes from passenger genes in large, aberrated regions. Many previous studies focus on the genes within the aberration (i.e., cis genes), but do not utilize the genes that are outside of the aberrated region and dysregulated as a result of the aberration (i.e., trans genes). We propose a computational pipeline, called ProcessDriver, that prioritizes candidate drivers by relating cis genes to dysregulated trans genes and biological processes. ProcessDriver is based on the assumption that a driver cis gene should be closely associated with the dysregulated trans genes and biological processes, as opposed to previous studies that assume a driver cis gene should be the most correlated gene to the copy number of an aberrated region. We applied our method on breast, bladder and ovarian cancer data from the Cancer Genome Atlas database. Our results included previously known driver genes and cancer genes, as well as potentially novel driver genes. Additionally, many genes in the final set of drivers were linked to new tumor events after initial treatment using survival analysis. Our results highlight the importance of selecting driver genes based on their widespread downstream effects in trans.
在癌症样本中反复出现的拷贝数扩增和缺失携带着赋予癌症肿瘤增殖和存活适应性优势的基因。计算生物学中的一个重要挑战是在大的异常区域中将因果(即驱动)基因与乘客基因区分开来。许多先前的研究聚焦于异常区域内的基因(即顺式基因),但未利用异常区域外且因异常而失调的基因(即反式基因)。我们提出了一种名为ProcessDriver的计算流程,通过将顺式基因与失调的反式基因及生物学过程相关联来对候选驱动基因进行优先级排序。ProcessDriver基于这样一种假设,即驱动顺式基因应与失调的反式基因及生物学过程密切相关,这与先前假设驱动顺式基因应是与异常区域拷贝数最相关基因的研究不同。我们将我们的方法应用于来自癌症基因组图谱数据库的乳腺癌、膀胱癌和卵巢癌数据。我们的结果包括先前已知的驱动基因和癌症基因,以及潜在的新驱动基因。此外,使用生存分析对最终的驱动基因集进行初步处理后,许多基因与新的肿瘤事件相关联。我们的结果凸显了基于其广泛的反式下游效应选择驱动基因的重要性。