Sakoparnig Thomas, Fried Patrick, Beerenwinkel Niko
Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland; SIB Swiss Institute of Bioinformatics, Basel, Switzerland.
PLoS Comput Biol. 2015 Jan 8;11(1):e1004027. doi: 10.1371/journal.pcbi.1004027. eCollection 2015 Jan.
Cancer drivers are genomic alterations that provide cells containing them with a selective advantage over their local competitors, whereas neutral passengers do not change the somatic fitness of cells. Cancer-driving mutations are usually discriminated from passenger mutations by their higher degree of recurrence in tumor samples. However, there is increasing evidence that many additional driver mutations may exist that occur at very low frequencies among tumors. This observation has prompted alternative methods for driver detection, including finding groups of mutually exclusive mutations and incorporating prior biological knowledge about gene function or network structure. Dependencies among drivers due to epistatic interactions can also result in low mutation frequencies, but this effect has been ignored in driver detection so far. Here, we present a new computational approach for identifying genomic alterations that occur at low frequencies because they depend on other events. Unlike passengers, these constrained mutations display punctuated patterns of occurrence in time. We test this driver-passenger discrimination approach based on mutation timing in extensive simulation studies, and we apply it to cross-sectional copy number alteration (CNA) data from ovarian cancer, CNA and single-nucleotide variant (SNV) data from breast tumors and SNV data from colorectal cancer. Among the top ranked predicted drivers, we find low-frequency genes that have already been shown to be involved in carcinogenesis, as well as many new candidate drivers. The mutation timing approach is orthogonal and complementary to existing driver prediction methods. It will help identifying from cancer genome data the alterations that drive tumor progression.
癌症驱动因素是基因组改变,它使含有这些改变的细胞相对于其局部竞争细胞具有选择优势,而中性乘客突变则不会改变细胞的体细胞适应性。癌症驱动突变通常通过其在肿瘤样本中较高的复发程度与乘客突变区分开来。然而,越来越多的证据表明,可能存在许多其他在肿瘤中以极低频率发生的驱动突变。这一观察结果促使人们采用替代方法来检测驱动因素,包括寻找相互排斥的突变组以及纳入有关基因功能或网络结构的先验生物学知识。由于上位性相互作用导致的驱动因素之间的依赖性也可能导致低突变频率,但到目前为止,这种效应在驱动因素检测中一直被忽视。在这里,我们提出了一种新的计算方法,用于识别因依赖其他事件而以低频率发生的基因组改变。与乘客突变不同,这些受约束的突变在时间上呈现出间断的发生模式。我们在广泛的模拟研究中基于突变时间测试了这种驱动因素与乘客突变的区分方法,并将其应用于卵巢癌的横断面拷贝数改变(CNA)数据、乳腺癌的CNA和单核苷酸变异(SNV)数据以及结直肠癌的SNV数据。在排名靠前的预测驱动因素中,我们发现了已经被证明与致癌作用有关的低频基因,以及许多新的候选驱动因素。突变时间方法与现有的驱动因素预测方法是正交且互补的。它将有助于从癌症基因组数据中识别出驱动肿瘤进展的改变。