Stead Lucy F, Thygesen Helene, Westhead David R, Rabbitts Pamela
Leeds Institute of Cancer and Pathology, University of Leeds, St James's University Hospital, Leeds, United Kingdom.
Int J Cancer. 2015 Jan 1;136(1):241-5. doi: 10.1002/ijc.28951. Epub 2014 May 14.
The catalogue of tumour-specific somatic mutations (SMs) is growing rapidly owing to the advent of next-generation sequencing. Identifying those mutations responsible for the development and progression of the disease, so-called driver mutations, will increase our understanding of carcinogenesis and provide candidates for targeted therapeutics. The phenotypic consequence(s) of driver mutations cause them to be selected for within the tumour environment, such that many approaches aimed at distinguishing drivers are based on finding significantly somatically mutated genes. Currently, these methods are designed to analyse, or be specifically applied to, nonsynonymous mutations: those that alter an encoded protein. However, growing evidence suggests the involvement of noncoding transcripts in carcinogenesis, mutations in which may also be disease-driving. We wished to test the hypothesis that common DNA variation rates within humans can be used as a baseline from which to score the rate of SMs, irrespective of coding capacity. We preliminarily tested this by applying it to a dataset of 159,498 SMs and using the results to rank genes. This resulted in significant enrichment of known cancer genes, indicating that the approach has merit. As additional data from cancer sequencing studies are made publicly available, this approach can be refined and applied to specific cancer subtypes. We named this preliminary version of our approach PRISMAD (polymorphism rates indicate somatic mutations as drivers) and have made it publicly accessible, with scripts, via a link at www.precancer.leeds.ac.uk/software-and-datasets.
由于下一代测序技术的出现,肿瘤特异性体细胞突变(SMs)的目录正在迅速增加。识别那些导致疾病发生和发展的突变,即所谓的驱动突变,将增进我们对癌症发生机制的理解,并为靶向治疗提供候选靶点。驱动突变的表型后果促使它们在肿瘤环境中被选择出来,因此许多旨在区分驱动突变的方法都是基于寻找显著的体细胞突变基因。目前,这些方法旨在分析非同义突变,即那些改变编码蛋白质的突变,或者专门应用于此类突变。然而,越来越多的证据表明非编码转录本参与了癌症发生,其中的突变也可能是疾病驱动因素。我们希望检验这样一个假设:人类常见的DNA变异率可以作为一个基线,用以对体细胞突变率进行评分,而不论其编码能力如何。我们通过将其应用于一个包含159498个体细胞突变的数据集并利用结果对基因进行排名,初步测试了这一假设。这导致已知癌症基因的显著富集,表明该方法具有价值。随着癌症测序研究的更多数据公开可用,这种方法可以得到改进并应用于特定的癌症亚型。我们将我们方法的这个初步版本命名为PRISMAD(多态性率表明体细胞突变是驱动因素),并通过www.precancer.leeds.ac.uk/software-and-datasets上的链接,以脚本的形式使其公开可用。