Biometric Research Branch, National Cancer Institute, Bethesda, Maryland, USA.
BMC Bioinformatics. 2013 Dec 13;14:363. doi: 10.1186/1471-2105-14-363.
Recent developments in high-throughput genomic technologies make it possible to have a comprehensive view of genomic alterations in tumors on a whole genome scale. Only a small number of somatic alterations detected in tumor genomes are driver alterations which drive tumorigenesis. Most of the somatic alterations are passengers that are neutral to tumor cell selection. Although most research efforts are focused on analyzing driver alterations, the passenger alterations also provide valuable information about the history of tumor development.
In this paper, we develop a method for estimating the age of the tumor lineage and the timing of the driver alterations based on the number of passenger alterations. This method also identifies mutator genes which increase genomic instability when they are altered and provides estimates of the increased rate of alterations caused by each mutator gene. We applied this method to copy number data and DNA sequencing data for ovarian and lung tumors. We identified well known mutators such as TP53, PRKDC, BRCA1/2 as well as new mutator candidates PPP2R2A and the chromosomal region 22q13.33. We found that most mutator genes alter early during tumorigenesis and were able to estimate the age of individual tumor lineage in cell generations.
This is the first computational method to identify mutator genes and to take into account the increase of the alteration rate by mutator genes, providing more accurate estimates of the tumor age and the timing of driver alterations.
高通量基因组技术的最新进展使得在全基因组范围内全面观察肿瘤中的基因组改变成为可能。在肿瘤基因组中检测到的少数体细胞改变是驱动肿瘤发生的驱动改变。大多数体细胞改变是对肿瘤细胞选择呈中性的乘客改变。尽管大多数研究都集中在分析驱动改变上,但乘客改变也为肿瘤发展的历史提供了有价值的信息。
在本文中,我们开发了一种基于乘客改变数量来估计肿瘤谱系年龄和驱动改变时间的方法。该方法还识别了在改变时增加基因组不稳定性的突变基因,并提供了每个突变基因引起的改变率增加的估计。我们将该方法应用于卵巢和肺癌的拷贝数数据和 DNA 测序数据。我们确定了已知的突变基因,如 TP53、PRKDC、BRCA1/2,以及新的突变候选基因 PPP2R2A 和染色体区域 22q13.33。我们发现大多数突变基因在肿瘤发生的早期就发生了改变,并能够估计单个肿瘤谱系的细胞世代年龄。
这是第一个识别突变基因并考虑突变基因改变率增加的计算方法,它提供了更准确的肿瘤年龄和驱动改变时间的估计。