Sun Shulei, Thorson John A, Murray Sarah S
Center for Advanced Laboratory Medicine, University of California San Diego Health, La Jolla, CA, USA.
Department of Pathology, University of California San Diego, La Jolla, CA, USA.
Methods Mol Biol. 2019;1908:49-60. doi: 10.1007/978-1-4939-9004-7_4.
The use of next-generation sequencing technologies has enabled the analysis of a wide spectrum of somatic mutations in tumors. This analysis can be carried out using various strategies including the use of small panels of focused, clinically actionable genes, large panels of cancer-related genes, whole exomes, and the entire genome. One of the main goals in these analyses is to identify key mutations in these tumors that drive the oncogenic process. Depending on the gene, mutations can have altering effects, such as loss of function mutations in tumor suppressor genes, to mutations that activate genes such as kinases involved with cell cycle progression or proliferation. Once the sequencing process is complete, and the alignment of the large collection of reads to the reference genome and variant calling has been carried out, one is left with a large collection of variants. The challenge then becomes assigning where the variant resides in the genome with respect to coding regions, splice site regions, regulatory regions, and what potential functional effect these variants may have on the resulting protein. Other helpful information includes determining if the variant has been identified before, and if so, the tumor type associated with the variant. In addition, if the tumor profiling experiment is not conducted with a matched specimen representing the inherited genome, various tools are helpful to determine if the variant is likely to be an inherited polymorphism or a somatic event. In this chapter, we review the various tools available for annotating variants to assist in filtering down and prioritizing the hundreds to thousands of variants down to the key variants likely to be driver mutations and relevant to the tumor being profiled.
新一代测序技术的应用使得对肿瘤中广泛的体细胞突变进行分析成为可能。这种分析可以通过多种策略来进行,包括使用少量聚焦于临床可操作基因的小面板、大量癌症相关基因的大面板、全外显子组以及整个基因组。这些分析的主要目标之一是识别驱动致癌过程的肿瘤中的关键突变。根据基因的不同,突变可能会产生不同的影响,例如肿瘤抑制基因中的功能丧失突变,以及激活与细胞周期进程或增殖相关的激酶等基因的突变。一旦测序过程完成,并且将大量读数与参考基因组进行比对并进行变异检测后,就会得到大量的变异。接下来的挑战是确定变异在基因组中相对于编码区、剪接位点区域、调控区域的位置,以及这些变异可能对产生的蛋白质有何种潜在的功能影响。其他有用的信息包括确定该变异是否之前已被识别,如果是,与该变异相关的肿瘤类型。此外,如果肿瘤分析实验不是用代表遗传基因组的匹配样本进行的,各种工具将有助于确定该变异是可能是遗传多态性还是体细胞事件。在本章中,我们将回顾可用于注释变异的各种工具,以协助将数百至数千个变异筛选并排序,确定哪些是可能的驱动突变且与所分析的肿瘤相关的关键变异。