Vaksman Zalman, Fonville Natalie C, Tae Hongseok, Garner Harold R
Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, Virginia, 24061, United States of America.
Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, Virginia, 24061, United States of America; Genomeon LLC, Floyd, Virginia, 24091, United States of America.
PLoS One. 2014 Nov 17;9(11):e110263. doi: 10.1371/journal.pone.0110263. eCollection 2014.
Microsatellites (MST), tandem repeats of 1-6 nucleotide motifs, are mutational hot-spots with a bias for insertions and deletions (INDELs) rather than single nucleotide polymorphisms (SNPs). The majority of MST instability studies are limited to a small number of loci, the Bethesda markers, which are only informative for a subset of colorectal cancers. In this paper we evaluate non-haplotype alleles present within next-gen sequencing data to evaluate somatic MST variation (SMV) within DNA repair proficient and DNA repair defective cell lines. We confirm that alleles present within next-gen data that do not contribute to the haplotype can be reliably quantified and utilized to evaluate the SMV without requiring comparisons of matched samples. We observed that SMV patterns found in DNA repair proficient cell lines without DNA repair defects, MCF10A, HEK293 and PD20 RV:D2, had consistent patterns among samples. Further, we were able to confirm that changes in SMV patterns in cell lines lacking functional BRCA2, FANCD2 and mismatch repair were consistent with the different pathways perturbed. Using this new exome sequencing analysis approach we show that DNA instability can be identified in a sample and that patterns of instability vary depending on the impaired DNA repair mechanism, and that genes harboring minor alleles are strongly associated with cancer pathways. The MST Minor Allele Caller used for this study is available at https://github.com/zalmanv/MST_minor_allele_caller.
微卫星(MST)是1 - 6个核苷酸基序的串联重复序列,是突变热点,倾向于发生插入和缺失(INDEL)而非单核苷酸多态性(SNP)。大多数MST不稳定性研究仅限于少数几个位点,即贝塞斯达标记,这些标记仅对一部分结直肠癌具有信息价值。在本文中,我们评估了二代测序数据中存在的非单倍型等位基因,以评估DNA修复功能正常和DNA修复缺陷细胞系中的体细胞MST变异(SMV)。我们证实,二代数据中不构成单倍型的等位基因可以可靠地定量,并用于评估SMV,而无需对匹配样本进行比较。我们观察到,在没有DNA修复缺陷的DNA修复功能正常的细胞系MCF10A、HEK293和PD20 RV:D2中发现的SMV模式在样本之间具有一致性。此外,我们能够证实,缺乏功能性BRCA2、FANCD2和错配修复的细胞系中SMV模式的变化与不同的受干扰途径一致。使用这种新的外显子组测序分析方法,我们表明可以在样本中识别DNA不稳定性,并且不稳定性模式因受损的DNA修复机制而异,并且携带次要等位基因的基因与癌症途径密切相关。本研究使用的MST次要等位基因调用程序可在https://github.com/zalmanv/MST_minor_allele_caller获取。