Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom.
Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom.
Genome Res. 2018 Sep;28(9):1264-1271. doi: 10.1101/gr.231688.117. Epub 2018 Aug 13.
Somatic mutations show variation in density across cancer genomes. Previous studies have shown that chromatin organization and replication time domains are correlated with, and thus predictive of, this variation. Here, we analyze 1809 whole-genome sequences from 10 cancer types to show that a subset of repetitive DNA sequences, called non-B motifs that predict noncanonical secondary structure formation can independently account for variation in mutation density. Combined with epigenetic factors and replication timing, the variance explained can be improved to 43%-76%. Approximately twofold mutation enrichment is observed directly within non-B motifs, is focused on exposed structural components, and is dependent on physical properties that are optimal for secondary structure formation. Therefore, there is mounting evidence that secondary structures arising from non-B motifs are not simply associated with increased mutation density-they are possibly causally implicated. Our results suggest that they are determinants of mutagenesis and increase the likelihood of recurrent mutations in the genome. This analysis calls for caution in the interpretation of recurrent mutations and highlights the importance of taking non-B motifs that can simply be inferred from the reference sequence into consideration in background models of mutability henceforth.
体细胞突变在癌症基因组中显示出密度变化。先前的研究表明,染色质组织和复制时间域与这种变化相关,因此可以预测这种变化。在这里,我们分析了来自 10 种癌症类型的 1809 个全基因组序列,表明一组称为非 B 基序的重复 DNA 序列可以独立解释突变密度的变化,这些基序可以预测非典型的二级结构形成。将非 B 基序与表观遗传因子和复制时间结合起来,可以将解释的方差提高到 43%-76%。在非 B 基序内直接观察到大约两倍的突变富集,集中在暴露的结构成分上,并且依赖于最适合二级结构形成的物理性质。因此,越来越多的证据表明,非 B 基序产生的二级结构不仅仅与增加的突变密度有关,它们可能与突变有关。我们的研究结果表明,它们是诱变的决定因素,并增加了基因组中反复突变的可能性。这种分析呼吁在解释反复突变时要谨慎,并强调在今后的变易性背景模型中考虑从参考序列中简单推断出的非 B 基序的重要性。