Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, United Kingdom.
Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, OX3 9DU, United Kingdom.
Genome Res. 2019 Jul;29(7):1047-1056. doi: 10.1101/gr.239756.118. Epub 2019 Jun 21.
Approximately 2% of de novo single-nucleotide variants (SNVs) appear as part of clustered mutations that create multinucleotide variants (MNVs). MNVs are an important source of genomic variability as they are more likely to alter an encoded protein than a SNV, which has important implications in disease as well as evolution. Previous studies of MNVs have focused on their mutational origins and have not systematically evaluated their functional impact and contribution to disease. We identified 69,940 MNVs and 91 de novo MNVs in 6688 exome-sequenced parent-offspring trios from the Deciphering Developmental Disorders Study comprising families with severe developmental disorders. We replicated the previously described MNV mutational signatures associated with DNA polymerase zeta, an error-prone translesion polymerase, and the APOBEC family of DNA deaminases. We estimate the simultaneous MNV germline mutation rate to be 1.78 × 10 mutations per base pair per generation. We found that most MNVs within a single codon create a missense change that could not have been created by a SNV. MNV-induced missense changes were, on average, more physicochemically divergent, were more depleted in highly constrained genes (pLI ≥ 0.9), and were under stronger purifying selection compared with SNV-induced missense changes. We found that de novo MNVs were significantly enriched in genes previously associated with developmental disorders in affected children. This shows that MNVs can be more damaging than SNVs even when both induce missense changes, and are an important variant type to consider in relation to human disease.
大约 2%的从头单核苷酸变异(SNV)会作为簇状突变的一部分出现,从而产生多核苷酸变异(MNV)。MNV 是基因组变异性的重要来源,因为它们比 SNV 更有可能改变编码蛋白,而 SNV 对疾病和进化都有重要影响。先前对 MNV 的研究主要集中在它们的突变起源上,而没有系统地评估它们的功能影响及其对疾病的贡献。我们在来自“破解发育障碍研究”的 6688 个外显子组测序的亲子三代 6688 个个体中鉴定了 69940 个 MNV 和 91 个新出现的 MNV,这些个体来自患有严重发育障碍的家庭。我们复制了先前描述的与 DNA 聚合酶 zeta(一种易错跨损伤聚合酶)和 APOBEC 家族的 DNA 脱氨酶相关的 MNV 突变特征。我们估计同时发生的 MNV 种系突变率为每个碱基对每代 1.78×10 个突变。我们发现,单个密码子内的大多数 MNV 会产生一种错义变化,而这种变化不可能由 SNV 产生。MNV 诱导的错义变化平均在物理化学上更为不同,在高度约束的基因(pLI≥0.9)中更为匮乏,并且与 SNV 诱导的错义变化相比受到更强的纯化选择。我们发现,新出现的 MNV 在受影响儿童中与发育障碍相关的基因中明显富集。这表明,即使 MNV 和 SNV 都诱导错义变化,MNV 也可能比 SNV 更具破坏性,并且是与人类疾病相关的重要变异类型。