Department of Biological Sciences, Columbia University, New York, United States.
Department of Statistics, University of Oxford, Oxford, United Kingdom.
Elife. 2023 Jan 17;12:e83172. doi: 10.7554/eLife.83172.
Causal loss-of-function (LOF) variants for Mendelian and severe complex diseases are enriched in 'mutation intolerant' genes. We show how such observations can be interpreted in light of a model of mutation-selection balance and use the model to relate the pathogenic consequences of LOF mutations at present to their evolutionary fitness effects. To this end, we first infer posterior distributions for the fitness costs of LOF mutations in 17,318 autosomal and 679 X-linked genes from exome sequences in 56,855 individuals. Estimated fitness costs for the loss of a gene copy are typically above 1%; they tend to be largest for X-linked genes, whether or not they have a Y homolog, followed by autosomal genes and genes in the pseudoautosomal region. We compare inferred fitness effects for all possible de novo LOF mutations to those of de novo mutations identified in individuals diagnosed with one of six severe, complex diseases or developmental disorders. Probands carry an excess of mutations with estimated fitness effects above 10%; as we show by simulation, when sampled in the population, such highly deleterious mutations are typically only a couple of generations old. Moreover, the proportion of highly deleterious mutations carried by probands reflects the typical age of onset of the disease. The study design also has a discernible influence: a greater proportion of highly deleterious mutations is detected in pedigree than case-control studies, and for autism, in simplex than multiplex families and in female versus male probands. Thus, anchoring observations in human genetics to a population genetic model allows us to learn about the fitness effects of mutations identified by different mapping strategies and for different traits.
孟德尔遗传疾病和严重复杂疾病的因果丧失功能(LOF)变异体在“突变不耐受”基因中富集。我们展示了如何根据突变-选择平衡模型来解释这些观察结果,并使用该模型将目前 LOF 突变的致病后果与其进化适应度效应联系起来。为此,我们首先从 56855 个人的外显子组序列中推断出 17318 个常染色体和 679 个 X 连锁基因中 LOF 突变的适应度成本的后验分布。失去一个基因拷贝的适应度成本通常高于 1%;对于无论是否有 Y 同源物的 X 连锁基因,它们的适应度成本往往最大,其次是常染色体基因和假常染色体区域的基因。我们将所有可能的新生 LOF 突变的推断适应度效应与在被诊断患有六种严重复杂疾病或发育障碍之一的个体中发现的新生突变进行比较。先证者携带的具有估计适应度效应超过 10%的突变过多;正如我们通过模拟所展示的,当在人群中抽样时,这种高度有害的突变通常只有几代人的历史。此外,先证者携带的高度有害突变的比例反映了疾病的典型发病年龄。研究设计也有明显的影响:在系谱而非病例对照研究中,以及在孤独症中,在单倍体而非多倍体家族中,以及在女性先证者中,检测到的高度有害突变的比例更大。因此,将人类遗传学中的观察结果与群体遗传学模型联系起来,可以让我们了解不同映射策略和不同特征所识别的突变的适应度效应。