Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, MB R3E 0J9, Canada.
Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA.
Am J Hum Genet. 2018 Apr 5;102(4):658-675. doi: 10.1016/j.ajhg.2018.02.013. Epub 2018 Mar 15.
Genomic regions of autozygosity (ROAs) represent segments of individual genomes that are homozygous for haplotypes inherited identical-by-descent (IBD) from a common ancestor. ROAs are nonuniformly distributed across the genome, and increased ROA levels are a reported risk factor for numerous complex diseases. Previously, we hypothesized that long ROAs are enriched for deleterious homozygotes as a result of young haplotypes with recent deleterious mutations-relatively untouched by purifying selection-being paired IBD as a consequence of recent parental relatedness, a pattern supported by ROA and whole-exome sequence data on 27 individuals. Here, we significantly bolster support for our hypothesis and expand upon our original analyses using ROA and whole-genome sequence data on 2,436 individuals from The 1000 Genomes Project. Considering CADD deleteriousness scores, we reaffirm our previous observation that long ROAs are enriched for damaging homozygotes worldwide. We show that strongly damaging homozygotes experience greater enrichment than weaker damaging homozygotes, while overall enrichment varies appreciably among populations. Mendelian disease genes and those encoding FDA-approved drug targets have significantly increased rates of gain in damaging homozygotes with increasing ROA coverage relative to all other genes. In genes implicated in eight complex phenotypes for which ROA levels have been identified as a risk factor, rates of gain in damaging homozygotes vary across phenotypes and populations but frequently differ significantly from non-disease genes. These findings highlight the potential confounding effects of population background in the assessment of associations between ROA levels and complex disease risk, which might underlie reported inconsistencies in ROA-phenotype associations.
个体基因组的同源区域(ROA)代表个体基因组的某些片段,这些片段的单倍型来自共同祖先的完全一致遗传(IBD)。ROA 在基因组中分布不均匀,并且增加的 ROA 水平是许多复杂疾病的报告风险因素。以前,我们假设长 ROA 由于最近具有有害突变的年轻单倍型而富集有害纯合子,这些突变相对不受最近的选择压力影响,由于最近的父母亲缘关系而配对 IBD,这种模式得到了 27 个人的 ROA 和全外显子组序列数据的支持。在这里,我们使用来自 1000 基因组计划的 2436 个人的 ROA 和全基因组序列数据,极大地支持了我们的假设,并扩展了我们的原始分析。考虑到 CADD 有害性评分,我们重申了我们之前的观察结果,即在全球范围内,长 ROA 富集有害纯合子。我们表明,强有害纯合子比弱有害纯合子经历更大的富集,而整体富集在不同人群中差异很大。孟德尔疾病基因和编码 FDA 批准的药物靶点的基因,在 ROA 覆盖率增加时,有害纯合子的增益率显著高于所有其他基因。在 8 个复杂表型中,ROA 水平被确定为风险因素的基因中,有害纯合子的增益率在表型和人群中存在差异,但通常与非疾病基因有显著差异。这些发现强调了在评估 ROA 水平与复杂疾病风险之间的关联时,人群背景的潜在混杂效应,这可能是报告的 ROA-表型关联不一致的原因。