Medical and Population Genetics, Broad Institute, Cambridge, MA, United States of America.
Department of Biomedical Data Science, Stanford University, Stanford, CA, United States of America.
PLoS Genet. 2018 May 24;14(5):e1007329. doi: 10.1371/journal.pgen.1007329. eCollection 2018 May.
As part of a broader collaborative network of exome sequencing studies, we developed a jointly called data set of 5,685 Ashkenazi Jewish exomes. We make publicly available a resource of site and allele frequencies, which should serve as a reference for medical genetics in the Ashkenazim (hosted in part at https://ibd.broadinstitute.org, also available in gnomAD at http://gnomad.broadinstitute.org). We estimate that 34% of protein-coding alleles present in the Ashkenazi Jewish population at frequencies greater than 0.2% are significantly more frequent (mean 15-fold) than their maximum frequency observed in other reference populations. Arising via a well-described founder effect approximately 30 generations ago, this catalog of enriched alleles can contribute to differences in genetic risk and overall prevalence of diseases between populations. As validation we document 148 AJ enriched protein-altering alleles that overlap with "pathogenic" ClinVar alleles (table available at https://github.com/macarthur-lab/clinvar/blob/master/output/clinvar.tsv), including those that account for 10-100 fold differences in prevalence between AJ and non-AJ populations of some rare diseases, especially recessive conditions, including Gaucher disease (GBA, p.Asn409Ser, 8-fold enrichment); Canavan disease (ASPA, p.Glu285Ala, 12-fold enrichment); and Tay-Sachs disease (HEXA, c.1421+1G>C, 27-fold enrichment; p.Tyr427IlefsTer5, 12-fold enrichment). We next sought to use this catalog, of well-established relevance to Mendelian disease, to explore Crohn's disease, a common disease with an estimated two to four-fold excess prevalence in AJ. We specifically attempt to evaluate whether strong acting rare alleles, particularly protein-truncating or otherwise large effect-size alleles, enriched by the same founder-effect, contribute excess genetic risk to Crohn's disease in AJ, and find that ten rare genetic risk factors in NOD2 and LRRK2 are enriched in AJ (p < 0.005), including several novel contributing alleles, show evidence of association to CD. Independently, we find that genomewide common variant risk defined by GWAS shows a strong difference between AJ and non-AJ European control population samples (0.97 s.d. higher, p<10-16). Taken together, the results suggest coordinated selection in AJ population for higher CD risk alleles in general. The results and approach illustrate the value of exome sequencing data in case-control studies along with reference data sets like ExAC (sites VCF available via FTP at ftp.broadinstitute.org/pub/ExAC_release/release0.3/) to pinpoint genetic variation that contributes to variable disease predisposition across populations.
作为更大的外显子组测序研究合作网络的一部分,我们开发了一个名为 5685 名阿什肯纳兹犹太人外显子组的联合数据集。我们公开提供了一个位点和等位基因频率资源,该资源应该作为阿什肯纳兹人群医学遗传学的参考(部分托管在 https://ibd.broadinstitute.org 上,也可在 gnomAD 中获得,网址为 http://gnomad.broadinstitute.org)。我们估计,在阿什肯纳兹犹太人群体中,频率大于 0.2%的 34%的蛋白编码等位基因明显更频繁(平均 15 倍),超过了在其他参考人群中观察到的最大频率。这种丰富等位基因的出现是由于大约 30 代前的一个可描述的创始人效应,这种基因库可以导致不同人群之间的遗传风险和总体疾病患病率的差异。作为验证,我们记录了 148 个 AJ 丰富的蛋白改变等位基因,这些等位基因与“致病性”ClinVar 等位基因重叠(可在 https://github.com/macarthur-lab/clinvar/blob/master/output/clinvar.tsv 获得表格),其中包括在一些罕见疾病中,AJ 和非 AJ 人群之间的患病率存在 10-100 倍差异的等位基因,尤其是隐性疾病,包括戈谢氏病(GBA,p.Asn409Ser,8 倍富集);Canavan 病(ASPA,p.Glu285Ala,12 倍富集);和 Tay-Sachs 病(HEXA,c.1421+1G>C,27 倍富集;p.Tyr427IlefsTer5,12 倍富集)。接下来,我们试图利用这个已经确立的孟德尔疾病相关性目录,来探索常见疾病克罗恩病,该疾病在 AJ 中的患病率估计高出两到四倍。我们特别试图评估是否由同一创始人效应富集的强作用罕见等位基因,特别是蛋白截断或其他大效应大小的等位基因,对 AJ 中的克罗恩病有额外的遗传风险。我们发现,NOD2 和 LRRK2 中的十个罕见的遗传风险因素在 AJ 中富集(p<0.005),包括几个新的致病等位基因,显示出与 CD 相关的证据。独立地,我们发现由 GWAS 定义的全基因组常见变异风险在 AJ 和非 AJ 欧洲对照人群样本之间存在很大差异(高 0.97 个标准差,p<10-16)。总之,结果表明 AJ 人群中一般对 CD 风险等位基因进行了协调选择。这些结果和方法说明了外显子组测序数据在病例对照研究中的价值,以及像 ExAC(位点 VCF 可通过 ftp.broadinstitute.org/pub/ExAC_release/release0.3/ 上的 FTP 获得)这样的参考数据集,以确定导致不同人群疾病易感性差异的遗传变异。