Institute of Biomedical and Clinical Science, University of Exeter Medical School, Research, Innovation, Learning and Development building, Royal Devon & Exeter Hospital, Barrack Road, Exeter EX2 5DW, UK.
Institute of Biomedical and Clinical Science, University of Exeter Medical School, Research, Innovation, Learning and Development building, Royal Devon & Exeter Hospital, Barrack Road, Exeter EX2 5DW, UK.
Am J Hum Genet. 2019 Feb 7;104(2):275-286. doi: 10.1016/j.ajhg.2018.12.015. Epub 2019 Jan 18.
More than 100,000 genetic variants are classified as disease causing in public databases. However, the true penetrance of many of these rare alleles is uncertain and might be over-estimated by clinical ascertainment. Here, we use data from 379,768 UK Biobank (UKB) participants of European ancestry to assess the pathogenicity and penetrance of putatively clinically important rare variants. Although rare variants are harder to genotype accurately than common variants, we were able to classify as high quality 1,244 of 4,585 (27%) putatively clinically relevant rare (MAF < 1%) variants genotyped on the UKB microarray. We defined as "clinically relevant" variants that were classified as either pathogenic or likely pathogenic in ClinVar or are in genes known to cause two specific monogenic diseases: maturity-onset diabetes of the young (MODY) and severe developmental disorders (DDs). We assessed the penetrance and pathogenicity of these high-quality variants by testing their association with 401 clinically relevant traits. 27 of the variants were associated with a UKB trait, and we were able to refine the penetrance estimate for some of the variants. For example, the HNF4A c.340C>T (p.Arg114Trp) (GenBank: NM_175914.4) variant associated with diabetes is <10% penetrant by the time an individual is 40 years old. We also observed associations with relevant traits for heterozygous carriers of some rare recessive conditions, e.g., heterozygous carriers of the ERCC4 c.2395C>T (p.Arg799Trp) variant that causes Xeroderma pigmentosum were more susceptible to sunburn. Finally, we refute the previous disease association of RNF135 in developmental disorders. In conclusion, this study shows that very large population-based studies will help refine our understanding of the pathogenicity of rare genetic variants.
超过 100,000 个遗传变异被归类为公共数据库中的致病突变。然而,许多这些罕见等位基因的真正外显率是不确定的,并且可能被临床确证所高估。在这里,我们使用来自欧洲血统的 379,768 名 UK Biobank(UKB)参与者的数据来评估具有潜在临床重要性的罕见变体的致病性和外显率。虽然罕见变体比常见变体更难准确基因分型,但我们能够将 UKB 微阵列上基因分型的 4,585 个(27%)具有潜在临床相关性的罕见(MAF < 1%)变体中的 1,244 个归类为高质量变体。我们将“具有临床相关性”的变体定义为 ClinVar 中归类为致病性或可能致病性的变体,或在已知导致两种特定单基因疾病的基因中:年轻发病的成年型糖尿病(MODY)和严重发育障碍(DDs)。我们通过测试这些变体与 401 种具有临床相关性的特征之间的关联来评估这些高质量变体的外显率和致病性。27 个变体与 UKB 特征相关,我们能够细化一些变体的外显率估计。例如,HNF4A c.340C>T (p.Arg114Trp)(GenBank:NM_175914.4)变体与糖尿病相关,到个体 40 岁时,其外显率<10%。我们还观察到一些罕见隐性疾病的杂合子携带者与相关特征之间的关联,例如,ERCC4 c.2395C>T(p.Arg799Trp)变体的杂合子携带者更容易晒伤,该变体导致着色性干皮病。最后,我们反驳了 RNF135 先前与发育障碍相关的疾病关联。总之,这项研究表明,非常大规模的基于人群的研究将有助于我们更好地理解罕见遗传变异的致病性。