Department of Medicine, University of Cambridge, Cambridge, UK.
Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
Nat Med. 2023 Mar;29(3):679-688. doi: 10.1038/s41591-023-02211-z. Epub 2023 Mar 16.
The genetic etiologies of more than half of rare diseases remain unknown. Standardized genome sequencing and phenotyping of large patient cohorts provide an opportunity for discovering the unknown etiologies, but this depends on efficient and powerful analytical methods. We built a compact database, the 'Rareservoir', containing the rare variant genotypes and phenotypes of 77,539 participants sequenced by the 100,000 Genomes Project. We then used the Bayesian genetic association method BeviMed to infer associations between genes and each of 269 rare disease classes assigned by clinicians to the participants. We identified 241 known and 19 previously unidentified associations. We validated associations with ERG, PMEPA1 and GPR156 by searching for pedigrees in other cohorts and using bioinformatic and experimental approaches. We provide evidence that (1) loss-of-function variants in the Erythroblast Transformation Specific (ETS)-family transcription factor encoding gene ERG lead to primary lymphoedema, (2) truncating variants in the last exon of transforming growth factor-β regulator PMEPA1 result in Loeys-Dietz syndrome and (3) loss-of-function variants in GPR156 give rise to recessive congenital hearing impairment. The Rareservoir provides a lightweight, flexible and portable system for synthesizing the genetic and phenotypic data required to study rare disease cohorts with tens of thousands of participants.
超过一半的罕见病的遗传病因仍然未知。对大量患者队列进行标准化的基因组测序和表型分析为发现未知病因提供了机会,但这取决于高效和强大的分析方法。我们构建了一个紧凑的数据库“Rareservoir”,其中包含了由 10 万基因组计划测序的 77539 名参与者的罕见变异基因型和表型。然后,我们使用贝叶斯遗传关联方法 BeviMed 来推断基因与临床医生分配给参与者的 269 种罕见疾病类别中的每一种之间的关联。我们确定了 241 个已知和 19 个以前未识别的关联。我们通过在其他队列中搜索家系以及使用生物信息学和实验方法,验证了 ERG、PMEPA1 和 GPR156 之间的关联。我们提供的证据表明:(1)编码转录因子 Erythroblast Transformation Specific (ETS) 家族的 ERG 基因中的功能丧失变异导致原发性淋巴水肿,(2)转化生长因子-β调节剂 PMEPA1 的最后外显子中的截断变异导致 Loeys-Dietz 综合征,(3)GPR156 中的功能丧失变异导致隐性先天性听力障碍。Rareservoir 提供了一个轻量级、灵活且可移植的系统,用于合成需要对成千上万名参与者的罕见病队列进行研究的遗传和表型数据。