Koyama Satoshi, Yu Zhi, Choi Seung Hoan, Jurgens Sean J, Selvaraj Margaret Sunitha, Klarin Derek, Huffman Jennifer E, Clarke Shoa L, Trinh Michael N, Ravi Akshaya, Dron Jacqueline S, Spinks Catherine, Surakka Ida, Bhatnagar Aarushi, Lannery Kim, Hornsby Whitney, Damrauer Scott M, Chang Kyong-Mi, Lynch Julie A, Assimes Themistocles L, Tsao Philip S, Rader Daniel J, Cho Kelly, Peloso Gina M, Ellinor Patrick T, Sun Yan V, Wilson Peter Wf, Program Million Veteran, Natarajan Pradeep
VA Boston Healthcare System, Boston, MA.
Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA.
medRxiv. 2024 Sep 18:2024.09.17.24313718. doi: 10.1101/2024.09.17.24313718.
Rare coding alleles play crucial roles in the molecular diagnosis of genetic diseases. However, the systemic identification of these alleles has been challenging due to their scarcity in the general population. Here, we discovered and characterized rare coding alleles contributing to genetic dyslipidemia, a principal risk for coronary artery disease, among over a million individuals combining three large contemporary genetic datasets (the Million Veteran Program, n = 634,535, UK Biobank, n = 431,178, and the All of Us Research Program, n = 92,304) totaling 1,158,017 multi-ancestral individuals. Unlike previous rare variant studies in lipids, this study included 238,243 individuals (20.6%) from non-European-like populations. Testing 2,997,401 rare coding variants from diverse backgrounds, we identified 800 exome-wide significant associations across 209 genes including 176 predicted loss of function and 624 missense variants. Among these exome-wide associations, 130 associations were driven by non-European-like populations. Associated alleles are highly enriched in functional variant classes, showed significant additive and recessive associations, exhibited similar effects across populations, and resolved pathogenicity for variants enriched in African or South-Asian populations. Furthermore, we identified 5 lipid-related genes associated with coronary artery disease . Among them, is a potentially novel therapeutic target through the down regulation of LDLC by its silencing. This study provides resources and insights for understanding causal mechanisms, quantifying the expressivity of rare coding alleles, and identifying novel drug targets across diverse populations.
罕见编码等位基因在遗传疾病的分子诊断中起着关键作用。然而,由于这些等位基因在普通人群中稀少,对其进行系统性鉴定一直具有挑战性。在此,我们在合并了三个大型当代遗传数据集(百万退伍军人计划,n = 634,535;英国生物银行,n = 431,178;以及我们所有人研究计划,n = 92,304)的超过一百万人中发现并表征了导致遗传性血脂异常(冠状动脉疾病的主要风险因素)的罕见编码等位基因,这些数据集总计有1,158,017名多祖先个体。与以往关于脂质的罕见变异研究不同,本研究纳入了238,243名(20.6%)非欧洲裔人群个体。通过测试来自不同背景的2,997,401个罕见编码变异,我们在209个基因中鉴定出800个全外显子组显著关联,包括176个预测的功能丧失变异和624个错义变异。在这些全外显子组关联中,130个关联是由非欧洲裔人群驱动的。相关等位基因在功能变异类别中高度富集,显示出显著的加性和隐性关联,在不同人群中表现出相似的效应,并解决了非洲或南亚人群中富集变异的致病性问题。此外,我们鉴定出5个与冠状动脉疾病相关的脂质相关基因。其中, 通过沉默下调低密度脂蛋白胆固醇(LDLC),是一个潜在的新型治疗靶点。本研究为理解因果机制、量化罕见编码等位基因的表达性以及在不同人群中鉴定新型药物靶点提供了资源和见解。