The Jackson Laboratory, Bar Harbor, Maine, United States of America.
Program in Genetics, Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, United States of America.
PLoS Genet. 2020 Jun 3;16(6):e1008775. doi: 10.1371/journal.pgen.1008775. eCollection 2020 Jun.
Late-Onset Alzheimer's disease (LOAD) is a common, complex genetic disorder well-known for its heterogeneous pathology. The genetic heterogeneity underlying common, complex diseases poses a major challenge for targeted therapies and the identification of novel disease-associated variants. Case-control approaches are often limited to examining a specific outcome in a group of heterogenous patients with different clinical characteristics. Here, we developed a novel approach to define relevant transcriptomic endophenotypes and stratify decedents based on molecular profiles in three independent human LOAD cohorts. By integrating post-mortem brain gene co-expression data from 2114 human samples with LOAD, we developed a novel quantitative, composite phenotype that can better account for the heterogeneity in genetic architecture underlying the disease. We used iterative weighted gene co-expression network analysis (WGCNA) to reduce data dimensionality and to isolate gene sets that are highly co-expressed within disease subtypes and represent specific molecular pathways. We then performed single variant association testing using whole genome-sequencing data for the novel composite phenotype in order to identify genetic loci that contribute to disease heterogeneity. Distinct LOAD subtypes were identified for all three study cohorts (two in ROSMAP, three in Mayo Clinic, and two in Mount Sinai Brain Bank). Single variant association analysis identified a genome-wide significant variant in TMEM106B (p-value < 5×10-8, rs1990620G) in the ROSMAP cohort that confers protection from the inflammatory LOAD subtype. Taken together, our novel approach can be used to stratify LOAD into distinct molecular subtypes based on affected disease pathways.
迟发性阿尔茨海默病(LOAD)是一种常见的、复杂的遗传疾病,其病理学具有明显的异质性。常见的、复杂疾病的遗传异质性对靶向治疗和新的疾病相关变异体的识别构成了重大挑战。病例对照方法通常仅限于在具有不同临床特征的异质性患者组中检查特定的结果。在这里,我们开发了一种新方法,通过在三个独立的人类 LOAD 队列中根据分子谱定义相关的转录组内表型并对死者进行分层。通过将 2114 个人类样本的死后大脑基因共表达数据与 LOAD 整合,我们开发了一种新的定量、综合表型,可以更好地解释疾病遗传结构的异质性。我们使用迭代加权基因共表达网络分析(WGCNA)来降低数据维度,并分离出在疾病亚型内高度共表达并代表特定分子途径的基因集。然后,我们使用全基因组测序数据对新型综合表型进行单变体关联测试,以确定导致疾病异质性的遗传位点。所有三个研究队列都确定了不同的 LOAD 亚型(ROSAP 中有两个,Mayo 诊所中有三个,Mount Sinai 脑库中有两个)。单变体关联分析确定了 ROSMAP 队列中 TMEM106B 中的一个全基因组显著变异(p 值<5×10-8,rs1990620G),该变异可保护免受炎症性 LOAD 亚型的影响。总之,我们的新方法可以用于根据受影响的疾病途径将 LOAD 分层为不同的分子亚型。