Zhuang Xiaowei, Xu Gang, Amei Amei, Cordes Dietmar, Wang Zuoheng, Oh Edwin C
bioRxiv. 2023 Oct 17:2023.10.17.562756. doi: 10.1101/2023.10.17.562756.
The development and progression of Alzheimer's disease (AD) is a complex process that can change over time, during which genetic influences on phenotypes may also fluctuate. Incorporating longitudinal phenotypes in genome wide association studies (GWAS) could help unmask genetic loci with time-varying effects. In this study, we incorporated a varying coefficient test in a longitudinal GWAS model to identify single nucleotide polymorphisms (SNPs) that may have time- or age-dependent effects in AD.
Genotype data from 1,877 participants in the Alzheimer's Neuroimaging Data Initiative (ADNI) were imputed using the Haplotype Reference Consortium (HRC) panel, resulting in 9,573,130 SNPs. Subjects' longitudinal impairment status at each visit was considered as a binary and clinical phenotype. Participants' composite standardized uptake value ratio (SUVR) derived from each longitudinal amyloid PET scan was considered as a continuous and biological phenotype. The retrospective varying coefficient mixed model association test (RVMMAT) was used in longitudinal GWAS to detect time-varying genetic effects on the impairment status and SUVR measures. Post-hoc analyses were performed on genome-wide significant SNPs, including 1) pathway analyses; 2) age-stratified genotypic comparisons and regression analyses; and 3) replication analyses using data from the National Alzheimer's Coordinating Center (NACC).
Our model identified 244 genome-wide significant SNPs that revealed time-varying genetic effects on the clinical impairment status in AD; among which, 12 SNPs on chromosome 19 were successfully replicated using data from NACC. Post-hoc age-stratified analyses indicated that for most of these 244 SNPs, the maximum genotypic effect on impairment status occurred between 70 to 80 years old, and then declined with age. Our model further identified 73 genome-wide significant SNPs associated with the temporal variation of amyloid accumulation. For these SNPs, an increasing genotypic effect on PET-SUVR was observed as participants' age increased. Functional pathway analyses on significant SNPs for both phenotypes highlighted the involvement and disruption of immune responses- and neuroinflammation-related pathways in AD.
We demonstrate that longitudinal GWAS models with time-varying coefficients can boost the statistical power in AD-GWAS. In addition, our analyses uncovered potential time-varying genetic variants on repeated measurements of clinical and biological phenotypes in AD.
阿尔茨海默病(AD)的发生和发展是一个复杂的过程,且会随时间变化,在此期间基因对表型的影响也可能波动。在全基因组关联研究(GWAS)中纳入纵向表型有助于揭示具有时间变化效应的基因座。在本研究中,我们在纵向GWAS模型中纳入了可变系数检验,以识别在AD中可能具有时间或年龄依赖性效应的单核苷酸多态性(SNP)。
使用单倍型参考联盟(HRC)面板对来自阿尔茨海默病神经影像数据倡议(ADNI)的1877名参与者的基因型数据进行填充,得到9573130个SNP。每次访视时受试者的纵向损伤状态被视为二元临床表型。从每次纵向淀粉样蛋白PET扫描得出的参与者复合标准化摄取值比率(SUVR)被视为连续生物学表型。纵向GWAS中使用回顾性可变系数混合模型关联检验(RVMMAT)来检测基因对损伤状态和SUVR测量值的时间变化效应。对全基因组显著的SNP进行事后分析,包括1)通路分析;2)年龄分层的基因型比较和回归分析;3)使用来自国家阿尔茨海默病协调中心(NACC)的数据进行重复分析。
我们的模型识别出244个全基因组显著的SNP,这些SNP揭示了基因对AD临床损伤状态的时间变化效应;其中,使用NACC的数据成功重复验证了19号染色体上的12个SNP。事后年龄分层分析表明,对于这244个SNP中的大多数,对损伤状态的最大基因型效应发生在70至80岁之间,然后随年龄下降。我们的模型进一步识别出73个与淀粉样蛋白积累的时间变化相关的全基因组显著SNP。对于这些SNP,随着参与者年龄的增加,观察到对PET-SUVR的基因型效应增加。对两种表型的显著SNP进行的功能通路分析突出了免疫反应和神经炎症相关通路在AD中的参与和破坏。
我们证明具有可变系数的纵向GWAS模型可以提高AD-GWAS中的统计效力。此外,我们的分析揭示了AD中临床和生物学表型重复测量时潜在的时间变化遗传变异。