Unit of Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, Odense, Denmark.
Unit of Human Genetics, Department of Clinical Research, University of Southern Denmark, Odense, Denmark.
Aging Cell. 2021 Feb;20(2):e13293. doi: 10.1111/acel.13293. Epub 2021 Feb 2.
Privileged by rapid increase in available epigenomic data, epigenome-wide association studies (EWAS) are to make a profound contribution to understand the molecular mechanism of DNA methylation in cognitive aging. Current statistical methods used in EWAS are dominated by models based on multiple assumptions, for example, linear relationship between molecular profiles and phenotype, normal distribution for the methylation data and phenotype. In this study, we applied an assumption-free method, the generalized correlation coefficient (GCC), and compare it to linear models, namely the linear mixed model and kinship model. We use DNA methylation associated with a cognitive score in 400 and 206 twins as discovery and replication samples respectively. DNA methylation associated with cognitive function using GCC, linear mixed model, and kinship model, identified 65 CpGs (p < 1e-04) from discovery sample displaying both nonlinear and linear correlations. Replication analysis successfully replicated 9 of these top CpGs. When combining results of GCC and linear models to cover diverse patterns of relationships, we identified genes like KLHDC4, PAPSS2, and MRPS18B as well as pathways including focal adhesion, axon guidance, and some neurological signaling. Genomic region-based analysis found 15 methylated regions harboring 11 genes, with three verified in gene expression analysis, also the 11 genes were related to top functional clusters including neurohypophyseal hormone and maternal aggressive behaviors. The GCC approach detects valuable methylation sites missed by traditional linear models. A combination of methylation markers from GCC and linear models enriched biological pathways sensible in neurological function that could implicate cognitive performance and cognitive aging.
得益于可供分析的表观基因组数据的快速增加,全基因组关联研究(EWAS)将对理解 DNA 甲基化在认知衰老中的分子机制做出重大贡献。当前 EWAS 中使用的统计方法主要基于多种假设的模型,例如分子谱和表型之间的线性关系、甲基化数据和表型的正态分布。在这项研究中,我们应用了一种无假设的方法,广义相关系数(GCC),并将其与线性模型,即线性混合模型和亲缘关系模型进行了比较。我们使用与认知评分相关的 400 对和 206 对双胞胎的 DNA 甲基化作为发现和复制样本。使用 GCC、线性混合模型和亲缘关系模型,我们在发现样本中确定了与认知功能相关的 65 个 CpG(p < 1e-04),这些 CpG 同时显示出非线性和线性相关性。复制分析成功复制了这 9 个 top CpG 中的 9 个。当将 GCC 和线性模型的结果结合起来以涵盖不同的关系模式时,我们确定了 KLHDC4、PAPSS2 和 MRPS18B 等基因以及焦点黏附、轴突导向和一些神经信号等途径。基于基因组区域的分析发现了 15 个含有 11 个基因的甲基化区域,其中 3 个在基因表达分析中得到了验证,这 11 个基因与包括神经垂体激素和母性行为攻击等功能簇相关。GCC 方法可以检测到传统线性模型错过的有价值的甲基化位点。来自 GCC 和线性模型的甲基化标记的组合丰富了与神经功能相关的生物途径,这些途径可能与认知表现和认知衰老有关。