National Center for PTSD, VA Boston Healthcare System, Boston, MA, USA.
Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA.
Epigenetics. 2023 Dec;18(1):2207959. doi: 10.1080/15592294.2023.2207959.
Differentially methylated regions (DMRs) are genomic regions with methylation patterns across multiple CpG sites that are associated with a phenotype. In this study, we proposed a Principal Component (PC) based DMR analysis method for use with data generated using the Illumina Infinium MethylationEPIC BeadChip (EPIC) array. We obtained methylation residuals by regressing the M-values of CpGs within a region on covariates, extracted PCs of the residuals, and then combined association information across PCs to obtain regional significance. Simulation-based genome-wide false positive (GFP) rates and true positive rates were estimated under a variety of conditions before determining the final version of our method, which we have named DMR. Then, DMR and another DMR method, coMethDMR, were used to perform epigenome-wide analyses of several phenotypes known to have multiple associated methylation loci (age, sex, and smoking) in a discovery and a replication cohort. Among regions that were analysed by both methods, DMR identified 50% more genome-wide significant age-associated DMRs than coMethDMR. The replication rate for the loci that were identified by only DMR was higher than the rate for those that were identified by only coMethDMR (90% for DMRPC vs. 76% for coMethDMR). Furthermore, DMR identified replicable associations in regions of moderate between-CpG correlation which are typically not analysed by coMethDMR. For the analyses of sex and smoking, the advantage of DMR was less clear. In conclusion, DMR is a new powerful DMR discovery tool that retains power in genomic regions with moderate correlation across CpGs.
差异甲基化区域(DMR)是指在多个 CpG 位点上具有甲基化模式的基因组区域,与表型相关。在这项研究中,我们提出了一种基于主成分(PC)的 DMR 分析方法,用于处理使用 Illumina Infinium MethylationEPIC BeadChip(EPIC)阵列生成的数据。我们通过将区域内 CpG 的 M 值与协变量进行回归,获得甲基化残差,提取残差的 PC,然后结合跨 PC 的关联信息,获得区域显著性。在确定我们的方法(命名为 DMR)的最终版本之前,我们在多种条件下模拟了全基因组假阳性(GFP)率和真阳性率,然后使用 DMR 和另一种 DMR 方法 coMethDMR 对几个已知具有多个相关甲基化位点的表型(年龄、性别和吸烟)进行全基因组分析,在发现和复制队列中。在两种方法分析的区域中,DMR 比 coMethDMR 识别出的与年龄相关的 DMR 多 50%。仅 DMR 识别出的位点的复制率高于仅 coMethDMR 识别出的位点的复制率(DMRPC 为 90%,coMethDMR 为 76%)。此外,DMR 还在通常不会被 coMethDMR 分析的中等 CpG 相关性区域中识别出可复制的关联。对于性别和吸烟的分析,DMR 的优势不太明显。总之,DMR 是一种新的强大的 DMR 发现工具,在 CpG 相关性中等的基因组区域中保持了强大的功能。