Department of Psychiatry, University of Iowa, Iowa City, IA 52242, USA.
Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA.
Genes (Basel). 2022 Apr 13;13(4):683. doi: 10.3390/genes13040683.
Type 2 diabetes mellitus (T2D) has a complex genetic and environmental architecture that underlies its development and clinical presentation. Despite the identification of well over a hundred genetic variants and CpG sites that associate with T2D, a robust biosignature that could be used to prevent or forestall clinical disease has not been developed. Based on the premise that underlying genetic variation influences DNA methylation (DNAm) independently of or in combination with environmental exposures, we assessed the ability of local and distal gene x methylation (GxMeth) interactive effects to improve cg19693031 models for predicting T2D status in an African American cohort. Using genome-wide genetic data from 506 subjects, we identified a total of 1476 GxMeth terms associated with HbA1c values. The GxMeth SNPs map to biological pathways associated with the development and complications of T2D, with genetically contextual differences in methylation observed only in diabetic subjects for two GxMeth SNPs (rs2390998 AG vs. GG, = 4.63 × 10, Δ = 13%, effect size = 0.16 [95% CI = 0.05, 0.32]; rs1074390 AA vs. GG, = 3.93 × 10, Δ = 9%, effect size = 0.38 [95% CI = 0.12, 0.56]. Using a repeated stratified k-fold cross-validation approach, a series of balanced random forest classifiers with random under-sampling were built to evaluate the addition of GxMeth terms to cg19693031 models to discriminate between normoglycemic controls versus T2D subjects. The results were compared to those obtained from models incorporating only the covariates (age, sex and BMI) and the addition of cg19693031. We found a post-pruned classifier incorporating 10 GxMeth SNPs and cg19693031 adjusted for covariates predicted the T2D status, with the AUC, sensitivity, specificity and precision of the positive target class being 0.76, 0.81, 0.70 and 0.63, respectively. Comparatively, the AUC, sensitivity, specificity and precision using the covariates and cg19693031 were only 0.71, 0.74, 0.67 and 0.59, respectively. Collectively, we demonstrate correcting for genetic confounding of cg19693031 improves its ability to detect type 2 diabetes. We conclude that an integrated genetic-epigenetic approach could inform personalized medicine programming for more effective prevention and treatment of T2D.
2 型糖尿病(T2D)具有复杂的遗传和环境结构,这是其发生和临床表现的基础。尽管已经发现了 100 多个与 T2D 相关的遗传变异和 CpG 位点,但尚未开发出一种可靠的生物标志物来预防或阻止临床疾病。基于这样的前提,即潜在的遗传变异独立于或与环境暴露相结合影响 DNA 甲基化(DNAm),我们评估了局部和远端基因 x 甲基化(GxMeth)相互作用效应的能力,以改善 cg19693031 模型,从而预测非裔美国人队列中的 T2D 状态。使用来自 506 名受试者的全基因组遗传数据,我们总共鉴定出 1476 个与 HbA1c 值相关的 GxMeth 术语。GxMeth SNPs 映射到与 T2D 发展和并发症相关的生物途径,仅在糖尿病患者中观察到两个 GxMeth SNPs(rs2390998 AG 与 GG, = 4.63 × 10,Δ = 13%,效应大小 = 0.16 [95% CI = 0.05, 0.32];rs1074390 AA 与 GG, = 3.93 × 10,Δ = 9%,效应大小 = 0.38 [95% CI = 0.12, 0.56])的遗传背景差异中的甲基化。使用重复分层 k 折交叉验证方法,构建了一系列带有随机欠采样的平衡随机森林分类器,以评估将 GxMeth 项添加到 cg19693031 模型中以区分正常血糖对照与 T2D 受试者的能力。将结果与仅包含协变量(年龄、性别和 BMI)和 cg19693031 调整的模型的结果进行了比较。我们发现,一个包含 10 个 GxMeth SNPs 和 cg19693031 的修剪后分类器,可预测 T2D 状态,阳性目标类别的 AUC、敏感性、特异性和精度分别为 0.76、0.81、0.70 和 0.63。相比之下,仅使用协变量和 cg19693031 的 AUC、敏感性、特异性和精度分别为 0.71、0.74、0.67 和 0.59。总的来说,我们证明了校正 cg19693031 的遗传混杂可以提高其检测 2 型糖尿病的能力。我们得出结论,综合遗传-表观遗传方法可以为 2 型糖尿病的更有效预防和治疗提供个性化医学编程信息。