Carreras-Gallo Natalia, Dwaraka Varun B, Jima Dereje D, Skaar David A, Mendez Tavis L, Planchart Antonio, Zhou Wanding, Jirtle Randy L, Smith Ryan, Hoyo Cathrine
bioRxiv. 2024 Jan 16:2024.01.15.575646. doi: 10.1101/2024.01.15.575646.
Differentially methylated imprint control regions (ICRs) regulate the monoallelic expression of imprinted genes. Their epigenetic dysregulation by environmental exposures throughout life results in the formation of common chronic diseases. Unfortunately, existing Infinium methylation arrays lack the ability to profile these regions adequately. Whole genome bisulfite sequencing (WGBS) is the unique method able to profile these regions, but it is very expensive and it requires not only a high coverage but it is also computationally intensive to assess those regions.
To address this deficiency, we developed a custom methylation array containing 22,819 probes. Among them, 9,757 probes map to 1,088 out of the 1,488 candidate ICRs recently described. To assess the performance of the array, we created matched samples processed with the Human Imprintome array and WGBS, which is the current standard method for assessing the methylation of the Human Imprintome. We compared the methylation levels from the shared CpG sites and obtained a mean R = 0.569. We also created matched samples processed with the Human Imprintome array and the Infinium Methylation EPIC v2 array and obtained a mean R = 0.796. Furthermore, replication experiments demonstrated high reliability (ICC: 0.799-0.945).
Our custom array will be useful for replicable and accurate assessment, mechanistic insight, and targeted investigation of ICRs. This tool should accelerate the discovery of ICRs associated with a wide range of diseases and exposures, and advance our understanding of genomic imprinting and its relevance in development and disease formation throughout the life course.
差异甲基化印记控制区域(ICR)调控印记基因的单等位基因表达。它们在一生中因环境暴露导致的表观遗传失调会引发常见慢性疾病的形成。遗憾的是,现有的Infinium甲基化芯片缺乏充分分析这些区域的能力。全基因组亚硫酸氢盐测序(WGBS)是能够分析这些区域的唯一方法,但它成本高昂,不仅需要高覆盖率,而且评估这些区域的计算量也很大。
为弥补这一不足,我们开发了一种包含22,819个探针的定制甲基化芯片。其中,9,757个探针映射到最近描述的1,488个候选ICR中的1,088个。为评估该芯片的性能,我们创建了与人类印记组芯片和WGBS(目前评估人类印记组甲基化的标准方法)处理的匹配样本。我们比较了共享CpG位点的甲基化水平,得到平均R = 0.569。我们还创建了与人类印记组芯片和Infinium甲基化EPIC v2芯片处理的匹配样本,得到平均R = 0.796。此外,重复实验证明了高可靠性(组内相关系数:0.799 - 0.945)。
我们的定制芯片将有助于对ICR进行可重复且准确的评估、机理洞察和靶向研究。该工具应能加速发现与多种疾病和暴露相关的ICR,并推进我们对基因组印记及其在整个生命过程中与发育和疾病形成相关性的理解。