Computational Biology, Max Planck Institute for Informatics, Saarland Informatics Campus, 66123 Saarbrücken, Germany.
Graduate School of Computer Science, Saarland Informatics Campus, 66123 Saarbrücken, Germany.
Nucleic Acids Res. 2020 May 7;48(8):e46. doi: 10.1093/nar/gkaa120.
DNA methylation is an epigenetic mark with important regulatory roles in cellular identity and can be quantified at base resolution using bisulfite sequencing. Most studies are limited to the average DNA methylation levels of individual CpGs and thus neglect heterogeneity within the profiled cell populations. To assess this within-sample heterogeneity (WSH) several window-based scores that quantify variability in DNA methylation in sequencing reads have been proposed. We performed the first systematic comparison of four published WSH scores based on simulated and publicly available datasets. Moreover, we propose two new scores and provide guidelines for selecting appropriate scores to address cell-type heterogeneity, cellular contamination and allele-specific methylation. Most of the measures were sensitive in detecting DNA methylation heterogeneity in these scenarios, while we detected differences in susceptibility to technical bias. Using recently published DNA methylation profiles of Ewing sarcoma samples, we show that DNA methylation heterogeneity provides information complementary to the DNA methylation level. WSH scores are powerful tools for estimating variance in DNA methylation patterns and have the potential for detecting novel disease-associated genomic loci not captured by established statistics. We provide an R-package implementing the WSH scores for integration into analysis workflows.
DNA 甲基化是一种表观遗传标记,在细胞身份中具有重要的调节作用,可以通过亚硫酸氢盐测序以碱基分辨率进行定量。大多数研究仅限于单个 CpG 的平均 DNA 甲基化水平,因此忽略了所分析细胞群体中的异质性。为了评估这种样本内异质性 (WSH),已经提出了几种基于测序读数中 DNA 甲基化变异性的基于窗口的评分方法。我们基于模拟和公开可用数据集对四个已发表的 WSH 评分进行了首次系统比较。此外,我们提出了两个新的评分,并提供了选择合适评分来解决细胞类型异质性、细胞污染和等位基因特异性甲基化的指导原则。在这些情况下,大多数措施都能够灵敏地检测 DNA 甲基化异质性,而我们检测到了对技术偏差敏感性的差异。使用最近发表的尤文肉瘤样本的 DNA 甲基化图谱,我们表明 DNA 甲基化异质性提供了与 DNA 甲基化水平互补的信息。WSH 评分是估计 DNA 甲基化模式方差的有力工具,并且有可能检测到由既定统计数据未捕获的新型与疾病相关的基因组位点。我们提供了一个实现 WSH 评分的 R 包,用于集成到分析工作流程中。