Division of Developmental Biology, Eunice Kennedy-Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892, USA.
Nucleic Acids Res. 2024 May 22;52(9):e45. doi: 10.1093/nar/gkae288.
Recent studies have combined DNA methyltransferase footprinting of genomic DNA in nuclei with long-read sequencing, resulting in detailed chromatin maps for multi-kilobase stretches of genomic DNA from one cell. Theoretically, nucleosome footprints and nucleosome-depleted regions can be identified using M.EcoGII, which methylates adenines in any sequence context, providing a high-resolution map of accessible regions in each DNA molecule. Here, we report PacBio long-read sequence data for budding yeast nuclei treated with M.EcoGII and a bioinformatic pipeline which corrects for three key challenges undermining this promising method. First, detection of m6A in individual DNA molecules by the PacBio software is inefficient, resulting in false footprints predicted by random gaps of seemingly unmethylated adenines. Second, there is a strong bias against m6A base calling as AT content increases. Third, occasional methylation occurs within nucleosomes, breaking up their footprints. After correcting for these issues, our pipeline calculates a correlation coefficient-based score indicating the extent of chromatin heterogeneity within the cell population for every gene. Although the population average is consistent with that derived using other techniques, we observe a wide range of heterogeneity in nucleosome positions at the single-molecule level, probably reflecting cellular chromatin dynamics.
最近的研究将基因组 DNA 中核内 DNA 甲基转移酶足迹与长读测序相结合,从而从一个细胞中获得了多千碱基长度的基因组 DNA 的详细染色质图谱。从理论上讲,可以使用 M.EcoGII 识别核小体足迹和核小体耗竭区域,该酶可以在任何序列背景下甲基化腺嘌呤,从而为每个 DNA 分子中可及区域提供高分辨率图谱。在这里,我们报告了用 M.EcoGII 处理的芽殖酵母核的 PacBio 长读序列数据,以及一种生物信息学管道,该管道纠正了破坏这种有前途方法的三个关键挑战。首先,PacBio 软件对单个 DNA 分子中 m6A 的检测效率低下,导致看似未甲基化的腺嘌呤随机出现的间隙产生错误的足迹预测。其次,随着 AT 含量的增加,m6A 碱基调用存在很强的偏差。第三,偶尔会在核小体内部发生甲基化,破坏它们的足迹。在纠正这些问题后,我们的管道为每个基因计算了一个基于相关系数的分数,该分数表示细胞群体中染色质异质性的程度。尽管群体平均值与使用其他技术得出的平均值一致,但我们在单分子水平上观察到核小体位置的异质性范围很广,这可能反映了细胞染色质动力学。