Quantitative and Computational Biology Section, University of Southern California, Childs Way, Los Angeles, California, USA.
Watson School of Biological Sciences, Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, New York, 11724, USA.
Epigenetics Chromatin. 2020 Oct 2;13(1):39. doi: 10.1186/s13072-020-00363-7.
Partially methylated domains (PMDs) are a hallmark of epigenomes in reproducible and specific biological contexts, including cancer cells, the placenta, and cultured cell lines. Existing methods for deciding whether PMDs exist in a sample, as well as their identification, are few, often tailored to specific biological questions, and require high coverage samples for accurate identification.
In this study, we outline a set of axioms that take a step towards a functional definition for PMDs, describe an improved method for comparable PMD detection across samples with substantially differing sequencing depths, and refine the decision criteria for whether a sample contains PMDs using a data-driven approach. Applying our method to 267 methylomes from 7 species, we corroborated recent results regarding the general association between replication timing and PMD state, and report identification of several reproducibly "escapee" genes within late-replicating domains that escape the reduced expression and hypomethylation of their immediate genomic neighborhood. We also explored the discordant PMD state of orthologous genes between human and mouse, and observed a directional association of PMD state with gene expression and local gene density.
Our improved method makes low sequencing depth, population-level studies of PMD variation possible and our results further refine the model of PMD formation as one where sequence context and regional epigenomic features both play a role in gradual genome-wide hypomethylation.
部分甲基化区域(PMDs)是可重复且特定生物背景下(包括癌细胞、胎盘和培养细胞系)表观基因组的标志。目前存在的判断样本中是否存在 PMD 及其鉴定的方法较少,通常针对特定的生物学问题,并且需要高覆盖度的样本才能进行准确的鉴定。
在本研究中,我们概述了一组公理,这些公理朝着 PMD 的功能定义迈出了一步,描述了一种改进的方法,可在具有显著不同测序深度的样本中进行可比的 PMD 检测,并使用数据驱动的方法细化了判断样本是否包含 PMD 的决策标准。将我们的方法应用于来自 7 个物种的 267 个甲基组,我们证实了最近关于复制时间与 PMD 状态之间一般关联的结果,并报告了在复制晚期区域内鉴定出几个可重复出现的“逃逸”基因,这些基因逃避了其邻近基因组区域的表达减少和低甲基化。我们还探索了人类和小鼠之间同源基因的不一致的 PMD 状态,并观察到 PMD 状态与基因表达和局部基因密度之间存在定向关联。
我们改进的方法使得在低测序深度、群体水平研究 PMD 变异性成为可能,并且我们的结果进一步细化了 PMD 形成的模型,即序列背景和区域表观基因组特征都在整个基因组逐渐去甲基化过程中发挥作用。