Graduate Program of Bioinformatics and Systems Biology, University of California, San Diego, La Jolla, CA 92093-0359.
Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA 92093-0359.
Proc Natl Acad Sci U S A. 2019 Feb 26;116(9):3668-3677. doi: 10.1073/pnas.1813565116. Epub 2019 Feb 12.
Histones are modified by enzymes that act in a locus, cell-type, and developmental stage-specific manner. The recruitment of enzymes to chromatin is regulated at multiple levels, including interaction with sequence-specific DNA-binding factors. However, the DNA-binding specificity of the regulatory factors that orchestrate specific histone modifications has not been broadly mapped. We have analyzed 6 histone marks (H3K4me1, H3K4me3, H3K27ac, H3K27me3, K3H9me3, H3K36me3) across 121 human cell types and tissues from the NIH Roadmap Epigenomics Project as well as 8 histone marks (with addition of H3K4me2 and H3K9ac) from the mouse ENCODE Consortium. We have identified 361 and 369 DNA motifs in human and mouse, respectively, that are the most predictive of each histone mark. Interestingly, 107 human motifs are conserved between the two species. In human embryonic cell line H1, we mutated only the found DNA motifs at particular loci and the significant reduction of H3K27ac levels validated the regulatory roles of the perturbed motifs. The functionality of these motifs was also supported by the evidence that histone-associated motifs, especially H3K4me3 motifs, significantly overlap with the expression of quantitative trait loci SNPs in cancer patients more than the known and random motifs. Furthermore, we observed possible feedbacks to control chromatin dynamics as the found motifs appear in the promoters or enhancers associated with various histone modification enzymes. These results pave the way toward revealing the molecular mechanisms of epigenetic events, such as histone modification dynamics and epigenetic priming.
组蛋白通过在特定位置、细胞类型和发育阶段发挥作用的酶进行修饰。酶在染色质上的募集受到多个层面的调控,包括与序列特异性 DNA 结合因子的相互作用。然而,调控特定组蛋白修饰的调节因子的 DNA 结合特异性尚未得到广泛描绘。我们分析了来自 NIH 表观基因组学计划的 121 个人类细胞类型和组织中的 6 种组蛋白标记物(H3K4me1、H3K4me3、H3K27ac、H3K27me3、K3H9me3、H3K36me3),以及来自小鼠 ENCODE 联盟的 8 种组蛋白标记物(增加了 H3K4me2 和 H3K9ac)。我们在人类和小鼠中分别鉴定了 361 个和 369 个 DNA 基序,它们是每个组蛋白标记物最具预测性的基序。有趣的是,107 个人类基序在两个物种之间是保守的。在人类胚胎细胞系 H1 中,我们只在特定基因座突变了发现的 DNA 基序,并且 H3K27ac 水平的显著降低验证了扰动基序的调节作用。这些基序的功能还得到了以下证据的支持,即组蛋白相关基序,特别是 H3K4me3 基序,在癌症患者中与数量性状基因座 SNP 的表达显著重叠,超过了已知和随机基序。此外,我们观察到可能的反馈来控制染色质动力学,因为发现的基序出现在与各种组蛋白修饰酶相关的启动子或增强子中。这些结果为揭示表观遗传事件的分子机制铺平了道路,例如组蛋白修饰动力学和表观遗传启动。