Natsume-Kitatani Yayoi, Mamitsuka Hiroshi
Japan Science and Technology Agency, PRESTO (Precursory Research for Embryonic Science and Technology), Saitama, Japan.
Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan.
PLoS One. 2016 Mar 22;11(3):e0151917. doi: 10.1371/journal.pone.0151917. eCollection 2016.
Four different histones (H2A, H2B, H3, and H4; two subunits each) constitute a histone octamer, around which DNA wraps to form histone-DNA complexes called nucleosomes. Amino acid residues in each histone are occasionally modified, resulting in several biological effects, including differential regulation of transcription. Core promoters that encompass the transcription start site have well-conserved DNA motifs, including the initiator (Inr), TATA box, and DPE, which are collectively called the core promoter elements (CPEs). In this study, we systematically studied the associations between the CPEs and histone modifications by integrating the Drosophila Core Promoter Database and time-series ChIP-seq data for histone modifications (H3K4me3, H3K27ac, and H3K27me3) during development in Drosophila melanogaster via the modENCODE project. We classified 96 core promoters into four groups based on the presence or absence of the TATA box or DPE, calculated the histone modification ratio at the core promoter region, and transcribed region for each core promoter. We found that the histone modifications in TATA-less groups were static during development and that the core promoters could be clearly divided into three types: i) core promoters with continuous active marks (H3K4me3 and H3K27ac), ii) core promoters with a continuous inactive mark (H3K27me3) and occasional active marks, and iii) core promoters with occasional histone modifications. Linear regression analysis and non-linear regression by random forest showed that the TATA-containing groups included core promoters without histone modifications, for which the measured RNA expression values were not predictable accurately from the histone modification status. DPE-containing groups had a higher relative frequency of H3K27me3 in both the core promoter region and transcribed region. In summary, our analysis showed that there was a systematic link between the existence of the CPEs and the dynamics, frequency and influence on transcriptional activity of histone modifications.
四种不同的组蛋白(H2A、H2B、H3和H4;各两个亚基)构成一个组蛋白八聚体,DNA围绕该八聚体缠绕,形成称为核小体的组蛋白-DNA复合物。每个组蛋白中的氨基酸残基偶尔会发生修饰,从而产生多种生物学效应,包括转录的差异调控。包含转录起始位点的核心启动子具有高度保守的DNA基序,包括起始子(Inr)、TATA盒和下游核心启动子元件(DPE),它们统称为核心启动子元件(CPE)。在本研究中,我们通过果蝇功能元件百科全书(modENCODE)项目,整合果蝇核心启动子数据库以及黑腹果蝇发育过程中组蛋白修饰(H3K4me3、H3K27ac和H3K27me3)的时间序列染色质免疫沉淀测序(ChIP-seq)数据,系统地研究了CPE与组蛋白修饰之间的关联。我们根据TATA盒或DPE的有无,将96个核心启动子分为四组,计算每个核心启动子在核心启动子区域和转录区域的组蛋白修饰率。我们发现,无TATA盒组中的组蛋白修饰在发育过程中是静态的,并且核心启动子可以清楚地分为三种类型:i)具有连续活性标记(H3K4me3和H3K27ac)的核心启动子,ii)具有连续非活性标记(H3K27me3)和偶尔活性标记的核心启动子,iii)具有偶尔组蛋白修饰的核心启动子。线性回归分析和随机森林的非线性回归表明,含TATA盒组包括没有组蛋白修饰的核心启动子,对于这些核心启动子,无法根据组蛋白修饰状态准确预测测得的RNA表达值。含DPE组在核心启动子区域和转录区域中H3K27me3的相对频率较高。总之,我们的分析表明,CPE的存在与组蛋白修饰的动态变化、频率及其对转录活性的影响之间存在系统联系。