Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany.
The Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, 10115 Berlin, Germany.
Nucleic Acids Res. 2021 Dec 2;49(21):12178-12195. doi: 10.1093/nar/gkab1100.
Embryonic stem cells (ESCs) can differentiate into any given cell type and therefore represent a versatile model to study the link between gene regulation and differentiation. To quantitatively assess the dynamics of enhancer activity during the early stages of murine ESC differentiation, we analyzed accessible genomic regions using STARR-seq, a massively parallel reporter assay. This resulted in a genome-wide quantitative map of active mESC enhancers, in pluripotency and during the early stages of differentiation. We find that only a minority of accessible regions is active and that such regions are enriched near promoters, characterized by specific chromatin marks, enriched for distinct sequence motifs, and modeling shows that active regions can be predicted from sequence alone. Regions that change their activity upon retinoic acid-induced differentiation are more prevalent at distal intergenic regions when compared to constitutively active enhancers. Further, analysis of differentially active enhancers verified the contribution of individual TF motifs toward activity and inducibility as well as their role in regulating endogenous genes. Notably, the activity of retinoic acid receptor alpha (RARα) occupied regions can either increase or decrease upon the addition of its ligand, retinoic acid, with the direction of the change correlating with spacing and orientation of the RARα consensus motif and the co-occurrence of additional sequence motifs. Together, our genome-wide enhancer activity map elucidates features associated with enhancer activity levels, identifies regulatory regions disregarded by computational prediction tools, and provides a resource for future studies into regulatory elements in mESCs.
胚胎干细胞 (ESCs) 可以分化为任何给定的细胞类型,因此代表了研究基因调控与分化之间联系的多功能模型。为了定量评估鼠 ESC 分化早期增强子活性的动力学,我们使用 STARR-seq(一种大规模平行报告物测定法)分析了可及基因组区域。这导致了活跃的 mESC 增强子的全基因组定量图谱,包括在多能性和早期分化阶段。我们发现只有少数可及区域是活跃的,并且这些区域在启动子附近富集,具有特定的染色质标记、富含独特的序列基序,并且建模表明可以仅从序列预测活跃区域。与组成性活跃增强子相比,在视黄酸诱导分化时改变其活性的区域在远端基因间区域更为普遍。此外,对差异活跃增强子的分析验证了单个 TF 基序对活性和诱导性的贡献,以及它们在调节内源性基因中的作用。值得注意的是,视黄酸受体 α (RARα) 占据区域的活性在添加其配体视黄酸时可以增加或减少,变化的方向与 RARα 共识基序的间隔和取向以及其他序列基序的共同出现相关。总之,我们的全基因组增强子活性图谱阐明了与增强子活性水平相关的特征,确定了被计算预测工具忽略的调控区域,并为未来研究 mESCs 中的调控元件提供了资源。