Edison Center for Genome Sciences and Systems Biology, Washington University in St. Louis, St. Louis, United States.
Department of Genetics, Washington University in St. Louis, St. Louis, United States.
Elife. 2020 Feb 11;9:e41279. doi: 10.7554/eLife.41279.
In embryonic stem cells (ESCs), a core transcription factor (TF) network establishes the gene expression program necessary for pluripotency. To address how interactions between four key TFs contribute to regulation in mouse ESCs, we assayed two massively parallel reporter assay (MPRA) libraries composed of binding sites for SOX2, POU5F1 (OCT4), KLF4, and ESRRB. Comparisons between synthetic -regulatory elements and genomic sequences with comparable binding site configurations revealed some aspects of a regulatory grammar. The expression of synthetic elements is influenced by both the number and arrangement of binding sites. This grammar plays only a small role for genomic sequences, as the relative activities of genomic sequences are best explained by the predicted occupancy of binding sites, regardless of binding site identity and positioning. Our results suggest that the effects of transcription factor binding sites (TFBS) are influenced by the order and orientation of sites, but that in the genome the overall occupancy of TFs is the primary determinant of activity.
在胚胎干细胞 (ESCs) 中,核心转录因子 (TF) 网络建立了多能性所必需的基因表达程序。为了解析四个关键 TF 之间的相互作用如何有助于调控小鼠 ESCs,我们分析了由 SOX2、POU5F1(OCT4)、KLF4 和 ESRRB 结合位点组成的两个大规模平行报告基因检测 (MPRA) 文库。合成调控元件与具有可比结合位点构型的基因组序列之间的比较揭示了调控语法的某些方面。合成元件的表达受结合位点数量和排列的影响。这种语法对基因组序列的作用很小,因为基因组序列的相对活性最好通过预测的结合位点占据来解释,而与结合位点的身份和定位无关。我们的结果表明,转录因子结合位点 (TFBS) 的影响受到位点的顺序和定向的影响,但在基因组中,TF 的整体占据是活性的主要决定因素。