Broad Institute of MIT and Harvard, Cambridge, MA, USA.
Geisel School of Medicine at Dartmouth, Hanover, NH, USA.
Nature. 2022 Jul;607(7917):176-184. doi: 10.1038/s41586-022-04877-w. Epub 2022 May 20.
Gene regulation in the human genome is controlled by distal enhancers that activate specific nearby promoters. A proposed model for this specificity is that promoters have sequence-encoded preferences for certain enhancers, for example, mediated by interacting sets of transcription factors or cofactors. This 'biochemical compatibility' model has been supported by observations at individual human promoters and by genome-wide measurements in Drosophila. However, the degree to which human enhancers and promoters are intrinsically compatible has not yet been systematically measured, and how their activities combine to control RNA expression remains unclear. Here we design a high-throughput reporter assay called enhancer × promoter self-transcribing active regulatory region sequencing (ExP STARR-seq) and applied it to examine the combinatorial compatibilities of 1,000 enhancer and 1,000 promoter sequences in human K562 cells. We identify simple rules for enhancer-promoter compatibility, whereby most enhancers activate all promoters by similar amounts, and intrinsic enhancer and promoter activities multiplicatively combine to determine RNA output (R = 0.82). In addition, two classes of enhancers and promoters show subtle preferential effects. Promoters of housekeeping genes contain built-in activating motifs for factors such as GABPA and YY1, which decrease the responsiveness of promoters to distal enhancers. Promoters of variably expressed genes lack these motifs and show stronger responsiveness to enhancers. Together, this systematic assessment of enhancer-promoter compatibility suggests a multiplicative model tuned by enhancer and promoter class to control gene transcription in the human genome.
人类基因组中的基因调控受远端增强子控制,这些增强子激活特定的附近启动子。这种特异性的一个假设模型是,启动子对某些增强子具有序列编码偏好,例如通过相互作用的转录因子或共因子介导。这种“生化兼容性”模型得到了单个人类启动子的观察结果和果蝇的全基因组测量的支持。然而,人类增强子和启动子在多大程度上具有内在兼容性尚未得到系统测量,它们的活性如何组合来控制 RNA 表达仍然不清楚。在这里,我们设计了一种称为增强子×启动子自我转录活性调控区测序(ExP STARR-seq)的高通量报告基因检测,并将其应用于检测 1000 个增强子和 1000 个启动子序列在人类 K562 细胞中的组合兼容性。我们确定了增强子-启动子兼容性的简单规则,即大多数增强子以相似的量激活所有启动子,并且内在的增强子和启动子活性以乘法方式组合来决定 RNA 输出(R=0.82)。此外,两类增强子和启动子显示出微妙的偏好效应。管家基因的启动子包含 GABPA 和 YY1 等因子的内置激活基序,这些基序降低了启动子对远端增强子的反应性。表达可变基因的启动子缺乏这些基序,对增强子的反应性更强。总之,这种对增强子-启动子兼容性的系统评估表明,一种乘法模型可以通过增强子和启动子的类别来调节,以控制人类基因组中的基因转录。