Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts 01605-0103, USA.
Nature. 2012 Sep 6;489(7414):109-13. doi: 10.1038/nature11279.
The vast non-coding portion of the human genome is full of functional elements and disease-causing regulatory variants. The principles defining the relationships between these elements and distal target genes remain unknown. Promoters and distal elements can engage in looping interactions that have been implicated in gene regulation. Here we have applied chromosome conformation capture carbon copy (5C) to interrogate comprehensively interactions between transcription start sites (TSSs) and distal elements in 1% of the human genome representing the ENCODE pilot project regions. 5C maps were generated for GM12878, K562 and HeLa-S3 cells and results were integrated with data from the ENCODE consortium. In each cell line we discovered >1,000 long-range interactions between promoters and distal sites that include elements resembling enhancers, promoters and CTCF-bound sites. We observed significant correlations between gene expression, promoter-enhancer interactions and the presence of enhancer RNAs. Long-range interactions show marked asymmetry with a bias for interactions with elements located ∼120 kilobases upstream of the TSS. Long-range interactions are often not blocked by sites bound by CTCF and cohesin, indicating that many of these sites do not demarcate physically insulated gene domains. Furthermore, only ∼7% of looping interactions are with the nearest gene, indicating that genomic proximity is not a simple predictor for long-range interactions. Finally, promoters and distal elements are engaged in multiple long-range interactions to form complex networks. Our results start to place genes and regulatory elements in three-dimensional context, revealing their functional relationships.
人类基因组的大片非编码区域充满了功能元件和致病调控变体。定义这些元件与远端靶基因之间关系的原则仍然未知。启动子和远端元件可以进行环化相互作用,这些相互作用已被牵连到基因调控中。在这里,我们应用染色体构象捕获碳拷贝(5C)全面研究了转录起始位点(TSS)和代表 ENCODE 试点项目区域的人类基因组 1%中的远端元件之间的相互作用。为 GM12878、K562 和 HeLa-S3 细胞生成了 5C 图谱,并将结果与 ENCODE 联盟的数据进行了整合。在每种细胞系中,我们发现了超过 1000 个启动子和远端位点之间的长程相互作用,其中包括类似于增强子、启动子和 CTCF 结合位点的元件。我们观察到基因表达、启动子-增强子相互作用和增强子 RNA 的存在之间存在显著相关性。长程相互作用显示出明显的不对称性,偏向于与位于 TSS 上游约 120kb 的元件相互作用。长程相互作用通常不受 CTCF 和黏连蛋白结合位点的阻碍,这表明许多这些位点并不标志着物理隔离的基因域。此外,只有约 7%的环化相互作用与最近的基因有关,这表明基因组接近度并不是长程相互作用的简单预测因素。最后,启动子和远端元件参与多个长程相互作用以形成复杂的网络。我们的结果开始将基因和调控元件置于三维环境中,揭示它们的功能关系。