van Berkum Nynke L, Lieberman-Aiden Erez, Williams Louise, Imakaev Maxim, Gnirke Andreas, Mirny Leonid A, Dekker Job, Lander Eric S
Program in Gene Function and Expression, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School.
J Vis Exp. 2010 May 6(39):1869. doi: 10.3791/1869.
The three-dimensional folding of chromosomes compartmentalizes the genome and and can bring distant functional elements, such as promoters and enhancers, into close spatial proximity (2-6). Deciphering the relationship between chromosome organization and genome activity will aid in understanding genomic processes, like transcription and replication. However, little is known about how chromosomes fold. Microscopy is unable to distinguish large numbers of loci simultaneously or at high resolution. To date, the detection of chromosomal interactions using chromosome conformation capture (3C) and its subsequent adaptations required the choice of a set of target loci, making genome-wide studies impossible (7-10). We developed Hi-C, an extension of 3C that is capable of identifying long range interactions in an unbiased, genome-wide fashion. In Hi-C, cells are fixed with formaldehyde, causing interacting loci to be bound to one another by means of covalent DNA-protein cross-links. When the DNA is subsequently fragmented with a restriction enzyme, these loci remain linked. A biotinylated residue is incorporated as the 5' overhangs are filled in. Next, blunt-end ligation is performed under dilute conditions that favor ligation events between cross-linked DNA fragments. This results in a genome-wide library of ligation products, corresponding to pairs of fragments that were originally in close proximity to each other in the nucleus. Each ligation product is marked with biotin at the site of the junction. The library is sheared, and the junctions are pulled-down with streptavidin beads. The purified junctions can subsequently be analyzed using a high-throughput sequencer, resulting in a catalog of interacting fragments. Direct analysis of the resulting contact matrix reveals numerous features of genomic organization, such as the presence of chromosome territories and the preferential association of small gene-rich chromosomes. Correlation analysis can be applied to the contact matrix, demonstrating that the human genome is segregated into two compartments: a less densely packed compartment containing open, accessible, and active chromatin and a more dense compartment containing closed, inaccessible, and inactive chromatin regions. Finally, ensemble analysis of the contact matrix, coupled with theoretical derivations and computational simulations, revealed that at the megabase scale Hi-C reveals features consistent with a fractal globule conformation.
染色体的三维折叠将基因组进行区室化,能够使启动子和增强子等远距离功能元件在空间上紧密靠近(2 - 6)。解读染色体组织与基因组活性之间的关系将有助于理解转录和复制等基因组过程。然而,对于染色体如何折叠却知之甚少。显微镜无法同时以高分辨率区分大量基因座。迄今为止,使用染色体构象捕获技术(3C)及其后续改进方法检测染色体相互作用需要选择一组目标基因座,这使得全基因组研究无法实现(7 - 10)。我们开发了Hi - C技术,它是3C技术的扩展,能够以无偏差、全基因组的方式识别长程相互作用。在Hi - C技术中,细胞用甲醛固定,使相互作用的基因座通过共价DNA - 蛋白质交联相互结合。当随后用限制性酶切割DNA时,这些基因座仍保持连接。在填补5'端突出端时掺入生物素化残基。接下来,在有利于交联DNA片段之间连接事件的稀释条件下进行平端连接。这产生了一个全基因组的连接产物文库,对应于最初在细胞核中彼此紧邻的片段对。每个连接产物在连接位点用生物素标记。文库被剪切,连接点用链霉亲和素磁珠拉下。随后可以使用高通量测序仪对纯化的连接点进行分析,从而得到相互作用片段的目录。对所得接触矩阵的直接分析揭示了基因组组织的许多特征,例如染色体区域的存在以及富含小基因的染色体的优先关联。相关分析可应用于接触矩阵,表明人类基因组被分为两个区室:一个包装密度较低的区室,包含开放、可及且活跃的染色质;另一个密度较高的区室,包含封闭、不可及且无活性的染色质区域。最后,对接触矩阵的整体分析,结合理论推导和计算模拟,揭示了在兆碱基尺度上Hi - C技术揭示的特征与分形球状体构象一致。