Structural Studies Division, MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK.
Nucleic Acids Res. 2013 Jan;41(2):701-10. doi: 10.1093/nar/gks1096. Epub 2012 Nov 21.
Experimental techniques for the investigation of three-dimensional (3D) genome organization are being developed at a fast pace. Currently, the associated computational methods are mostly specific to the individual experimental approach. Here we present a general statistical framework that is widely applicable to the analysis of genomic contact maps, irrespective of the data acquisition and normalization processes. Within this framework DNA-DNA contact data are represented as a complex network, for which a broad number of directly applicable methods already exist. In such a network representation, DNA segments and contacts between them are denoted as nodes and edges, respectively. Furthermore, we present a robust method for generating randomized contact networks that explicitly take into account the inherent 3D nature of the genome and serve as realistic null-models for unbiased statistical analyses. By integrating a variety of large-scale genome-wide datasets we demonstrate that meiotic crossover sites display enriched genomic contacts and that cohesin-bound genes are significantly colocalized in the yeast nucleus. We anticipate that the complex network framework in conjunction with the randomization of DNA-DNA contact networks will become a widely used tool in the study of nuclear architecture.
实验技术的研究三维(3D)基因组组织正在快速发展。目前,相关的计算方法大多是特定于个别实验方法。在这里,我们提出了一个通用的统计框架,广泛适用于基因组接触图谱的分析,而不考虑数据获取和归一化过程。在这个框架内,DNA-DNA 接触数据被表示为一个复杂的网络,对于这个网络,已经存在大量可直接应用的方法。在这种网络表示中,DNA 片段和它们之间的接触分别表示为节点和边。此外,我们提出了一种生成随机接触网络的稳健方法,该方法明确考虑了基因组的固有 3D 性质,并作为无偏统计分析的现实零模型。通过整合各种大规模全基因组数据集,我们证明了减数分裂交叉位点显示出丰富的基因组接触,并且黏着蛋白结合基因在酵母核中显著共定位。我们预计,复杂的网络框架结合 DNA-DNA 接触网络的随机化将成为研究核结构的广泛使用的工具。