Department of Biological Resources, Eötvös Loránd Research Network, Centre for Agricultural Research, 2462 Martonvásár, Hungary.
Department of Molecular Breeding, Eötvös Loránd Research Network, Centre for Agricultural Research, 2462 Martonvásár, Hungary.
Genes (Basel). 2022 Nov 23;13(12):2189. doi: 10.3390/genes13122189.
Chromatin-chromatin interactions and three-dimensional (3D) spatial structures are involved in transcriptional regulation and have a decisive role in DNA replication and repair. To understand how individual genes and their regulatory elements function within the larger genomic context, and how the genome reacts to environmental stimuli, the linear sequence information needs to be interpreted in three-dimensional space, which is still a challenging task. Here, we propose a novel, heuristic approach to represent Hi-C datasets by a whole-genomic pseudo-structure in 3D space. The baseline of our approach is the construction of a multigraph from genomic-sequence data and Hi-C interaction data, then applying a modified force-directed layout algorithm. The resulting layout is a pseudo-structure. While pseudo-structures are not based on direct observation and their details are inherent to settings, surprisingly, they demonstrate interesting, overall similarities of known genome structures of both barley and rice, namely, the Rabl and Rosette-like conformation. It has an exciting potential to be extended by additional omics data (RNA-seq, Chip-seq, etc.), allowing to visualize the dynamics of the pseudo-structures across various tissues or developmental stages. Furthermore, this novel method would make it possible to revisit most Hi-C data accumulated in the public domain in the last decade.
染色质-染色质相互作用和三维(3D)空间结构参与转录调控,并在 DNA 复制和修复中起决定性作用。为了了解单个基因及其调控元件在更大的基因组背景中如何发挥作用,以及基因组如何对环境刺激做出反应,需要在三维空间中解释线性序列信息,这仍然是一项具有挑战性的任务。在这里,我们提出了一种新的启发式方法,通过 3D 空间中的全基因组伪结构来表示 Hi-C 数据集。我们方法的基础是从基因组序列数据和 Hi-C 相互作用数据构建一个多图,然后应用一种改进的力导向布局算法。得到的布局是一个伪结构。虽然伪结构不是基于直接观察,其细节是固有设置的,但令人惊讶的是,它们展示了有趣的、整体上类似于大麦和水稻已知基因组结构的相似性,即 Rabl 和 Rosette 样构象。它具有通过额外的组学数据(RNA-seq、Chip-seq 等)进行扩展的令人兴奋的潜力,允许在不同组织或发育阶段可视化伪结构的动态。此外,这种新方法将使人们能够重新审视过去十年中在公共领域积累的大多数 Hi-C 数据。