Kirchhoff-Institute for Physics, Heidelberg University, INF 227, 69117 Heidelberg, Germany.
Institute for Human Genetics, University Hospital Heidelberg, INF 366, 69117 Heidelberg, Germany.
Genes (Basel). 2024 Sep 25;15(10):1247. doi: 10.3390/genes15101247.
It is widely accepted that the 3D chromatin organization in human cell nuclei is not random and recent investigations point towards an interactive relation of epigenetic functioning and chromatin (re-)organization. Although chromatin organization seems to be the result of self-organization of the entirety of all molecules available in the cell nucleus, a general question remains open as to what extent chromatin organization might additionally be predetermined by the DNA sequence and, if so, if there are characteristic differences that distinguish typical regions involved in dysfunction-related aberrations from normal ones, since typical DNA breakpoint regions involved in disease-related chromosome aberrations are not randomly distributed along the DNA sequence.
Highly conserved -mer patterns in intronic and intergenic regions have been reported in eukaryotic genomes. In this article, we search and analyze regions deviating from average spectra (ReDFAS) of -mer word frequencies in the human genome. This includes all assembled regions, e.g., telomeric, centromeric, genic as well as intergenic regions.
A positive correlation between -mer spectra and 3D contact frequencies, obtained exemplarily from given Hi-C datasets, has been found indicating a relation of ReDFAS to chromatin organization and interactions. We also searched and found correlations of known functional annotations, e.g., genes correlating with ReDFAS. Selected regions known to contain typical breakpoints on chromosomes 9 and 5 that are involved in cancer-related chromosomal aberrations appear to be enriched in ReDFAS. Since transposable elements like ALUs are often assigned as major players in 3D genome organization, we also studied their impact on our examples but could not find a correlation between ALU regions and breakpoints comparable to ReDFAS.
Our findings might show that ReDFAS are associated with instable regions of the genome and regions with many chromatin contacts which is in line with current research indicating that chromatin loop anchor points lead to genomic instability.
人们普遍认为,人类细胞核中的 3D 染色质组织并非随机的,最近的研究表明,表观遗传功能和染色质(重新)组织之间存在相互关系。尽管染色质组织似乎是细胞核中所有可用分子的整体自组织的结果,但仍有一个未解决的问题,即染色质组织在多大程度上可能还预先由 DNA 序列决定,如果是这样,是否存在区分与功能相关的异常相关的典型区域与正常区域的特征差异,因为涉及疾病相关染色体异常的典型 DNA 断裂点区域并非随机分布在 DNA 序列中。
真核生物基因组中报告了高度保守的 -mer 模式存在于内含子和基因间区。在本文中,我们搜索并分析了人类基因组中 -mer 单词频率平均谱(ReDFAS)偏离的区域。这包括所有组装的区域,例如端粒、着丝粒、基因以及基因间区域。
已经发现 -mer 谱与从给定 Hi-C 数据集获得的 3D 接触频率之间存在正相关,这表明 ReDFAS 与染色质组织和相互作用有关。我们还搜索并找到了已知功能注释的相关性,例如与 ReDFAS 相关的基因。已知包含染色体 9 和 5 上与癌症相关的染色体异常相关的典型断点的选定区域似乎富含 ReDFAS。由于转座元件(如 ALU)通常被认为是 3D 基因组组织的主要参与者,我们也研究了它们对我们的例子的影响,但未能找到与 ReDFAS 可比的 ALU 区域和断点之间的相关性。
我们的发现可能表明,ReDFAS 与基因组不稳定区域和具有许多染色质接触的区域相关,这与当前的研究一致,表明染色质环锚点导致基因组不稳定性。