CRI Genetics LLC, Santa Monica, CA 90404, USA.
Department of Medicine, University of Toledo, Toledo, OH 43606, USA.
Genes (Basel). 2022 Nov 7;13(11):2053. doi: 10.3390/genes13112053.
The public UCNEbase database, comprising 4273 human ultra-conserved noncoding elements (UCNEs), was thoroughly investigated with the aim to find any nucleotide signals or motifs that have made these DNA sequences practically unchanged over three hundred million years of evolution. Each UCNE comprises over 200 nucleotides and has at least 95% identity between humans and chickens. A total of 31,046 SNPs were found within the UCNE database. We demonstrated that every human has over 300 mutations within 4273 UCNEs. No association of UCNEs with non-coding RNAs, nor preference of a particular meiotic recombination rate within them were found. No sequence motifs associated with UCNEs nor their flanking regions have been found. However, we demonstrated that UCNEs have strong nucleotide and dinucleotide sequence abnormalities compared to genome averages. Specifically, UCNEs are depleted for CC and GG dinucleotides, while GC dinucleotides are in excess of 28%. Importantly, GC dinucleotides have extraordinarily strong stacking free-energy inside the DNA helix and unique resistance to dissociation. Based on the adjacent nucleotide stacking abnormalities within UCNEs, we conjecture that peculiarities in dinucleotide distribution within UCNEs may create unique 3D conformation and specificity to bind proteins. We also discuss the strange dynamics of multiple SNPs inside UCNEs and reasons why these sequences are extraordinarily conserved.
公共 UCNEbase 数据库包含 4273 个人类超保守非编码元件 (UCNE),对其进行了彻底研究,旨在寻找使这些 DNA 序列在三亿多年的进化过程中基本不变的任何核苷酸信号或基序。每个 UCNE 由超过 200 个核苷酸组成,在人类和鸡之间具有至少 95%的同一性。在 UCNE 数据库中发现了 31046 个 SNPs。我们证明,每个人类在 4273 个 UCNE 中都有超过 300 个突变。没有发现 UCNE 与非编码 RNA 之间的关联,也没有发现它们内部特定减数分裂重组率的偏好。没有发现与 UCNE 相关的序列基序或其侧翼区域。然而,我们证明与基因组平均值相比,UCNE 具有强烈的核苷酸和二核苷酸序列异常。具体来说,UCNE 中 CC 和 GG 二核苷酸缺失,而 GC 二核苷酸过量 28%。重要的是,GC 二核苷酸在 DNA 螺旋内具有极强的堆积自由能,并且对解离具有独特的抵抗力。基于 UCNE 内相邻核苷酸堆积异常,我们推测 UCNE 内二核苷酸分布的特殊性可能会产生独特的 3D 构象和与蛋白质结合的特异性。我们还讨论了 UCNE 内多个 SNPs 的奇怪动态以及这些序列为何如此保守的原因。