Center for Genomics and Data Science Research, Division of Intramural Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA.
Genome Biol Evol. 2024 Apr 2;16(4). doi: 10.1093/gbe/evae052.
Conserved noncoding elements (CNEs) are DNA sequences located outside of protein-coding genes that can remain under purifying selection for up to hundreds of millions of years. Studies in vertebrate genomes have revealed that most CNEs carry out regulatory functions. Notably, many of them are enhancers that control the expression of homeodomain transcription factors and other genes that play crucial roles in embryonic development. To further our knowledge of CNEs in other parts of the animal tree, we conducted a large-scale characterization of CNEs in more than 50 genomes from three of the main branches of the metazoan tree: Cnidaria, Mollusca, and Arthropoda. We identified hundreds of thousands of CNEs and reconstructed the temporal dynamics of their appearance in each lineage, as well as determining their spatial distribution across genomes. We show that CNEs evolve repeatedly around the same genes across the Metazoa, including around homeodomain genes and other transcription factors; they also evolve repeatedly around genes involved in neural development. We also show that transposons are a major source of CNEs, confirming previous observations from vertebrates and suggesting that they have played a major role in wiring developmental gene regulatory mechanisms since the dawn of animal evolution.
保守的非编码元件(CNEs)是位于蛋白质编码基因之外的 DNA 序列,它们可以在长达数亿年的时间里受到净化选择的影响。脊椎动物基因组的研究表明,大多数 CNEs 发挥着调节作用。值得注意的是,其中许多是增强子,它们控制同源域转录因子和其他在胚胎发育中发挥关键作用的基因的表达。为了进一步了解动物树其他部分的 CNEs,我们对来自后生动物树的三个主要分支(刺胞动物门、软体动物门和节肢动物门)的 50 多个基因组中的 CNEs 进行了大规模的特征描述。我们鉴定了数十万 CNE,并重建了它们在每个谱系中出现的时间动态,以及确定了它们在基因组中的空间分布。我们表明,CNEs 在整个后生动物中在相同的基因周围重复进化,包括同源域基因和其他转录因子周围;它们也在参与神经发育的基因周围重复进化。我们还表明,转座子是 CNEs 的主要来源,这证实了来自脊椎动物的先前观察结果,并表明它们自动物进化的黎明以来就在布线发育基因调控机制方面发挥了重要作用。