Departamento de Ciências Biológicas, Faculdade de Ciências, Universidade Estadual Paulista, UNESP, Campus de Bauru, Bauru, Sao Paulo, Brazil.
Departamento de Biologia Estrutural e Funcional, Instituto de Biociências de Botucatu, Universidade Estadual Paulista, UNESP, Botucatu, Sao Paulo, Brazil.
Genome Biol Evol. 2021 Feb 3;13(2). doi: 10.1093/gbe/evab002.
Eukaryotic genomes contain large amounts of repetitive DNA sequences, such as tandemly repeated satellite DNAs (satDNAs). These sequences are highly dynamic and tend to be genus- or species-specific due to their particular evolutionary pathways, although there are few unusual cases of conserved satDNAs over long periods of time. Here, we used multiple approaches to reveal that an satDNA named CharSat01-52 originated in the last common ancestor of Characoidei fish, a superfamily within the Characiformes order, ∼140-78 Ma, whereas its nucleotide composition has remained considerably conserved in several taxa. We show that 14 distantly related species within Characoidei share the presence of this satDNA, which is highly amplified and clustered in subtelomeric regions in a single species (Characidium gomesi), while remained organized as small clusters in all the other species. Defying predictions of the molecular drive of satellite evolution, CharSat01-52 shows similar values of intra- and interspecific divergence. Although we did not provide evidence for a specific functional role of CharSat01-52, its transcriptional activity was demonstrated in different species. In addition, we identified short tandem arrays of CharSat01-52 embedded within single-molecule real-time long reads of Astyanax paranae (536 bp-3.1 kb) and A. mexicanus (501 bp-3.9 kb). Such arrays consisted of head-to-tail repeats and could be found interspersed with other sequences, inverted sequences, or neighbored by other satellites. Our results provide a detailed characterization of an old and conserved satDNA, challenging general predictions of satDNA evolution.
真核生物基因组含有大量重复的 DNA 序列,如串联重复卫星 DNA(satDNA)。这些序列具有高度的动态性,由于其特定的进化途径,往往具有属或种特异性,尽管在很长一段时间内,也有一些罕见的保守 satDNA 存在。在这里,我们使用多种方法揭示了一种名为 CharSat01-52 的 satDNA 起源于 Characoidei 鱼类的最后共同祖先,Characoidei 是 Characiformes 目内的一个超科,约在 140-78 百万年前,而其核苷酸组成在几个分类群中仍然保持相当的保守。我们表明,Characoidei 内的 14 个远缘物种都存在这种 satDNA,它在一个物种(Characidium gomesi)的端粒区域高度扩增和聚类,而在所有其他物种中仍然以小簇的形式存在。这与卫星进化的分子驱动的预测相悖,CharSat01-52 显示出相似的种内和种间分歧值。虽然我们没有提供 CharSat01-52 特定功能作用的证据,但它的转录活性在不同物种中得到了证明。此外,我们在 Astyanax paranae(536bp-3.1kb)和 A. mexicanus(501bp-3.9kb)的单分子实时长读序列中鉴定了短串联重复的 CharSat01-52 。这些阵列由头对头重复组成,可以在其他序列之间、反向序列之间或与其他卫星相邻找到。我们的研究结果为一个古老而保守的 satDNA 提供了详细的特征描述,挑战了 satDNA 进化的一般预测。