Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Siberian Branch, 630090 Novosibirsk, Russia.
Institute of Cytology and Genetics, Russian Academy of Sciences, Siberian Branch, 630090 Novosibirsk, Russia.
Int J Mol Sci. 2022 Oct 27;23(21):13063. doi: 10.3390/ijms232113063.
Tandemly arranged and dispersed repetitive DNA sequences are important structural and functional elements that make up a significant portion of vertebrate genomes. Using high throughput, low coverage whole genome sequencing followed by bioinformatics analysis, we have identified seven major tandem repetitive DNAs and two fragments of LTR retrotransposons in the genome of the Nile crocodile (, 2n = 32). The repeats showed great variability in structure, genomic organization, and chromosomal distribution as revealed by fluorescence in situ hybridization (FISH). We found that centromeric and pericentromeric heterochromatin of is composed of previously described in CSI-dIII and CSI-I repetitive sequence families, a satellite revealed in , and additionally contains at least three previously unannotated tandem repeats. Both LTR sequences identified here belong to the ERV1 family of endogenous retroviruses. Each pericentromeric region was characterized by a diverse set of repeats, with the exception of chromosome pair 4, in which we found only one type of satellite. Only a few repeats showed non-centromeric signals in addition to their centromeric localization. Mapping of 18S-28S ribosomal RNA genes and telomeric sequences (TTAGGG) did not demonstrate any co-localization of these sequences with revealed centromeric and pericentromeric heterochromatic blocks.
串联排列和分散的重复 DNA 序列是构成脊椎动物基因组重要部分的重要结构和功能元件。通过高通量、低覆盖度的全基因组测序,然后进行生物信息学分析,我们在尼罗河鳄(,2n=32)的基因组中鉴定了七个主要的串联重复 DNA 序列和两个 LTR 反转录转座子片段。通过荧光原位杂交(FISH)发现,这些重复序列在结构、基因组组织和染色体分布上具有很大的变异性。我们发现,的着丝粒和近着丝粒异染色质由先前在 CSI-dIII 和 CSI-I 重复序列家族中描述的、在中揭示的卫星以及至少三个以前未注释的串联重复组成。这里鉴定的两个 LTR 序列都属于内源性逆转录病毒 ERV1 家族。除了第 4 号染色体外,每个着丝粒区域都具有一组不同的重复序列,而在第 4 号染色体中,我们只发现了一种类型的卫星。除了它们的着丝粒定位外,只有少数重复序列显示出非着丝粒信号。18S-28S 核糖体 RNA 基因和端粒序列(TTAGGG)的定位并没有显示这些序列与鉴定的着丝粒和近着丝粒异染色质块有任何共定位。