Li Yuan, Chen Xiao, Wu Kun, Pan Jiao, Long Hongan, Yan Ying
Institute of Evolution & Marine Biodiversity, KLMME, Ocean University of China, Qingdao 266003, China.
Microorganisms. 2020 May 1;8(5):662. doi: 10.3390/microorganisms8050662.
Simple sequence repeats (SSRs) are prevalent in the genomes of all organisms. They are widely used as genetic markers, and are insertion/deletion mutation hotspots, which directly influence genome evolution. However, little is known about such important genomic components in ciliated protists, a large group of unicellular eukaryotes with extremely long evolutionary history and genome diversity. With recent publications of multiple ciliate genomes, we start to get a chance to explore perfect SSRs with motif size 1-100 bp and at least three motif repeats in nine species of two ciliate classes, Oligohymenophorea and Spirotrichea. We found that homopolymers are the most prevalent SSRs in these A/T-rich species, with AAA (lysine, charged amino acid; also seen as an SSR with one-adenine motif repeated three times) being the codons repeated at the highest frequencies in coding SSR regions, consistent with the widespread alveolin proteins rich in lysine repeats as found in . Micronuclear SSRs are universally more abundant than the macronuclear ones of the same motif-size, except for the 8-bp-motif SSRs in extensively fragmented chromosomes. Both the abundance and A/T content of SSRs decrease as motif-size increases, while the abundance is positively correlated with the A/T content of the genome. Also, smaller genomes have lower proportions of coding SSRs out of all SSRs in species. This genome-wide and cross-species analysis reveals the high diversity of SSRs and reflects the rapid evolution of these simple repetitive elements in ciliate genomes.
简单序列重复(SSRs)在所有生物的基因组中都很普遍。它们被广泛用作遗传标记,并且是插入/缺失突变热点,直接影响基因组进化。然而,对于纤毛虫这类具有极长进化历史和基因组多样性的单细胞真核生物中的这些重要基因组成分,人们却知之甚少。随着多个纤毛虫基因组的近期发表,我们开始有机会在寡膜纲和旋毛纲这两个纤毛虫类别的九个物种中探索基序大小为1 - 100 bp且至少有三个基序重复的完美SSRs。我们发现,在这些富含A/T的物种中,同聚物是最普遍的SSRs,AAA(赖氨酸,带电荷氨基酸;也被视为单腺嘌呤基序重复三次的SSRs)是编码SSRs区域中重复频率最高的密码子,这与广泛存在的富含赖氨酸重复序列的泡膜蛋白一致。除了在高度碎片化染色体中的8 - bp基序SSRs外,微核SSRs普遍比相同基序大小的大核SSRs更丰富。随着基序大小的增加,SSRs的丰度和A/T含量均降低,而丰度与基因组的A/T含量呈正相关。此外,在物种中,较小的基因组中编码SSRs在所有SSRs中的比例较低。这种全基因组和跨物种分析揭示了SSRs的高度多样性,并反映了这些简单重复元件在纤毛虫基因组中的快速进化。