Department of Biology, Indiana University, Bloomington, IN.
Institute of Hydrobiology, Technische Universität Dresden, Dresden, Germany.
Mol Biol Evol. 2017 May 1;34(5):1194-1216. doi: 10.1093/molbev/msx074.
Population-genomic analyses are essential to understanding factors shaping genomic variation and lineage-specific sequence constraints. The dearth of such analyses for unicellular eukaryotes prompted us to assess genomic variation in Paramecium, one of the most well-studied ciliate genera. The Paramecium aurelia complex consists of ∼15 morphologically indistinguishable species that diverged subsequent to two rounds of whole-genome duplications (WGDs, as long as 320 MYA) and possess extremely streamlined genomes. We examine patterns of both nuclear and mitochondrial polymorphism, by sequencing whole genomes of 10-13 worldwide isolates of each of three species belonging to the P. aurelia complex: P. tetraurelia, P. biaurelia, P. sexaurelia, as well as two outgroup species that do not share the WGDs: P. caudatum and P. multimicronucleatum. An apparent absence of global geographic population structure suggests continuous or recent dispersal of Paramecium over long distances. Intergenic regions are highly constrained relative to coding sequences, especially in P. caudatum and P. multimicronucleatum that have shorter intergenic distances. Sequence diversity and divergence are reduced up to ∼100-150 bp both upstream and downstream of genes, suggesting strong constraints imposed by the presence of densely packed regulatory modules. In addition, comparison of sequence variation at non-synonymous and synonymous sites suggests similar recent selective pressures on paralogs within and orthologs across the deeply diverging species. This study presents the first genome-wide population-genomic analysis in ciliates and provides a valuable resource for future studies in evolutionary and functional genetics in Paramecium.
群体基因组分析对于理解塑造基因组变异和谱系特异性序列约束的因素至关重要。单细胞真核生物缺乏此类分析促使我们评估了 Paramecium 的基因组变异,Paramecium 是研究最深入的纤毛动物属之一。Paramecium aurelia 复合体由约 15 种形态上无法区分的物种组成,这些物种在两轮全基因组复制(WGD,长达 320 百万年前)后分化,并拥有极其精简的基因组。我们通过对属于 P. aurelia 复合体的三个物种(P. tetraurelia、P. biaurelia 和 P. sexaurelia)的 10-13 个全球分离株的全基因组进行测序,来研究核和线粒体多态性的模式,以及两个不属于 WGD 的外群物种:P. caudatum 和 P. multimicronucleatum。明显缺乏全球地理种群结构表明 Paramecium 能够在长距离内持续或最近地扩散。与编码序列相比,基因间区受到高度限制,特别是在 P. caudatum 和 P. multimicronucleatum 中,它们的基因间距离更短。在基因上下游,多达 100-150bp 的序列多样性和分化都减少了,这表明存在密集的调控模块对其施加了强烈的约束。此外,对非同义突变和同义突变位点的序列变异进行比较,表明在深度分化的物种中,同源基因和直系同源基因受到类似的近期选择压力。本研究首次在纤毛动物中进行了全基因组群体基因组分析,并为未来在 Paramecium 中进行进化和功能遗传学研究提供了有价值的资源。