Srivastav Satyam, Feschotte Cédric, Clark Andrew G
Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA.
bioRxiv. 2023 May 23:2023.05.08.539910. doi: 10.1101/2023.05.08.539910.
Animal genomes are parasitized by a horde of transposable elements (TEs) whose mutagenic activity can have catastrophic consequences. The piRNA pathway is a conserved mechanism to repress TE activity in the germline via a specialized class of small RNAs associated with effector Piwi proteins called piwi-associated RNAs (piRNAs). piRNAs are produced from discrete genomic regions called piRNA clusters (piCs). While piCs are generally enriched for TE sequences and the molecular processes by which they are transcribed and regulated are relatively well understood in , much less is known about the origin and evolution of piCs in this or any other species. To investigate piC evolution, we use a population genomics approach to compare piC activity and sequence composition across 8 geographically distant strains of with high quality long-read genome assemblies. We perform extensive annotations of ovary piCs and TE content in each strain and test predictions of two proposed models of piC evolution. The model posits that individual TE insertions can spontaneously attain the status of a small piC to generate piRNAs silencing the entire TE family. The 'trap' model envisions large and evolutionary stable genomic clusters where TEs tend to accumulate and serves as a long-term "memory" of ancient TE invasions and produce a great variety of piRNAs protecting against related TEs entering the genome. It remains unclear which model best describes the evolution of piCs. Our analysis uncovers extensive variation in piC activity across strains and signatures of rapid birth and death of piCs in natural populations. Most TE families inferred to be recently or currently active show an enrichment of strain-specific insertions into large piCs, consistent with the trap model. By contrast, only a small subset of active LTR retrotransposon families is enriched for the formation of strain-specific piCs, suggesting that these families have an inherent proclivity to form piCs. Thus, our findings support aspects of both and 'trap' models of piC evolution. We propose that these two models represent two extreme stages along an evolutionary continuum, which begins with the emergence of piCs from a few specific LTR retrotransposon insertions that subsequently expand by accretion of other TE insertions during evolution to form larger 'trap' clusters. Our study shows that piCs are evolutionarily labile and that TEs themselves are the major force driving the formation and evolution of piCs.
动物基因组被大量转座元件(TEs)寄生,这些元件的诱变活性可能会产生灾难性后果。piRNA途径是一种保守机制,通过一类与效应蛋白Piwi相关的特殊小RNA(称为piwi相关RNA,即piRNAs)来抑制生殖系中的TE活性。piRNAs由称为piRNA簇(piCs)的离散基因组区域产生。虽然piCs通常富含TE序列,并且它们的转录和调控的分子过程在[具体物种]中相对较为清楚,但对于该物种或任何其他物种中piCs的起源和进化了解较少。为了研究piC的进化,我们采用群体基因组学方法,比较了8个地理上相距遥远的[具体物种]菌株的piC活性和序列组成,这些菌株具有高质量的长读长基因组组装。我们对每个菌株的卵巢piCs和TE含量进行了广泛注释,并测试了两种提出的piC进化模型的预测。“种子”模型假定单个TE插入可以自发获得小piC的状态,以产生沉默整个TE家族的piRNAs。“陷阱”模型设想了大型且进化稳定的基因组簇,其中TEs倾向于积累,并作为古代TE入侵的长期“记忆”,并产生各种各样的piRNAs,防止相关TEs进入基因组。目前尚不清楚哪种模型最能描述piCs的进化。我们的分析揭示了不同菌株间piC活性的广泛差异以及自然种群中piCs快速产生和消亡的特征。大多数推断为近期或当前活跃的TE家族显示出在大型piCs中菌株特异性插入的富集,这与“陷阱”模型一致。相比之下,只有一小部分活跃的LTR逆转录转座子家族在形成菌株特异性piCs方面富集,这表明这些家族具有形成piCs的内在倾向。因此,我们的发现支持了piC进化的“种子”和“陷阱”模型的某些方面。我们提出,这两种模型代表了进化连续体上的两个极端阶段,其始于从一些特定的LTR逆转录转座子插入中出现piCs,随后在进化过程中通过其他TE插入的积累而扩展,形成更大的“陷阱”簇。我们的研究表明,piCs在进化上是不稳定的,并且TEs本身是驱动piCs形成和进化的主要力量。