Iyer Lakshminarayan M, Anantharaman Vivek, Wolf Maxim Y, Aravind L
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
Int J Parasitol. 2008 Jan;38(1):1-31. doi: 10.1016/j.ijpara.2007.07.018. Epub 2007 Sep 15.
Comparative genomics of parasitic protists and their free-living relatives are profoundly impacting our understanding of the regulatory systems involved in transcription and chromatin dynamics. While some parts of these systems are highly conserved, other parts are rapidly evolving, thereby providing the molecular basis for the variety in the regulatory adaptations of eukaryotes. The gross number of specific transcription factors and chromatin proteins are positively correlated with proteome size in eukaryotes. However, the individual types of specific transcription factors show an enormous variety across different eukaryotic lineages. The dominant families of specific transcription factors even differ between sister lineages, and have been shaped by gene loss and lineage-specific expansions. Recognition of this principle has helped in identifying the hitherto unknown, major specific transcription factors of several parasites, such as apicomplexans, Entamoeba histolytica, Trichomonas vaginalis, Phytophthora and ciliates. Comparative analysis of predicted chromatin proteins from protists allows reconstruction of the early evolutionary history of histone and DNA modification, nucleosome assembly and chromatin-remodeling systems. Many key catalytic, peptide-binding and DNA-binding domains in these systems ultimately had bacterial precursors, but were put together into distinctive regulatory complexes that are unique to the eukaryotes. In the case of histone methylases, histone demethylases and SWI2/SNF2 ATPases, proliferation of paralogous families followed by acquisition of novel domain architectures, seem to have played a major role in producing a diverse set of enzymes that create and respond to an epigenetic code of modified histones. The diversification of histone acetylases and DNA methylases appears to have proceeded via repeated emergence of new versions, most probably via transfers from bacteria to different eukaryotic lineages, again resulting in lineage-specific diversity in epigenetic signals. Even though the key histone modifications are universal to eukaryotes, domain architectures of proteins binding post-translationally modified-histones vary considerably across eukaryotes. This indicates that the histone code might be "interpreted" differently from model organisms in parasitic protists and their relatives. The complexity of domain architectures of chromatin proteins appears to have increased during eukaryotic evolution. Thus, Trichomonas, Giardia, Naegleria and kinetoplastids have relatively simple domain architectures, whereas apicomplexans and oomycetes have more complex architectures. RNA-dependent post-transcriptional silencing systems, which interact with chromatin-level regulatory systems, show considerable variability across parasitic protists, with complete loss in many apicomplexans and partial loss in Trichomonas vaginalis. This evolutionary synthesis offers a robust scaffold for future investigation of transcription and chromatin structure in parasitic protists.
寄生原生生物与其自由生活的亲缘生物的比较基因组学,正在深刻影响我们对转录和染色质动态变化所涉及调控系统的理解。虽然这些系统的某些部分高度保守,但其他部分却在快速进化,从而为真核生物调控适应性的多样性提供了分子基础。真核生物中特定转录因子和染色质蛋白的总数与蛋白质组大小呈正相关。然而,特定转录因子的个体类型在不同真核生物谱系中表现出极大的差异。特定转录因子的优势家族在姐妹谱系之间甚至也有所不同,并且受到基因丢失和谱系特异性扩增的影响。认识到这一原则有助于鉴定几种寄生虫(如顶复门原虫、溶组织内阿米巴、阴道毛滴虫、疫霉和纤毛虫)中迄今未知的主要特定转录因子。对原生生物预测的染色质蛋白进行比较分析,有助于重建组蛋白和DNA修饰、核小体组装及染色质重塑系统的早期进化历史。这些系统中的许多关键催化、肽结合和DNA结合结构域最终都有细菌前体,但被组合成了真核生物特有的独特调控复合物。就组蛋白甲基化酶、组蛋白去甲基化酶和SWI2/SNF2 ATP酶而言,旁系同源家族的增殖以及随后获得新的结构域架构,似乎在产生多种能够产生并响应修饰组蛋白表观遗传密码的酶方面发挥了重要作用。组蛋白乙酰化酶和DNA甲基化酶的多样化似乎是通过新版本的反复出现而进行的,很可能是通过从细菌转移到不同的真核生物谱系,这同样导致了表观遗传信号的谱系特异性多样性。尽管关键的组蛋白修饰在真核生物中是普遍存在的,但翻译后修饰组蛋白结合蛋白的结构域架构在不同真核生物中差异很大。这表明在寄生原生生物及其亲缘生物中,组蛋白密码的“解读”可能与模式生物不同。染色质蛋白结构域架构的复杂性在真核生物进化过程中似乎有所增加。因此,阴道毛滴虫、贾第虫、耐格里属变形虫和动质体具有相对简单的结构域架构,而顶复门原虫和卵菌则具有更复杂的架构。与染色质水平调控系统相互作用的RNA依赖性转录后沉默系统,在寄生原生生物中表现出相当大的变异性,在许多顶复门原虫中完全缺失,在阴道毛滴虫中部分缺失。这种进化综合为未来研究寄生原生生物中的转录和染色质结构提供了一个坚实的框架。