Fang Tiffany, Mohseni Amir, Lonardi Stefano, Ben Mamoun Choukri
Department of Internal Medicine, Section of Infectious Diseases, Department of Microbial Pathogenesis and Department of Pathology, Yale School of Medicine, New Haven, CT, 06520 USA.
Department of Computer Science and Engineering, University of California, Riverside, CA, 92521 USA.
NAR Genom Bioinform. 2024 Apr 4;6(2):lqae032. doi: 10.1093/nargab/lqae032. eCollection 2024 Jun.
Evolutionary constraints greatly favor compact genomes that efficiently encode proteins. However, several eukaryotic organisms, including apicomplexan parasites such as , and , the causative agents of toxoplasmosis, malaria and babesiosis, respectively, encode very large proteins, exceeding 20 times their average protein size. Although these large proteins represent <1% of the total protein pool and are generally expressed at low levels, their persistence throughout evolution raises important questions about their functions and possible evolutionary pressures to maintain them. In this study, we examined the trends in gene and protein size, function and expression patterns within seven apicomplexan pathogens. Our analysis revealed that certain large proteins in apicomplexan parasites harbor domains potentially important for functions such as antigenic variation, erythrocyte invasion and immune evasion. However, these domains are not limited to or strictly conserved within large proteins. While some of these proteins are predicted to engage in conventional metabolic pathways within these parasites, others fulfill specialized functions for pathogen-host interactions, nutrient acquisition and overall survival.
进化限制极大地青睐于能高效编码蛋白质的紧凑基因组。然而,包括顶复门寄生虫(如分别导致弓形虫病、疟疾和巴贝斯虫病的 、 和 )在内的几种真核生物,编码的蛋白质非常大,超过其平均蛋白质大小的20倍。尽管这些大蛋白质占总蛋白质库的比例不到1%,且通常表达水平较低,但它们在整个进化过程中的持续存在引发了关于其功能以及维持它们的可能进化压力的重要问题。在本研究中,我们研究了七种顶复门病原体的基因和蛋白质大小、功能及表达模式的趋势。我们的分析表明,顶复门寄生虫中的某些大蛋白质含有对诸如抗原变异、红细胞入侵和免疫逃避等功能可能重要的结构域。然而,这些结构域并不局限于大蛋白质,也不是严格保守的。虽然其中一些蛋白质预计参与这些寄生虫内的传统代谢途径,但其他蛋白质则履行病原体 - 宿主相互作用、营养获取和整体生存的特殊功能。