Department of Biological Sciences, George Washington University, Washington, DC, United States.
Front Immunol. 2021 Nov 15;12:744783. doi: 10.3389/fimmu.2021.744783. eCollection 2021.
The generation of large immune gene families is often driven by evolutionary pressure exerted on host genomes by their pathogens, which has been described as the immunological arms race. The () gene family from the California purple sea urchin, , is upregulated upon immune challenge and encodes the SpTrf proteins that interact with pathogens during an immune response. Native SpTrf proteins bind both bacteria and yeast, and augment phagocytosis of a marine , while a recombinant SpTrf protein (rSpTrf-E1) binds a subset of pathogens and a range of pathogen associated molecular patterns. In the sequenced sea urchin genome, there are four gene clusters for a total of 17 genes. Here, we report an in-depth analysis of these genes to understand the sequence complexities of this family, its genomic structure, and to derive a putative evolutionary history for the formation of the gene clusters. We report a detailed characterization of gene structure including the intron type and UTRs with conserved transcriptional start sites, the start codon and multiple stop codons, and locations of polyadenylation signals. Phylogenetic and percent mismatch analyses of the genes and the intergenic regions allowed us to predict the last common ancestral gene and a theoretical evolutionary history of the gene family. The appearance of the gene clusters from the theoretical ancestral gene may have been driven by multiple duplication and deletion events of regions containing genes. Duplications and ectopic insertion events, indels, and point mutations in the exons likely resulted in the extant genes and family structure. This theoretical evolutionary history is consistent with the involvement of these genes in the arms race in responses to pathogens and suggests that the diversification of these genes and their encoded proteins have been selected for based on the survival benefits of pathogen binding and host protection.
大型免疫基因家族的产生通常是由病原体对宿主基因组施加的进化压力驱动的,这被描述为免疫军备竞赛。加利福尼亚紫海胆的 () 基因家族,在免疫挑战时上调,并编码在免疫反应中与病原体相互作用的 SpTrf 蛋白。天然的 SpTrf 蛋白既能结合细菌又能结合酵母,增强海洋原生动物的吞噬作用,而重组 SpTrf 蛋白(rSpTrf-E1)则能结合一部分病原体和一系列与病原体相关的分子模式。在测序的海胆基因组中,有四个 基因簇,共有 17 个基因。在这里,我们对这些基因进行了深入分析,以了解该家族的序列复杂性、基因组结构,并推导出基因簇形成的可能进化历史。我们报告了对基因结构的详细特征描述,包括内含子类型和具有保守转录起始位点的 UTRs、起始密码子和多个终止密码子,以及多聚腺苷酸化信号的位置。基因和基因间区的系统发育和百分比不匹配分析使我们能够预测最后一个共同祖先 基因,并预测基因家族的理论进化历史。从理论上祖先基因出现的基因簇可能是由包含 基因的区域的多次重复和缺失事件驱动的。基因和家族结构的重复、异位插入事件、插入缺失和外显子点突变可能是由这些基因的存在导致的。这种理论进化历史与这些基因在对病原体的军备竞赛中的参与一致,并表明这些基因及其编码蛋白的多样化是基于病原体结合和宿主保护的生存益处而被选择的。