State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Pharmaceutical Functional Genes, College of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China.
BMC Evol Biol. 2010 Oct 13;10:306. doi: 10.1186/1471-2148-10-306.
The vertebrate tetraspanin family has many features which make it suitable for preserving the imprint of ancient sequence evolution and amenable for phylogenomic analysis. So we believe that an in-depth analysis of the tetraspanin evolution not only provides more complete understanding of tetraspanin biology, but offers new insights into the influence of the two rounds of whole genome duplication (2R-WGD) at the origin of vertebrates.
A detailed phylogeny of vertebrate tetraspanins was constructed by using multiple lines of information, including sequence-based phylogenetics, key structural features, intron configuration and genomic synteny. In particular, a total of 38 modern tetraspanin ortholog lineages in bony vertebrates have been identified and subsequently classified into 17 ancestral lineages existing before 2R-WGD. Based on this phylogeny, we found that the ohnolog retention rate of tetraspanins after 2R-WGD was three times as the average (a rate similar to those of transcription factors and protein kinases). This high rate didn't increase the tetrapanin family size, but changed the family composition, possibly by displacing vertebrate-specific gene lineages with the lineages conserved across deuterostomes. We also found that the period from 2R-WGD to recent time is controlled by gene losses. Meanwhile, positive selection has been detected on 80% of the branches right after 2R-WGDs, which declines significantly on both magnitude and extensity on the following speciation branches. Notably, the loss of mammalian RDS2 is accompanied by strong positive selection on mammalian ROM1, possibly due to gene loss-induced compensatory evolution.
First, different from transcription factors and kinases, high duplicate retention rate after 2R-WGD didn't increase the tetraspanin family size but just reshaped the family composition. Second, the evolution of tetraspanins right after 2R-WGD had been impacted by a massive wave of gene loss and positive selection on coding sequences. Third, the lingering effect of 2R-WGD on tetraspanin gene loss and positive selection might last for 300-400 million years.
脊椎动物四跨膜蛋白家族具有许多特征,使其适合于保存古老序列进化的印记,并适用于系统发生基因组分析。因此,我们认为对四跨膜蛋白进化的深入分析不仅提供了对四跨膜蛋白生物学更全面的理解,而且为在脊椎动物起源时两轮全基因组复制(2R-WGD)的影响提供了新的见解。
通过使用多种信息,包括基于序列的系统发生学、关键结构特征、内含子配置和基因组同线性,构建了脊椎动物四跨膜蛋白的详细系统发育。特别是,在硬骨鱼类中总共鉴定出 38 个现代四跨膜蛋白直系同源物谱系,并随后将其分类为 2R-WGD 之前存在的 17 个祖先谱系。基于该系统发育,我们发现 2R-WGD 后四跨膜蛋白的同源保留率是平均值的三倍(与转录因子和蛋白激酶的保留率相似)。这种高保留率并没有增加四跨膜蛋白家族的规模,而是改变了家族组成,可能通过用后生动物保守的谱系取代脊椎动物特异性基因谱系来实现。我们还发现,从 2R-WGD 到最近的时间是由基因缺失控制的。同时,在 2R-WGD 之后的 80%的分支上检测到了正选择,而在随后的物种形成分支上,正选择的幅度和广度都显著下降。值得注意的是,哺乳动物 RDS2 的丢失伴随着哺乳动物 ROM1 上强烈的正选择,这可能是由于基因丢失诱导的补偿进化。
首先,与转录因子和激酶不同,2R-WGD 后高的重复保留率并没有增加四跨膜蛋白家族的规模,而是重塑了家族组成。其次,2R-WGD 后四跨膜蛋白的进化受到了大量基因丢失和编码序列正选择的影响。第三,2R-WGD 对四跨膜蛋白基因丢失和正选择的挥之不去的影响可能持续 3 亿至 4 亿年。