Seixas Susana, Suriano Gianpaolo, Carvalho Filipa, Seruca Raquel, Rocha Jorge, Di Rienzo Anna
Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal.
Mol Biol Evol. 2007 Feb;24(2):587-98. doi: 10.1093/molbev/msl187. Epub 2006 Nov 29.
The superfamily of serine protease inhibitors (SERPINs) plays a key role in controlling the activity of proteinases in diverse biological processes. alpha1-antitrypsin (SERPINA1), the most studied member of this family, is encoded by a gene located within the proximal 14q32.1 SERPIN subcluster, together with the highly homologous alpha1-antitrypsin-like sequence (SERPINA2), which was previously proposed to be a pseudogene. Here, we performed a resequencing study encompassing both SERPINA1 and SERPINA2 as well as the adjacent gene coding for corticosteroid-binding globulin (SERPINA6) in samples from Europe and West Africa. In the African sample, we found that a common haplotype carrying a 2-kb deletion in the SERPINA2 gene is associated with remarkable long-range homozygozity as if it was quickly driven to high frequency by natural selection acting on an advantageous variant. An analysis of the HapMap Phase I data for the Yoruba sample confirmed that variation in this subcluster carries a strong signal of positive selection. We also show that the SERPINA2 gene is expressed and probably encodes a functional SERPIN. Finally, comparisons with orthologous sequences in nonhuman primates showed that SERPINA2 is present in some great apes, but in chimpanzees it was lost by a deletion event independent from that observed in humans. In agreement with the "less is more" hypothesis, we propose that loss of SERPINA2 is an ongoing process associated with a selective advantage during recent primate evolution, possibly because of a role in fertility or in host-pathogen interactions.
丝氨酸蛋白酶抑制剂超家族(SERPINs)在多种生物过程中控制蛋白酶活性方面发挥着关键作用。α1-抗胰蛋白酶(SERPINA1)是该家族中研究最多的成员,由位于14q32.1近端SERPIN亚簇内的一个基因编码,与高度同源的α1-抗胰蛋白酶样序列(SERPINA2)一起,该序列先前被认为是一个假基因。在这里,我们对来自欧洲和西非的样本进行了一项重测序研究,涵盖了SERPINA1和SERPINA2以及编码皮质类固醇结合球蛋白的相邻基因(SERPINA6)。在非洲样本中,我们发现SERPINA2基因中携带2 kb缺失的一个常见单倍型与显著的长程纯合性相关,就好像它是由作用于一个有利变异的自然选择迅速驱动到高频的。对约鲁巴样本的HapMap一期数据的分析证实,该亚簇中的变异带有强烈的正选择信号。我们还表明SERPINA2基因是表达的,并且可能编码一种功能性的丝氨酸蛋白酶抑制剂。最后,与非人类灵长类动物的直系同源序列比较表明,SERPINA2在一些大猩猩中存在,但在黑猩猩中它因一个与人类中观察到的缺失事件无关的缺失而丢失。与“少即是多”假说一致,我们提出SERPINA2的丢失是一个与近期灵长类动物进化过程中的选择优势相关的持续过程,可能是因为其在生育或宿主-病原体相互作用中的作用。