Kouprina Natalay, Mullokandov Michael, Rogozin Igor B, Collins N Keith, Solomon Greg, Otstot John, Risinger John I, Koonin Eugene V, Barrett J Carl, Larionov Vladimir
Laboratory of Biosystems and Cancer, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.
Proc Natl Acad Sci U S A. 2004 Mar 2;101(9):3077-82. doi: 10.1073/pnas.0308532100. Epub 2004 Feb 18.
Human sperm protein associated with the nucleus on the X chromosome (SPANX) genes comprise a gene family with five known members (SPANX-A1, -A2, -B, -C, and -D), encoding cancer/testis-specific antigens that are potential targets for cancer immunotherapy. These highly similar paralogous genes cluster on the X chromosome at Xq27. We isolated and sequenced primate genomic clones homologous to human SPANX. Analysis of these clones and search of the human genome sequence revealed an uncharacterized group of genes, SPANX-N, which are present in all primates as well as in mouse and rat. In humans, four SPANX-N genes comprise a series of tandem duplicates at Xq27; a fifth member of this subfamily is located at Xp11. Similarly to SPANX-A/D, human SPANX-N genes are expressed in normal testis and some melanoma cell lines; testis-specific expression of SPANX is also conserved in mouse. Analysis of the taxonomic distribution of the long and short forms of the intron indicates that SPANX-N is the ancestral form, from which the SPANX-A/D subfamily evolved in the common ancestor of the hominoid lineage. Strikingly, the coding sequences of the SPANX genes evolved much faster than the intron and the 5' untranslated region. There is a strong correlation between the rates of evolution of synonymous and nonsynonymous codon positions, both of which are accelerated 2-fold or more compared to the noncoding sequences. Thus, evolution of the SPANX family appears to have involved positive selection that affected not only the protein sequence but also the synonymous sites in the coding sequence.
与X染色体上的细胞核相关的人类精子蛋白(SPANX)基因构成了一个基因家族,已知有五个成员(SPANX-A1、-A2、-B、-C和-D),它们编码癌症/睾丸特异性抗原,这些抗原是癌症免疫治疗的潜在靶点。这些高度相似的旁系同源基因在X染色体的Xq27处成簇。我们分离并测序了与人类SPANX同源的灵长类基因组克隆。对这些克隆的分析以及对人类基因组序列的搜索揭示了一组未被表征的基因,即SPANX-N,它们存在于所有灵长类动物以及小鼠和大鼠中。在人类中,四个SPANX-N基因在Xq27处构成一系列串联重复;该亚家族的第五个成员位于Xp11。与SPANX-A/D类似,人类SPANX-N基因在正常睾丸和一些黑色素瘤细胞系中表达;SPANX在睾丸中的特异性表达在小鼠中也保守存在。对内含子的长、短形式的分类分布分析表明,SPANX-N是祖先形式,SPANX-A/D亚家族在类人猿谱系的共同祖先中从该形式进化而来。引人注目的是,SPANX基因的编码序列进化速度比内含子和5'非翻译区快得多。同义密码子位置和非同义密码子位置的进化速率之间存在很强的相关性,与非编码序列相比,这两者的进化速率都加快了2倍或更多。因此,SPANX家族的进化似乎涉及正选择,这种选择不仅影响蛋白质序列,还影响编码序列中的同义位点。