Zhang Yong, Lu Shujuan, Zhao Shuqi, Zheng Xiaofeng, Long Manyuan, Wei Liping
Center for Bioinformatics, National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University, Beijing, 100871, PR China.
BMC Evol Biol. 2009 Oct 15;9:252. doi: 10.1186/1471-2148-9-252.
New genes generated by retroposition are widespread in humans and other mammalian species. Usually, this process copies a single parental gene and inserts it into a distant genomic location. However, retroposition of two adjacent parental genes, i.e. co-retroposition, had not been reported until the hominoid chimeric gene, PIPSL, was identified recently. It was shown how two genes linked in tandem (phosphatidylinositol-4-phosphate 5-kinase, type I, alpha, PIP5K1A and proteasome 26S subunit, non-ATPase, 4, PSMD4) could be co-retroposed from a single RNA molecule to form this novel chimeric gene. However, understanding of the origination and biological function of PIPSL requires determination of the coding potential of this gene as well as the evolutionary forces acting on its hominoid copies.
We tackled these problems by analyzing the evolutionary signature in both within-species variation and between species divergence in the sequence and structure of the gene. We revealed a significant evolutionary signature: the coding region has significantly lower sequence variation, especially insertions and deletions, suggesting that the human copy may encode a protein. Moreover, a survey across five different hominoid species revealed that all adaptive changes of PSMD4-derived regions occurred on branches leading to human and chimp rather than other hominoid lineages. Finally, computational analysis suggests testis-specific transcription of PIPSL is regulated by tissue-dependent methylation rather than some transcriptional leakage.
Therefore, this set of analyses showed that PIPSL is an extraordinary co-retroposed protein-coding gene that may participate in the male functions of humans and its close relatives.
逆转座产生的新基因在人类和其他哺乳动物物种中广泛存在。通常,这个过程会复制单个亲本基因并将其插入到基因组的远处位置。然而,直到最近发现类人猿嵌合基因PIPSL,两个相邻亲本基因的逆转座,即共逆转座,才被报道。研究表明,两个串联连接的基因(I型磷脂酰肌醇-4-磷酸5-激酶α,PIP5K1A和蛋白酶体26S亚基非ATP酶4,PSMD4)如何从单个RNA分子共逆转座形成这个新的嵌合基因。然而,要了解PIPSL的起源和生物学功能,需要确定该基因的编码潜力以及作用于其类人猿拷贝的进化力量。
我们通过分析该基因序列和结构在种内变异和种间差异中的进化特征来解决这些问题。我们揭示了一个显著的进化特征:编码区的序列变异,特别是插入和缺失,显著降低,这表明人类拷贝可能编码一种蛋白质。此外,对五个不同类人猿物种的调查显示,PSMD4衍生区域的所有适应性变化都发生在导致人类和黑猩猩的分支上,而不是其他类人猿谱系。最后,计算分析表明,PIPSL的睾丸特异性转录受组织依赖性甲基化调控,而不是一些转录渗漏。
因此,这一系列分析表明,PIPSL是一个非凡的共逆转座蛋白质编码基因,可能参与人类及其近亲的雄性功能。