Mortlock D P, Sateesh P, Innis J W
Department of Human Genetics, University of Michigan Medical School, Ann Arbor 48104-0618, USA.
Mamm Genome. 2000 Feb;11(2):151-8. doi: 10.1007/s003350010029.
While the the role of the homeodomain in HOX function has been evaluated extensively, little attention has been given to the non-homeodomain portions of the HOX proteins. To investigate the evolution of the HOXA13 protein and to identify conserved residues in the N-terminal region of the protein with potential functional significance, N-terminal Hoxa13 coding sequences were PCR-amplified from fish, amphibian, reptile, chicken, and marsupial and eutherian mammal genomic DNA. Compared with fish HOXA13, the mammalian protein has increased in size by 35% primarily owing to the accumulation of alanine repeats and flanking segments rich in proline, glycine, or serine within the first 215 amino acids. Certain residues and amino acid motifs were strongly conserved, and several HOXA13 N-terminal domains were also shared in the paralogous HOXB 13 and HOXD13 genes; however, other conserved regions appear to be unique to HOXA13. Two domains highly conserved in HOXA13 orthologs are shared with Drosophila AbdB and other vertebrate AbdB-like proteins. Marsupial and eutherian mammalian HOXA13 proteins have three large homopolymeric alanine repeats of 14, 12, and 17-18 residues that are absent in reptiles, birds, and fish. Thus, the repeats arose after the divergence of reptiles from the lineage that would give rise to the mammals. In contrast, other short homopolymeric alanine repeats in mammalian HOXA13 have remained virtually the same length, suggesting that forces driving or limiting repeat expansion are context dependent. Consecutive stretches of identical third-base usage in alanine codons within the large repeats were found, supporting replication slippage as a mechanism for their generation. However, numerous species-specific base substitutions affecting third-base alanine repeat codon positions were observed, particularly in the largest repeat. Therefore, if the large alanine repeats were present prior to eutherian mammal development as is suggested by the opossum data, then a dynamic process of recurring replication slippage and point mutation within alanine repeat codons must be considered to reconcile these observations. This model might also explain why the alanine repeats are flanked by proline, serine, and glycine-rich sequences, and it reveals a biological mechanism that promotes increases in protein size and, potentially, acquisition of new functions.
虽然同源结构域在HOX功能中的作用已得到广泛评估,但HOX蛋白的非同源结构域部分却很少受到关注。为了研究HOXA13蛋白的进化,并确定该蛋白N端区域具有潜在功能意义的保守残基,从鱼类、两栖类、爬行类、鸡、有袋类和真兽类哺乳动物的基因组DNA中PCR扩增出N端Hoxa13编码序列。与鱼类HOXA13相比,哺乳动物的该蛋白大小增加了35%,主要是由于在前215个氨基酸内丙氨酸重复序列以及富含脯氨酸、甘氨酸或丝氨酸的侧翼片段的积累。某些残基和氨基酸基序高度保守,并且几个HOXA13 N端结构域在旁系同源的HOXB 13和HOXD13基因中也有共享;然而,其他保守区域似乎是HOXA13所特有的。在HOXA13直系同源物中高度保守的两个结构域与果蝇AbdB和其他脊椎动物AbdB样蛋白共享。有袋类和真兽类哺乳动物的HOXA13蛋白有三个大的同聚丙氨酸重复序列,分别为14、12和17 - 18个残基,而在爬行类、鸟类和鱼类中不存在。因此,这些重复序列是在爬行类从产生哺乳动物的谱系中分化出来之后出现的。相比之下,哺乳动物HOXA13中的其他短同聚丙氨酸重复序列长度几乎保持不变,这表明驱动或限制重复序列扩展 的力量取决于具体情况。在大的重复序列内发现丙氨酸密码子连续出现相同的第三位碱基使用情况,支持复制滑动是其产生的一种机制。然而,观察到许多影响第三位碱基丙氨酸重复密码子位置的物种特异性碱基替换,特别是在最大的重复序列中。因此,如果负鼠数据表明大的丙氨酸重复序列在真兽类哺乳动物发育之前就已存在,那么必须考虑丙氨酸重复密码子内反复出现的复制滑动和点突变的动态过程来解释这些观察结果。这个模型也可能解释为什么丙氨酸重复序列两侧是富含脯氨酸、丝氨酸和甘氨酸的序列,并且它揭示了一种促进蛋白质大小增加以及潜在地获得新功能的生物学机制。