Bharathan G, Janssen B J, Kellogg E A, Sinha N
Section of Plant Biology, University of California, Davis, CA 95616, USA.
Proc Natl Acad Sci U S A. 1997 Dec 9;94(25):13749-53. doi: 10.1073/pnas.94.25.13749.
Homeodomain proteins are transcription factors that play a critical role in early development in eukaryotes. These proteins previously have been classified into numerous subgroups whose phylogenetic relationships are unclear. Our phylogenetic analysis of representative eukaryotic sequences suggests that there are two major groups of homeodomain proteins, each containing sequences from angiosperms, metazoa, and fungi. This result, based on parsimony and neighbor-joining analyses of primary amino acid sequences, was supported by two additional features of the proteins. The two protein groups are distinguished by an insertion/deletion in the homeodomain, between helices I and II. In addition, an amphipathic alpha-helical secondary structure in the region N terminal of the homeodomain is shared by angiosperm and metazoan sequences in one group. These results support the hypothesis that there was at least one duplication of homeobox genes before the origin of angiosperms, fungi, and metazoa. This duplication, in turn, suggests that these proteins had diverse functions early in the evolution of eukaryotes. The shared secondary structure in angiosperm and metazoan sequences points to an ancient conserved functional domain.
同源结构域蛋白是转录因子,在真核生物的早期发育中起关键作用。这些蛋白以前被分为众多亚组,其系统发育关系尚不清楚。我们对代表性真核生物序列的系统发育分析表明,同源结构域蛋白有两大组,每组都包含来自被子植物、后生动物和真菌的序列。基于对一级氨基酸序列的简约分析和邻接法分析得出的这一结果,得到了这些蛋白的另外两个特征的支持。这两个蛋白组通过同源结构域中螺旋I和螺旋II之间的插入/缺失来区分。此外,一组中被子植物和后生动物序列在同源结构域N端区域具有两亲性α-螺旋二级结构。这些结果支持了这样一种假说,即在被子植物、真菌和后生动物起源之前,同源异型框基因至少发生过一次复制。这种复制反过来表明,这些蛋白在真核生物进化早期就具有多种功能。被子植物和后生动物序列中共享的二级结构指向一个古老的保守功能域。