Ghoshdastider Umesh, Popp David, Burtnick Leslie D, Robinson Robert C
Institute of Molecular and Cell Biology, A*STAR (Agency for Science, Technology and Research), Biopolis, 138673, Singapore.
Cytoskeleton (Hoboken). 2013 Nov;70(11):775-95. doi: 10.1002/cm.21149. Epub 2013 Nov 8.
The gelsolin homology (GH) domain has been found to date exclusively in actin-binding proteins. In humans, three copies of the domain are present in CapG, five copies in supervillin, and six copies each in adseverin, gelsolin, flightless I and the villins: villin, advillin and villin-like protein. Caenorhabditis elegans contains a four-GH-domain protein, GSNL-1. These architectures are predicted to have arisen from gene triplication followed by gene duplication to result in the six-domain protein. The subsequent loss of one, two or three domains produced the five-, four-, and three-domain proteins, respectively. Here we conducted BLAST and hidden Markov based searches of UniProt and NCBI databases to identify novel gelsolin domain containing proteins. The variety in architectures suggests that the GH domain has been tested in many molecular constructions during evolution. Of particular note is flightless-like I protein (FLIIL1) from Entamoeba histolytica, which combines a leucine rich repeats (LRR) domain, seven GH domains, and a headpiece domain, thus combining many of the features of flightless I with those of villin or supervillin. As such, the GH domain superfamily appears to have developed along complex routes. The distribution of these proteins was analyzed in the 343 completely sequenced genomes, mapped onto the tree of life, and phylogenetic trees of the proteins were constructed to gain insight into their evolution. © 2013 Wiley Periodicals, Inc.
迄今为止,已发现凝溶胶蛋白同源(GH)结构域仅存在于肌动蛋白结合蛋白中。在人类中,CapG含有该结构域的三个拷贝, supervillin含有五个拷贝,而凝溶胶蛋白、切丝蛋白、无翅I蛋白以及绒毛蛋白家族(绒毛蛋白、前绒毛蛋白和类绒毛蛋白)各含有六个拷贝。秀丽隐杆线虫含有一种含四个GH结构域的蛋白GSNL-1。据推测,这些结构是由基因三倍化后再经基因复制产生了含六个结构域的蛋白,随后分别丢失一个、两个或三个结构域,从而产生了含五个、四个和三个结构域的蛋白。在此,我们通过对UniProt和NCBI数据库进行基于BLAST和隐马尔可夫模型的搜索,以鉴定含有新型凝溶胶蛋白结构域的蛋白。结构的多样性表明,在进化过程中,GH结构域已在许多分子结构中得到检验。特别值得注意的是,溶组织内阿米巴的类无翅I蛋白(FLIIL1)结合了富含亮氨酸重复序列(LRR)结构域、七个GH结构域和一个头部结构域,因此兼具了无翅I蛋白与绒毛蛋白或supervillin的许多特征。如此看来,GH结构域超家族似乎是沿着复杂的路径发展而来的。我们在343个完全测序的基因组中分析了这些蛋白的分布,将其映射到生命树上,并构建了这些蛋白的系统发育树,以深入了解它们的进化过程。© 2013威利期刊公司