USDA-ARS, Southern Regional Research Center, Commodity Utilization Research Unit, New Orleans, LA 70124, USA.
Plant J. 2011 Apr;66(1):143-60. doi: 10.1111/j.1365-313X.2011.04512.x.
In higher plants, the superfamily of carboxyl-CoA ligases and related proteins, collectively called acyl activating enzymes (AAEs), has evolved to provide enzymes for many pathways of primary and secondary metabolism and for the conjugation of hormones to amino acids. Across the superfamily there is only limited sequence similarity, but a series of highly conserved motifs, including the AMP-binding domain, make it easy to identify members. These conserved motifs are best understood in terms of the unique domain-rotation architecture that allows AAE enzymes to catalyze the two distinct steps of the CoA ligase reaction. Arabidopsis AAE sequences were used to identify the AAE gene families in the sequenced genomes of green algae, mosses, and trees; the size of the respective families increased with increasing degree of organismal cellular complexity, size, and generation time. Large-scale genome duplications and small-scale tandem gene duplications have contributed to AAE gene family complexity to differing extents in each of the multicellular species analyzed. Gene duplication and evolution of novel functions in Arabidopsis appears to have occurred rapidly, because acquisition of new substrate specificity is relatively easy in this class of proteins. Convergent evolution has also occurred between members of distantly related clades. These features of the AAE superfamily make it difficult to use homology searches and other genomics tools to predict enzyme function.
在高等植物中,羧基-CoA 连接酶和相关蛋白的超家族,统称为酰基激活酶(AAE),已经进化为许多初级和次级代谢途径以及激素与氨基酸结合提供酶。在整个超家族中,只有有限的序列相似性,但一系列高度保守的基序,包括 AMP 结合域,使得识别成员变得容易。这些保守基序最容易理解的是独特的结构域旋转架构,它允许 AAE 酶催化 CoA 连接酶反应的两个不同步骤。拟南芥 AAE 序列被用于鉴定已测序的绿藻、苔藓和树木基因组中的 AAE 基因家族;相应家族的大小随着生物体细胞复杂性、大小和世代时间的增加而增加。在分析的每种多细胞物种中,大规模基因组复制和小规模串联基因复制在不同程度上导致了 AAE 基因家族的复杂性。在拟南芥中,基因复制和新功能的进化似乎发生得很快,因为在这类蛋白质中获得新的底物特异性相对容易。远缘进化枝的成员之间也发生了趋同进化。AAE 超家族的这些特征使得使用同源搜索和其他基因组学工具来预测酶功能变得困难。