Khuri S, Bakker F T, Dunwell J M
School of Plant Sciences, University of Reading, Reading, England.
Mol Biol Evol. 2001 Apr;18(4):593-605. doi: 10.1093/oxfordjournals.molbev.a003840.
The cupin superfamily is a group of functionally diverse proteins that are found in all three kingdoms of life, Archaea, Eubacteria, and Eukaryota. These proteins have a characteristic signature domain comprising two histidine- containing motifs separated by an intermotif region of variable length. This domain consists of six beta strands within a conserved beta barrel structure. Most cupins, such as microbial phosphomannose isomerases (PMIs), AraC- type transcriptional regulators, and cereal oxalate oxidases (OXOs), contain only a single domain, whereas others, such as seed storage proteins and oxalate decarboxylases (OXDCs), are bi-cupins with two pairs of motifs. Although some cupins have known functions and have been characterized at the biochemical level, the majority are known only from gene cloning or sequencing projects. In this study, phylogenetic analyses were conducted on the conserved domain to investigate the evolution and structure/function relationships of cupins, with an emphasis on single- domain plant germin-like proteins (GLPs). An unrooted phylogeny of cupins from a wide spectrum of evolutionary lineages identified three main clusters, microbial PMIs, OXDCs, and plant GLPs. The sister group to the plant GLPs in the global analysis was then used to root a phylogeny of all available plant GLPs. The resulting phylogeny contained three main clades, classifying the GLPs into distinct subfamilies. It is suggested that these subfamilies correlate with functional categories, one of which contains the bifunctional barley germin that has both OXO and superoxide dismutase (SOD) activity. It is proposed that GLPs function primarily as SODs, enzymes that protect plants from the effects of oxidative stress. Closer inspection of the DNA sequence encoding the intermotif region in plant GLPs showed global conservation of thymine in the second codon position, a character associated with hydrophobic residues. Since many of these proteins are multimeric and enzymatically inactive in their monomeric state, this conservation of hydrophobicity is thought to be associated with the need to maintain the various monomer- monomer interactions. The type of structure-based predictive analysis presented in this paper is an important approach for understanding gene function and evolution in an era when genomes from a wide range of organisms are being sequenced at a rapid rate.
“cupin超家族”是一组功能多样的蛋白质,存在于古细菌、真细菌和真核生物这三个生命王国中。这些蛋白质具有一个特征性的结构域,由两个含组氨酸的基序组成,中间被一个长度可变的基序间区域隔开。该结构域在保守的β桶结构内由六条β链组成。大多数cupin蛋白,如微生物磷酸甘露糖异构酶(PMI)、AraC型转录调节因子和谷物草酸氧化酶(OXO),只包含一个结构域,而其他一些蛋白,如种子储存蛋白和草酸脱羧酶(OXDC),则是具有两对基序的双cupin蛋白。虽然一些cupin蛋白具有已知功能并已在生化水平上得到表征,但大多数仅在基因克隆或测序项目中被知晓。在本研究中,对保守结构域进行了系统发育分析,以研究cupin蛋白的进化以及结构/功能关系,重点是单结构域植物类萌发素蛋白(GLP)。对来自广泛进化谱系的cupin蛋白进行的无根系统发育分析确定了三个主要聚类,即微生物PMI、OXDC和植物GLP。然后,在全局分析中植物GLP的姐妹组被用于为所有可用植物GLP的系统发育树确定根。所得的系统发育树包含三个主要分支,将GLP分类为不同的亚家族。有人认为这些亚家族与功能类别相关,其中一个亚家族包含具有OXO和超氧化物歧化酶(SOD)活性的双功能大麦萌发素。有人提出GLP主要作为SOD发挥作用,SOD是一种保护植物免受氧化应激影响的酶。对植物GLP中编码基序间区域的DNA序列进行更仔细的检查发现,第二个密码子位置的胸腺嘧啶具有全局保守性,这一特征与疏水残基相关。由于这些蛋白质中的许多是多聚体,并且在单体状态下没有酶活性,因此这种疏水性的保守被认为与维持各种单体 - 单体相互作用的需要有关。在一个广泛生物体的基因组正在快速测序的时代,本文提出的基于结构的预测分析类型是理解基因功能和进化的重要方法。