Anantharaman Vivek, Aravind L
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
J Mol Biol. 2006 Feb 24;356(3):823-42. doi: 10.1016/j.jmb.2005.11.031. Epub 2005 Dec 1.
Evolution of diverse catalytic and ligand-binding activities in a given protein fold is a widely observed phenomenon in the protein-domain universe. However, the details of this evolutionary process, general principles, if any, and implications for origins of particular catalytic mechanisms are poorly understood in many common protein folds. Taking advantage of the wealth of currently available protein structure and sequence data, we explore these issues in the context of a large assemblage of biochemically diverse protein domains sharing a common origin, namely the sugar isomerases, translation factor eIF2B, ligand-binding domains of the DeoR-family transcription factors, acetyl-CoA transferases and methenyltetrahydrofolate synthetase. We show that in at least three independent instances, including the sugar-binding domains of the DeoR family transcription factors, this domain has been used as small molecule sensor coupled to helix-turn-helix DNA-binding domains. In at least two of these instances the domain functions as a non-catalytic sensor of ligands. We provide evidence that the ancestral version of this fold was a distinct version of the Rosmann-like folds, which probably possessed two distinct ligand-binding areas that were differentially utilized in different descendents. Analyzing the sequences and structures of proteins in this fold we show that there are two principal factors related to the origin of catalytic diversity in this fold. Firstly, specific inserts and extension added to the core domain on multiple occasions in evolution have affected the access to the active site regions, and thereby allowed for different substrates and allosteric regulators. The second major factor appears to be the emergence of considerable diversity of family-specific residues with important biochemical roles. Interestingly, proteins of this fold, which catalyze similar reactions on similar substrates, might possess very distinctive sets of active residues required for substrate binding catalysis. In particular, different sugar isomerases or acyl transferases in this fold might show distinct constellations of active site residues. These findings suggest that whereas ligand-binding, and even generic catalytic ability emerged early in the evolution of the fold, the specific catalytic mechanisms appear to have independently emerged on multiple occasions in the generic precursors of this fold.
在蛋白质结构域世界中,给定蛋白质折叠中多种催化和配体结合活性的进化是一种广泛观察到的现象。然而,在许多常见的蛋白质折叠中,这种进化过程的细节、一般原则(如果有的话)以及对特定催化机制起源的影响却知之甚少。利用当前丰富的蛋白质结构和序列数据,我们在大量具有共同起源、生物化学性质多样的蛋白质结构域的背景下探讨这些问题,这些蛋白质结构域包括糖异构酶、翻译因子eIF2B、DeoR家族转录因子的配体结合结构域、乙酰辅酶A转移酶和亚甲基四氢叶酸合成酶。我们表明,至少在三个独立的实例中,包括DeoR家族转录因子的糖结合结构域,该结构域已被用作与螺旋-转角-螺旋DNA结合结构域偶联的小分子传感器。在这些实例中的至少两个中,该结构域作为配体的非催化传感器发挥作用。我们提供的证据表明,这种折叠的祖先版本是罗斯曼样折叠的一个独特版本,它可能拥有两个不同的配体结合区域,在不同的后代中被不同地利用。分析该折叠中蛋白质的序列和结构,我们表明有两个与该折叠中催化多样性起源相关的主要因素。首先,在进化过程中多次添加到核心结构域的特定插入片段和延伸片段影响了对活性位点区域的 access,从而允许不同的底物和变构调节剂。第二个主要因素似乎是具有重要生化作用的家族特异性残基出现了相当大的多样性。有趣的是,催化相似底物上相似反应的该折叠的蛋白质可能拥有底物结合催化所需的非常独特的活性残基集。特别是,该折叠中的不同糖异构酶或酰基转移酶可能显示出活性位点残基的不同组合。这些发现表明,虽然配体结合甚至一般催化能力在该折叠的进化早期就出现了,但特定的催化机制似乎在该折叠的一般前体中多次独立出现。