Aravind L, Koonin E V
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
Genome Biol. 2001;2(3):RESEARCH0007. doi: 10.1186/gb-2001-2-3-research0007. Epub 2001 Feb 19.
Protein fold recognition using sequence profile searches frequently allows prediction of the structure and biochemical mechanisms of proteins with an important biological function but unknown biochemical activity. Here we describe such predictions resulting from an analysis of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenases, a class of enzymes that are widespread in eukaryotes and bacteria and catalyze a variety of reactions typically involving the oxidation of an organic substrate using a dioxygen molecule.
We employ sequence profile analysis to show that the DNA repair protein AlkB, the extracellular matrix protein leprecan, the disease-resistance-related protein EGL-9 and several uncharacterized proteins define novel families of enzymes of the 2OG-Fe(II) oxygenase superfamily. The identification of AlkB as a member of the 2OG-Fe(II) oxygenase superfamily suggests that this protein catalyzes oxidative detoxification of alkylated bases. More distant homologs of AlkB were detected in eukaryotes and in plant RNA viruses, leading to the hypothesis that these proteins might be involved in RNA demethylation. The EGL-9 protein from Caenorhabditis elegans is necessary for normal muscle function and its inactivation results in resistance against paralysis induced by the Pseudomonas aeruginosa toxin. EGL-9 and leprecan are predicted to be novel protein hydroxylases that might be involved in the generation of substrates for protein glycosylation.
Here, using sequence profile searches, we show that several previously undetected protein families contain 2OG-Fe(II) oxygenase fold. This allows us to predict the catalytic activity for a wide range of biologically important, but biochemically uncharacterized proteins from eukaryotes and bacteria.
利用序列谱搜索进行蛋白质折叠识别常常能够预测具有重要生物学功能但生化活性未知的蛋白质的结构和生化机制。在此,我们描述了对2-氧代戊二酸(2OG)和依赖Fe(II)的加氧酶进行分析所得到的此类预测结果,这是一类在真核生物和细菌中广泛存在的酶,催化多种反应,通常涉及利用双氧分子氧化有机底物。
我们采用序列谱分析表明,DNA修复蛋白AlkB、细胞外基质蛋白leprecan、抗病相关蛋白EGL-9以及几种未鉴定的蛋白质定义了2OG-Fe(II)加氧酶超家族的新酶家族。将AlkB鉴定为2OG-Fe(II)加氧酶超家族的成员表明该蛋白催化烷基化碱基的氧化解毒。在真核生物和植物RNA病毒中检测到了AlkB的更远亲同源物,从而提出这些蛋白质可能参与RNA去甲基化的假说。秀丽隐杆线虫的EGL-9蛋白对正常肌肉功能是必需的,其失活导致对铜绿假单胞菌毒素诱导的麻痹具有抗性。预计EGL-9和leprecan是新型蛋白质羟化酶,可能参与蛋白质糖基化底物的生成。
在此,我们利用序列谱搜索表明,几个先前未检测到的蛋白质家族含有2OG-Fe(II)加氧酶折叠。这使我们能够预测来自真核生物和细菌的一系列生物学上重要但生化特性未明确的蛋白质的催化活性。