Das Rajdeep, Gerstein Mark
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA.
Proteins. 2004 May 1;55(2):455-63. doi: 10.1002/prot.10639.
We have introduced a method to identify functional shifts in protein families. Our method is based on the calculation of an active-site conservation ratio, which we call the "ASC ratio." For a structurally based alignment of a protein family, this ratio is the average sequence similarity of the active-site region compared to the full-length protein. The active-site region is defined as all the residues within a certain radius of the known functionally important groups. Using our method, we have analyzed enzymes of central metabolism from a large number of genomes (35). We found that for most of the enzymes, the active-site region is more highly conserved than the full-length sequence. However, for three tricarboxylic acid (TCA)-cycle enzymes, active-site sequences are considerably more diverged (than full-length ones). In particular, we were able to identify in six pathogens a novel isocitrate dehydrogenase that has very low sequence similarity around the active site. Detailed sequence-structure analysis indicates that while the active-site structure of isocitrate dehydrogenase is most likely similar between pathogens and nonpathogens, the unusual sequence divergence could result from an extra domain added at the N-terminus. This domain has a leucine-rich motif similar one in the Yersinia pestis cytotoxin and may therefore confer additional pathogenic functions.
我们引入了一种识别蛋白质家族功能转变的方法。我们的方法基于一种活性位点保守率的计算,我们将其称为“ASC比率”。对于基于结构的蛋白质家族比对,该比率是活性位点区域与全长蛋白质相比的平均序列相似性。活性位点区域被定义为已知功能重要基团一定半径内的所有残基。使用我们的方法,我们分析了来自大量基因组(35个)的中心代谢酶。我们发现,对于大多数酶来说,活性位点区域比全长序列保守性更高。然而,对于三种三羧酸(TCA)循环酶,活性位点序列的差异要大得多(与全长序列相比)。特别是,我们在六种病原体中鉴定出一种新型异柠檬酸脱氢酶,其活性位点周围的序列相似性非常低。详细的序列-结构分析表明,虽然病原体和非病原体之间异柠檬酸脱氢酶的活性位点结构很可能相似,但这种不寻常的序列差异可能是由于在N端添加了一个额外的结构域。该结构域具有与鼠疫耶尔森氏菌细胞毒素中类似的富含亮氨酸基序,因此可能赋予额外的致病功能。