Koonin E V, Tatusov R L, Galperin M Y
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.
Curr Opin Struct Biol. 1998 Jun;8(3):355-63. doi: 10.1016/s0959-440x(98)80070-5.
Computer analysis of complete prokaryotic genomes shows that microbial proteins are in general highly conserved--approximately 70% of them contain ancient conserved regions. This allows us to delineate families of orthologs across a wide phylogenetic range and, in many cases, predict protein functions with considerable precision. Sequence database searches using newly developed, sensitive algorithms result in the unification of such orthologous families into larger superfamilies sharing common sequence motifs. For many of these superfamilies, prediction of the structural fold and specific amino acid residues involved in enzymatic catalysis is possible. Taken together, sequence and structure comparisons provide a powerful methodology that can successfully complement traditional experimental approaches.
对完整原核生物基因组的计算机分析表明,微生物蛋白质总体上高度保守——其中约70%含有古老的保守区域。这使我们能够在广泛的系统发育范围内描绘直系同源基因家族,并且在许多情况下,能够相当精确地预测蛋白质功能。使用新开发的灵敏算法进行序列数据库搜索,可将这些直系同源基因家族统一为共享共同序列基序的更大超家族。对于许多这样的超家族,可以预测其结构折叠以及参与酶催化的特定氨基酸残基。总之,序列和结构比较提供了一种强大的方法,能够成功地补充传统的实验方法。