Mihalek I, Res I, Lichtarge O
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA.
Proteins. 2006 Apr 1;63(1):87-99. doi: 10.1002/prot.20866.
It has been noted that slowly evolving protein residues have two properties: (a) they tend to cluster in the native fold, and (b) they delineate functional surfaces-parts of the surface through which the protein interacts with other proteins or small ligands. Herein, we demonstrate that the two are coupled sufficiently strongly that one effect, when observed, statistically implies the other. Detection of both can be accomplished in multiple sequence alignment related methods by the careful selection of relevant sequences. For the demonstration, we use two sets of protein families: a small set of diverse proteins with diverse functional surfaces, and a large set of homodimerizing enzymes. A practical outcome of our considerations is a simple prescriptive rule for the selection of homologous sequences for the comparative analysis of proteins: in order to optimize the detection of (potentially unknown) functional surfaces, it is sufficient to select sequences in such a way that the residues observed at any level of evolutionary divergence, as implied by the alignment, cluster on the folded protein.
已经注意到,进化缓慢的蛋白质残基具有两个特性:(a)它们倾向于聚集在天然折叠结构中,并且(b)它们勾勒出功能表面——蛋白质与其他蛋白质或小配体相互作用的表面部分。在此,我们证明这两者的耦合足够强,以至于当观察到一种效应时,从统计学上意味着另一种效应。通过仔细选择相关序列,可以在多序列比对相关方法中实现对两者的检测。为了进行论证,我们使用两组蛋白质家族:一组是具有不同功能表面的少量多样蛋白质,另一组是大量的同二聚化酶。我们思考的一个实际结果是一条简单的规定性规则,用于选择用于蛋白质比较分析的同源序列:为了优化对(潜在未知的)功能表面的检测,以这样一种方式选择序列就足够了,即比对所暗示的在任何进化分歧水平上观察到的残基聚集在折叠的蛋白质上。