Aronson H E, Royer W E, Hendrickson W A
Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York 10032.
Protein Sci. 1994 Oct;3(10):1706-11. doi: 10.1002/pro.5560031009.
The globin family of protein structures was the first for which it was recognized that tertiary structure can be highly conserved even when primary sequences have diverged to a virtually undetectable level of similarity. This principle of structural inertia in molecular evolution is now evident for many other protein families. We have performed a systematic comparison of the sequences and structures of 6 representative hemoglobin subunits as diverse in origin as plants, clams, and humans. Our analysis is based on a 97-residue helical core in common to all 6 structures. Amino acid sequence identities range from 12.4% to 42.3% in pairwise comparisons, and, despite these variations, the maximal RMS deviation in alpha-carbon positions is 3.02 A. Overall, sequence similarity and structural deviation are significantly anticorrelated, with a correlation coefficient of -0.71, but for a set of structures having under 20% pairwise identity, this anticorrelation falls to -0.38, which emphasizes the weak connection between a specific sequence and the tertiary fold. There is substantial variability in structure outside the helical core, and functional characteristics of these globins also differ appreciably. Nevertheless, despite variations in detail that the sequence dissimilarities and functional differences imply, the core structures of these globins remain remarkably preserved.
蛋白质结构的珠蛋白家族是第一个被认识到即使一级序列已经分化到几乎无法检测到的相似水平,三级结构仍可高度保守的家族。分子进化中这种结构惯性的原理现在在许多其他蛋白质家族中也很明显。我们对6种具有代表性的血红蛋白亚基的序列和结构进行了系统比较,它们的来源各不相同,包括植物、蛤和人类。我们的分析基于所有6种结构共有的一个97个残基的螺旋核心。在两两比较中,氨基酸序列同一性范围从12.4%到42.3%,尽管存在这些差异,但α-碳位置的最大均方根偏差为3.02埃。总体而言,序列相似性和结构偏差显著负相关,相关系数为-0.71,但对于一组两两同一性低于20%的结构,这种负相关降至-0.38,这强调了特定序列与三级折叠之间的弱联系。螺旋核心之外的结构存在很大的变异性,这些珠蛋白的功能特性也有明显差异。然而,尽管序列差异和功能差异所暗示的细节存在变化,但这些珠蛋白的核心结构仍然显著保留。