Kapp O H, Moens L, Vanfleteren J, Trotman C N, Suzuki T, Vinogradov S N
Department of Radiology, University of Chicago, Illinois 60637, USA.
Protein Sci. 1995 Oct;4(10):2179-90. doi: 10.1002/pro.5560041024.
Seven-hundred globin sequences, including 146 nonvertebrate sequences, were aligned on the basis of conservation of secondary structure and the avoidance of gap penalties. Of the 182 positions needed to accommodate all the globin sequences, only 84 are common to all, including the absolutely conserved PheCD1 and HisF8. The mean number of amino acid substitutions per position ranges from 8 to 13 for all globins and 5 to 9 for internal positions. Although the total sequence volumes have a variation approximately 2-3%, the variation in volume per position ranges from approximately 13% for the internal to approximately 21% for the surface positions. Plausible correlations exist between amino acid substitution and the variation in volume per position for the 84 common and the internal but not the surface positions. The amino acid substitution matrix derived from the 84 common positions was used to evaluate sequence similarity within the globins and between the globins and phycocyanins C and colicins A, via calculation of pairwise similarity scores. The scores for globin-globin comparisons over the 84 common positions overlap the globin-phycocyanin and globin-colicin scores, with the former being intermediate. For the subset of internal positions, overlap is minimal between the three groups of scores. These results imply a continuum of amino acid sequences able to assume the common three-on-three alpha-helical structure and suggest that the determinants of the latter include sites other than those inaccessible to solvent.
基于二级结构的保守性和避免空位罚分,对包括146个非脊椎动物序列在内的700个珠蛋白序列进行了比对。在容纳所有珠蛋白序列所需的182个位置中,只有84个位置是所有序列共有的,包括绝对保守的苯丙氨酸CD1和组氨酸F8。所有珠蛋白每个位置的氨基酸替换平均数在8到13之间,内部位置的平均数在5到9之间。尽管总序列体积有大约2 - 3%的变化,但每个位置的体积变化范围从内部位置的大约13%到表面位置的大约21%。对于84个共有位置和内部位置(但不是表面位置),氨基酸替换与每个位置的体积变化之间存在合理的相关性。从84个共有位置推导得到的氨基酸替换矩阵,通过计算成对相似性得分,用于评估珠蛋白内部以及珠蛋白与藻蓝蛋白C和大肠杆菌素A之间的序列相似性。在84个共有位置上,珠蛋白 - 珠蛋白比较的得分与珠蛋白 - 藻蓝蛋白以及珠蛋白 - 大肠杆菌素的得分重叠,前者处于中间。对于内部位置的子集,三组得分之间的重叠最小。这些结果意味着能够呈现常见的三对三α - 螺旋结构的氨基酸序列是连续的,并表明后者的决定因素包括溶剂不可及位点以外的其他位点。