Parry D A, North A C
Institute of Fundamental Sciences, Massey University, Palmerston North, New Zealand.
J Struct Biol. 1998;122(1-2):67-75. doi: 10.1006/jsbi.1998.3967.
The quantity of sequence data now available for both Type I and Type II hard alpha-keratin IF proteins makes it possible to analyze their N- and C-terminal domains and ascertain features of likely structural and/or functional importance. The N-terminal domains of both chain types can be divided into acidic (NA) and basic (NB) subdomains, where NA is 29 and 34 residues long, respectively, for Type I and II chains and is located immediately adjacent to the end of the rod domain. NB constitutes the remainder of the N-terminal domain and is about 27 and 70 residues long for the two chain types, respectively. The glycine residue contents, however, are high in NA(I) and NB(II), but low in NA(II) and NB(I). Subdomain NB(II) contains four consecutive nonapeptide quasirepeats of the form GGGFGYRSX. The C-terminal domain of Type I chains, termed C(I), is characterized by a PCX motif repeated 10 times, 7 of them contiguously. From an analysis of the conformation of like peptides from crystal structures it has been shown that this region will probably adopt a polyproline II left-handed helical structure with three residues per turn. In contrast, the C-terminal domain of Type II hard alpha-keratin chains (known as C(II)) contains a periodic distribution of hydrophobicities that, together with other predictive techniques, allow its conformation (a twisted four-stranded antiparallel beta-sheet) to be predicted with some degree of confidence. In addition, it is possible to suggest two partners with which this domain will interact. The first is with segment L12 in the rod domain and the second is with another C(II) domain in an antiparallel neighboring molecule. The latter possibility appears most likely. In either case the aggregation would likely serve to stabilize the molecular assembly through the interaction of two beta-sheets via their apolar faces and, in so doing, would position a number of cysteine residues in external positions that would allow them to form a number of covalent disulfide bonds with other molecules.
现在可获得的I型和II型硬α-角蛋白中间丝(IF)蛋白的序列数据量,使得分析它们的N端和C端结构域并确定可能具有结构和/或功能重要性的特征成为可能。两种链类型的N端结构域都可分为酸性(NA)和碱性(NB)亚结构域,其中对于I型和II型链,NA分别长29和34个残基,并且紧邻杆状结构域的末端。NB构成N端结构域的其余部分,两种链类型分别约为27和70个残基长。然而,甘氨酸残基含量在NA(I)和NB(II)中较高,但在NA(II)和NB(I)中较低。亚结构域NB(II)包含四个连续的九肽准重复序列,形式为GGGFGYRSX。I型链的C端结构域,称为C(I),其特征是PCX基序重复10次,其中7次连续重复。通过对晶体结构中类似肽的构象分析表明,该区域可能会形成每圈三个残基的聚脯氨酸II型左手螺旋结构。相比之下,II型硬α-角蛋白链的C端结构域(称为C(II))含有疏水性的周期性分布,与其他预测技术一起,可以在一定程度上自信地预测其构象(扭曲的四链反平行β-折叠)。此外,有可能提出该结构域将与之相互作用的两个伙伴。第一个是与杆状结构域中的L12片段相互作用,第二个是与反平行相邻分子中的另一个C(II)结构域相互作用。后一种可能性似乎最有可能。在任何一种情况下,聚集都可能通过两个β-折叠通过其非极性面的相互作用来稳定分子组装,并且这样做会使一些半胱氨酸残基位于外部位置,从而使它们能够与其他分子形成许多共价二硫键。