Parker M H, Hefford M A
Centre for Food and Animal Research, Agriculture and Agri-Food Canada, Ottawa, Ontario.
Protein Eng. 1997 May;10(5):487-96. doi: 10.1093/protein/10.5.487.
A statistical study was performed on a set of proteins which adopt the four-alpha-helical-bundle tertiary motif in order to determine amino acid occurrences at helix-capping and loop positions. Eight X-ray crystal structures from the Brookhaven Protein Data Bank (PDB) were examined and N", N', Ncap, Ccap, C' and C" residues were assigned. In addition, a set of 55 protein sequences for the analogous proteins from different strains and species was taken from the Protein Information Resource and Swiss-Prot databanks. The residues at the capping and loop positions in this expanded data set were deduced by aligning these sequences with those from the PDB files. Similar trends were observed in the two data sets. In general, polar residues were predominant in the loops, although aromatic residues were also fairly common. Glycine, a highly flexible residue with an excellent 'helix-breaking' ability, was very common at the Ccap, C' and C" residues. Proline, which can force sharp turns in the direction of a peptide backbone, was only common at the N" residue. Residues which can participate in the N-capping box motif were found with high frequency. Capping motifs at the helix C-termini (Schellman and alphaL motifs) were also somewhat common, while another helix N-terminal stabilizing motif, the hydrophobic stable, was not common. The data presented in this study should prove useful for applying the 'consensus residue' approach to the de novo design of loop regions in helical bundle proteins.
对一组采用四α螺旋束三级基序的蛋白质进行了统计研究,以确定螺旋封端和环位置处的氨基酸出现情况。检查了来自布鲁克海文蛋白质数据库(PDB)的八个X射线晶体结构,并指定了N"、N'、Ncap、Ccap、C'和C"残基。此外,从蛋白质信息资源库和瑞士蛋白质数据库中获取了一组来自不同菌株和物种的55个类似蛋白质的蛋白质序列。通过将这些序列与PDB文件中的序列比对,推断出这个扩展数据集中封端和环位置处的残基。在这两个数据集中观察到了相似的趋势。一般来说,极性残基在环中占主导地位,尽管芳香族残基也相当常见。甘氨酸是一种具有高度灵活性且具有出色“螺旋破坏”能力的残基,在Ccap、C'和C"残基处非常常见。脯氨酸能够使肽主链方向发生急剧转折,仅在N"残基处常见。发现能够参与N封端盒基序的残基频率很高。螺旋C末端的封端基序(Schellman基序和alphaL基序)也较为常见,而另一种螺旋N末端稳定基序,即疏水稳定基序,则不常见。本研究中呈现的数据对于将“共有残基”方法应用于螺旋束蛋白中环区域的从头设计应是有用的。