Schwartz R, Istrail S, King J
Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.
Protein Sci. 2001 May;10(5):1023-31. doi: 10.1110/ps.33201.
Patterns of hydrophobic and hydrophilic residues play a major role in protein folding and function. Long, predominantly hydrophobic strings of 20-22 amino acids each are associated with transmembrane helices and have been used to identify such sequences. Much less attention has been paid to hydrophobic sequences within globular proteins. In prior work on computer simulations of the competition between on-pathway folding and off-pathway aggregate formation, we found that long sequences of consecutive hydrophobic residues promoted aggregation within the model, even controlling for overall hydrophobic content. We report here on an analysis of the frequencies of different lengths of contiguous blocks of hydrophobic residues in a database of amino acid sequences of proteins of known structure. Sequences of three or more consecutive hydrophobic residues are found to be significantly less common in actual globular proteins than would be predicted if residues were selected independently. The result may reflect selection against long blocks of hydrophobic residues within globular proteins relative to what would be expected if residue hydrophobicities were independent of those of nearby residues in the sequence.
疏水残基和亲水残基的模式在蛋白质折叠和功能中起着重要作用。每段由20 - 22个氨基酸组成的、主要为疏水的长链与跨膜螺旋相关联,并已被用于识别此类序列。球状蛋白内的疏水序列受到的关注要少得多。在之前关于蛋白质折叠途径与非折叠途径聚集体形成之间竞争的计算机模拟工作中,我们发现即使控制了总体疏水含量,连续疏水残基的长序列仍会促进模型内聚集体的形成。我们在此报告对已知结构蛋白质氨基酸序列数据库中不同长度连续疏水残基块频率的分析。与独立选择残基时的预测情况相比,在实际球状蛋白中发现三个或更多连续疏水残基的序列明显不那么常见。该结果可能反映了相对于残基疏水性与序列中附近残基的疏水性无关时的预期情况,球状蛋白内对长疏水残基块的选择作用。