Huang Shao-Wei, Hwang Jenn-Kang
Institute of Bioinformatics, National Chiao Tung University, Taiwan, Republic of China.
Proteins. 2005 Jun 1;59(4):802-9. doi: 10.1002/prot.20462.
A complete protein sequence can usually determine a unique conformation; however, the situation is different for shorter subsequences--some of them are able to adopt unique conformations, independent of context; while others assume diverse conformations in different contexts. The conformations of subsequences are determined by the interplay between local and nonlocal interactions. A quantitative measure of such structural conservation or variability will be useful in the understanding of the sequence-structure relationship. In this report, we developed an approach using the support vector machine method to compute the conformational variability directly from sequences, which is referred to as the sequence structural entropy. As a practical application, we studied the relationship between sequence structural entropy and the hydrogen exchange for a set of well-studied proteins. We found that the slowest exchange cores usually comprise amino acids of the lowest sequence structural entropy. Our results indicate that structural conservation is closely related to the local structural stability. This relationship may have interesting implications in the protein folding processes, and may be useful in the study of the sequence-structure relationship.
完整的蛋白质序列通常能够决定一种独特的构象;然而,较短的子序列情况则有所不同——其中一些能够形成独特的构象,与上下文无关;而另一些在不同的上下文中会呈现出多样的构象。子序列的构象由局部和非局部相互作用之间的相互影响所决定。对这种结构保守性或变异性的定量衡量,将有助于理解序列与结构之间的关系。在本报告中,我们开发了一种使用支持向量机方法直接从序列计算构象变异性的方法,这被称为序列结构熵。作为一个实际应用,我们研究了一组经过充分研究的蛋白质的序列结构熵与氢交换之间的关系。我们发现,交换最慢的核心区域通常包含序列结构熵最低的氨基酸。我们的结果表明,结构保守性与局部结构稳定性密切相关。这种关系可能在蛋白质折叠过程中具有有趣的意义,并且可能有助于序列 - 结构关系的研究。