Guo Jun-Tao, Jaromczyk Jerzy W, Xu Ying
Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology, University of Georgia, Athens, Georgia 30602, USA.
Proteins. 2007 May 15;67(3):548-58. doi: 10.1002/prot.21285.
Chameleon sequences have been implicated in amyloid related diseases. Here we report an analysis of two types of chameleon sequences, chameleon-HS (Helix vs. Strand) and chameleon-HE (Helix vs. Sheet), based on known structures in Protein Data Bank. Our survey shows that the longest chameleon-HS is eight residues while the longest chameleon-HE is seven residues. We have done a detailed analysis on the local and global environment that might contribute to the unique conformation of a chameleon sequence. We found that the existence of chameleon sequences does not present a problem for secondary structure prediction programs, including the first generation prediction programs, such as Chou-Fasman algorithm, and the third generation prediction programs that utilize evolution information. We have also investigated the possible implication of chameleon sequences in structural conservation and functional diversity of alternatively spliced protein isoforms.
变色龙序列与淀粉样相关疾病有关。在此,我们基于蛋白质数据库中的已知结构,报告了对两种类型变色龙序列的分析,即变色龙-HS(螺旋与链)和变色龙-HE(螺旋与片层)。我们的调查表明,最长的变色龙-HS为8个残基,而最长的变色龙-HE为7个残基。我们对可能促成变色龙序列独特构象的局部和全局环境进行了详细分析。我们发现,变色龙序列的存在对于二级结构预测程序而言并非问题,这些程序包括第一代预测程序,如Chou-Fasman算法,以及利用进化信息的第三代预测程序。我们还研究了变色龙序列在可变剪接蛋白异构体的结构保守性和功能多样性方面的可能影响。