Sridhar Settu, Nagamruta Mallapragada, Guruprasad Kunchur
Bioinformatics, Centre for Cellular and Molecular Biology (CCMB), Uppal Road, Hyderabad, 500007, India.
PLoS One. 2015 Oct 14;10(10):e0139568. doi: 10.1371/journal.pone.0139568. eCollection 2015.
The analyses of 3967 representative proteins selected from the Protein Data Bank revealed the presence of 2803 pentapeptide and large palindrome sequences with known secondary structure conformation. These represent 2014 unique palindrome sequences. 60% palindromes are not associated with any regular secondary structure and 28% are in helix conformation, 11% in strand conformation and 1% in the coil conformation. The average solvent accessibility values are in the range between 0-155.28 Å2 suggesting that the palindromes in proteins can be either buried, exposed to the solvent or share an intermittent property. The number of residue neighborhood contacts defined by interactions ≤ 3.2 Ǻ is in the range between 0-29 residues. Palindromes of the same length in helix, strand and coil conformation are associated with different amino acid residue preferences at the individual positions. Nearly, 20% palindromes interact with catalytic/active site residues, ligand or metal ions in proteins and may therefore be important for function in the corresponding protein. The average hydrophobicity values for the pentapeptide and large palindromes range between -4.3 to +4.32 and the number of palindromes is almost equally distributed between the negative and positive hydrophobicity values. The palindromes represent 107 different protein families and the hydrolases, transferases, oxidoreductases and lyases contain relatively large number of palindromes.
从蛋白质数据库中选取的3967种代表性蛋白质分析显示,存在2803个具有已知二级结构构象的五肽和大回文序列。这些代表了2014个独特的回文序列。60%的回文与任何规则二级结构无关,28%呈螺旋构象,11%呈链构象,1%呈卷曲构象。平均溶剂可及性值在0 - 155.28 Å2之间,这表明蛋白质中的回文序列可能被埋藏、暴露于溶剂中或具有间歇性特征。由≤ 3.2 Å相互作用定义的残基邻域接触数在0 - 29个残基之间。螺旋、链和卷曲构象中相同长度的回文在各个位置与不同的氨基酸残基偏好相关。近20%的回文与蛋白质中的催化/活性位点残基、配体或金属离子相互作用,因此可能对相应蛋白质的功能很重要。五肽和大回文的平均疏水性值在 - 4.3至 + 4.32之间,回文数量在正负疏水性值之间几乎均匀分布。回文代表107个不同的蛋白质家族,水解酶、转移酶、氧化还原酶和裂解酶含有相对较多的回文。