Department of Chemistry, Nanchang University, Nanchang 330031, China.
J Theor Biol. 2012 Oct 7;310:223-30. doi: 10.1016/j.jtbi.2012.06.030. Epub 2012 Jul 11.
Lysine acetylation and methylation are two major post-translational modifications of lysine residues. They play vital roles in both biological and pathological processes. Specific lysine residues in H3 histone protein tails appear to be targeted for either acetylation or methylation. Hence it is very challenging to distinguish between acetylated and methylated lysine residues using computational methods. This work presents a method that incorporates protein sequence information, secondary structure and amino acid properties to differentiate acetyl-lysine from methyl-lysine. We apply an encoding scheme based on grouped weight and position weight amino acid composition to extract sequence information and physicochemical properties around lysine sites. The proposed method achieves an accuracy of 93.3% using a jackknife test. Feature analysis demonstrates that the prediction model with multiple features can take full advantage of the supplementary information from different features to improve classification performance and prediction robustness. Analysis of the characteristics of lysine residues which can be either methylated or acetylated shows that they are more similar to methyl-lysine than to acetyl-lysine.
赖氨酸乙酰化和甲基化是赖氨酸残基的两种主要翻译后修饰。它们在生物和病理过程中都起着至关重要的作用。H3 组蛋白尾部的特定赖氨酸残基似乎是乙酰化或甲基化的目标。因此,使用计算方法区分乙酰化赖氨酸和甲基化赖氨酸具有很大的挑战性。本工作提出了一种方法,该方法结合了蛋白质序列信息、二级结构和氨基酸性质,以区分乙酰化赖氨酸和甲基化赖氨酸。我们应用基于分组权重和位置权重氨基酸组成的编码方案来提取赖氨酸位点周围的序列信息和理化性质。使用折刀法测试,该方法的准确率达到 93.3%。特征分析表明,具有多个特征的预测模型可以充分利用来自不同特征的补充信息,以提高分类性能和预测稳健性。对可以甲基化或乙酰化的赖氨酸残基特征的分析表明,它们与甲基化赖氨酸的相似性大于与乙酰化赖氨酸的相似性。