Hou Ting, Zheng Guangyong, Zhang Pingyu, Jia Jia, Li Jing, Xie Lu, Wei Chaochun, Li Yixue
School of Biological Engineering, East China University of Science and Technology, Shanghai, China ; Shanghai Center for Bioinformation Technology, Shanghai, China ; Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China.
CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China.
PLoS One. 2014 Feb 20;9(2):e89575. doi: 10.1371/journal.pone.0089575. eCollection 2014.
Lysine acetylation is a crucial type of protein post-translational modification, which is involved in many important cellular processes and serious diseases. However, identification of protein acetylated sites through traditional experiment methods is time-consuming and laborious. Those methods are not suitable to identify a large number of acetylated sites quickly. Therefore, computational methods are still very valuable to accelerate lysine acetylated site finding.
In this study, many biological characteristics of acetylated sites have been investigated, such as the amino acid sequence around the acetylated sites, the physicochemical property of the amino acids and the transition probability of adjacent amino acids. A logistic regression method was then utilized to integrate these information for generating a novel lysine acetylation prediction system named LAceP. When compared with existing methods, LAceP overwhelms most of state-of-the-art methods. Especially, LAceP has a more balanced prediction capability for positive and negative datasets.
LAceP can integrate different biological features to predict lysine acetylation with high accuracy. An online web server is freely available at http://www.scbit.org/iPTM/.
赖氨酸乙酰化是一种关键的蛋白质翻译后修饰类型,它参与许多重要的细胞过程及严重疾病。然而,通过传统实验方法鉴定蛋白质乙酰化位点既耗时又费力。这些方法不适用于快速鉴定大量的乙酰化位点。因此,计算方法对于加速赖氨酸乙酰化位点的发现仍然非常有价值。
在本研究中,对乙酰化位点的许多生物学特征进行了研究,例如乙酰化位点周围的氨基酸序列、氨基酸的物理化学性质以及相邻氨基酸的转移概率。然后利用逻辑回归方法整合这些信息,生成了一个名为LAceP的新型赖氨酸乙酰化预测系统。与现有方法相比,LAceP优于大多数先进方法。特别是,LAceP对正数据集和负数据集具有更平衡的预测能力。
LAceP可以整合不同的生物学特征来高精度地预测赖氨酸乙酰化。可通过http://www.scbit.org/iPTM/免费获得在线网络服务器。