Ilyas Sarah, Hussain Waqar, Ashraf Adeel, Khan Yaser Daanial, Khan Sher Afzal, Chou Kuo-Chen
1Department of Computer Science, School of Systems and Technology, University of Management and Technology, P.O. Box 10033, C-II, Johar Town, Lahore54770, Pakistan; 2Faculty of Computing and Information Technology in Rabigh, Jeddah, 21577, KSA; 3Gordon Life Science Institute, Boston, MA02478, USA; 4Department of Computer Sciences, Abdul Wali Khan University, Mardan, Pakistan.
Curr Genomics. 2019 May;20(4):275-292. doi: 10.2174/1389202920666190809095206.
Methylation is one of the most important post-translational modifications in the human body which usually arises on lysine among the most intensely modified residues. It performs a dynamic role in numerous biological procedures, such as regulation of gene expression, regulation of protein function and RNA processing. Therefore, to identify lysine methylation sites is an important challenge as some experimental procedures are time-consuming.
Herein, we propose a computational predictor named iMethylK_pseAAC to identify lysine methylation sites.
Firstly, we constructed feature vectors based on PseAAC using position and composition rel-ative features and statistical moments. A neural network is trained based on the extracted features. The performance of the proposed method is then validated using cross-validation and jackknife testing.
The objective evaluation of the predictor showed accuracy of 96.7% for self-consistency, 91.61% for 10-fold cross-validation and 93.42% for jackknife testing.
It is concluded that iMethylK_pseAAC outperforms the counterparts to identify lysine methylation sites such as iMethyl_pseACC, BPB_pPMS and PMeS.
甲基化是人体中最重要的翻译后修饰之一,通常发生在修饰程度最高的残基中的赖氨酸上。它在众多生物学过程中发挥着动态作用,如基因表达调控、蛋白质功能调节和RNA加工。因此,由于一些实验过程耗时,识别赖氨酸甲基化位点是一项重要挑战。
在此,我们提出一种名为iMethylK_pseAAC的计算预测器来识别赖氨酸甲基化位点。
首先,我们基于伪氨基酸组成(PseAAC)使用位置和组成相对特征以及统计矩构建特征向量。基于提取的特征训练神经网络。然后使用交叉验证和留一法测试验证所提出方法的性能。
预测器的客观评估显示,自一致性准确率为96.7%,十折交叉验证准确率为91.61%,留一法测试准确率为93.42%。
得出结论,iMethylK_pseAAC在识别赖氨酸甲基化位点方面优于同类工具,如iMethyl_pseACC、BPB_pPMS和PMeS。