Department of Computer Science, School of Systems and Technology, University of Management and Technology, C-II Johar Town, Lahore, Pakistan.
Department of Computer Science, School of Systems and Technology, University of Management and Technology, C-II Johar Town, Lahore, Pakistan.
J Theor Biol. 2019 May 7;468:1-11. doi: 10.1016/j.jtbi.2019.02.007. Epub 2019 Feb 12.
The protein prenylation (or S-prenylation) is one of the most essential modifications, required for the association of membrane of a plethora of signalling proteins with the key biological process such as protein trafficking, cell growth, proliferation and differentiation. Due to the ubiquitous nature of S-prenylation and its role in cellular functions, any defect in the biosynthesis or regulation of the isoprenoid leads to the occurrence of a variety of diseases including neurodegenerative disorders, metabolic issues, cardiovascular diseases and one of the most fatal diseases, cancer. This depicts the strong biological significance of S-prenylation, thus, the timely and accurate identification of S-prenylation sites is crucial and may provide with possible ways to understand the mechanism of this modification in proteins. To avoid laborious, resource demanding and expensive experimental techniques of identifying S-prenylation sites, here, we propose a novel predictor namely SPrenylC-PseAAC by integrating the Chou's Pseudo Amino Acid Composition (PseAAC) and relative/absolute position-based features. A 2-tier classification was performed i.e., at first level, identification of prenylation and non-prenylation sites is performed, while at the second level, identification of S-farnesylation and S-geranylgeranylation sites is performed. Using jackknife, perdition model validation gave 95.31% accuracy for tier-1 classification and 91.42% for tier 2 classification, while for 10-fold cross-validation, it gave 93.68% accuracy for tier-1 classification and 89.70% for tier 2 classification. Thus the proposed predictor can help in predicting the Prenylation sites in an efficient and accurate way. The SPrenylC-PseAAC is available at (biopred.org/prenyl).
蛋白质的 prenylation(或 S-prenylation)是最基本的修饰之一,对于将大量信号蛋白与关键生物过程(如蛋白质运输、细胞生长、增殖和分化)的膜结合至关重要。由于 S-prenylation 的普遍存在及其在细胞功能中的作用,异戊烯基生物合成或调节的任何缺陷都会导致多种疾病的发生,包括神经退行性疾病、代谢问题、心血管疾病和最致命的疾病之一——癌症。这说明了 S-prenylation 的强大生物学意义,因此,及时准确地识别 S-prenylation 位点至关重要,这可能为理解蛋白质中这种修饰的机制提供可能的途径。为了避免繁琐、资源密集和昂贵的实验技术来识别 S-prenylation 位点,我们提出了一种新的预测器,即 SPrenylC-PseAAC,它集成了 Chou 的伪氨基酸组成(PseAAC)和相对/绝对位置特征。进行了两层分类,即第一层是对 prenylation 和 non-prenylation 位点进行识别,第二层是对 S-farnesylation 和 S-geranylgeranylation 位点进行识别。使用 jackknife 和 perdition 模型验证,第一层分类的准确率为 95.31%,第二层分类的准确率为 91.42%,而 10 倍交叉验证的准确率分别为 93.68%和 89.70%。因此,该预测器可以帮助以高效、准确的方式预测 prenylation 位点。SPrenylC-PseAAC 可在(biopred.org/prenyl)获得。