PLP_FS: prediction of lysine phosphoglycerylation sites in protein using support vector machine and fusion of multiple F_Score feature selection.

Author affiliations

Dept. of Computer Science and Engineering, Rajshahi University of Engineering and Technology, Rajshahi, Bangladesh.

Dept. of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur, Bangladesh.

Publication information

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac306.

DOI: 10.1093/bib/bbac306
PMID: 35929355
Abstract

Phosphoglycerylation, a recently identified post-translational modification (PTM), plays an essential role in the structure and function of proteins and has been linked to serious human diseases. Understanding the molecular mechanism behind phosphoglycerylation is therefore important for developing drugs against related diseases. However, accurately identifying phosphoglycerylation sites in a protein sequence experimentally is difficult, so an efficient computational model is highly desirable. Only a small number of computational models are currently available for identifying phosphoglycerylation sites, and their predictive performance is not yet satisfactory. In this study, a predictor named PLP_FS was therefore designed and constructed to identify phosphoglycerylation sites. For training, features were generated by three types of sequence-based feature extraction methods, an optimal feature subset was selected by fusing multiple F_Score feature selection techniques, and the selected features were used to fit a support vector machine classifier. In addition, the k-nearest-neighbor cleaning and SMOTE methods were applied to balance the benchmark dataset. In 10-fold cross-validation, the proposed model achieved an accuracy of 99.22%, a sensitivity of 98.17% and a specificity of 99.75%, which are better than those of other currently available predictors of phosphoglycerylation sites.
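
The abstract names F_Score feature selection and reports accuracy, sensitivity and specificity, but it does not say how the multiple F_Score selections are fused or how the classifier is configured. The sketch below is therefore only an illustration of the two standard building blocks, not the authors' implementation: a plain single F-score ranking (between-class separation divided by within-class sample variance) and the three reported metrics computed from a confusion matrix. The function names `f_score`, `rank_features` and `binary_metrics`, and the toy data, are hypothetical.

```python
from statistics import mean, variance


def f_score(pos, neg):
    """F-score of one feature, given its values in the positive and negative class.

    Assumes at least two samples per class, because the sample variance
    (n - 1 denominator) is undefined otherwise.
    """
    overall = mean(pos + neg)
    between = (mean(pos) - overall) ** 2 + (mean(neg) - overall) ** 2
    within = variance(pos) + variance(neg)
    if within == 0:
        # Constant feature within both classes: no spread to normalise by.
        return 0.0 if between == 0 else float("inf")
    return between / within


def rank_features(X, y):
    """Return (feature indices by descending F-score, list of F-scores)."""
    n_features = len(X[0])
    scores = []
    for j in range(n_features):
        pos = [row[j] for row, label in zip(X, y) if label == 1]
        neg = [row[j] for row, label in zip(X, y) if label == 0]
        scores.append(f_score(pos, neg))
    order = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return order, scores


def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity (recall on positives) and specificity (recall on negatives)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Toy data (made up): feature 0 separates the classes, feature 1 does not.
X = [[1.0, 5.0], [1.2, 3.0], [0.8, 4.0], [3.0, 4.0], [3.2, 5.0], [2.8, 3.0]]
y = [1, 1, 1, 0, 0, 0]
order, scores = rank_features(X, y)
metrics = binary_metrics(y, [1, 1, 0, 0, 0, 0])
```

In a full pipeline, the top-ranked features would be passed to a classifier and the metrics would be averaged over cross-validation folds; resampling steps such as SMOTE are normally applied to the training folds only, so that synthetic samples do not leak into the evaluation data.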

Similar articles

1
PLP_FS: prediction of lysine phosphoglycerylation sites in protein using support vector machine and fusion of multiple F_Score feature selection.
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac306.
2
iDPGK: characterization and identification of lysine phosphoglycerylation sites based on sequence-based features.
BMC Bioinformatics. 2020 Dec 9;21(1):568. doi: 10.1186/s12859-020-03916-5.
3
Predicting lysine phosphoglycerylation with fuzzy SVM by incorporating k-spaced amino acid pairs into Chou's general PseAAC.
J Theor Biol. 2016 May 21;397:145-50. doi: 10.1016/j.jtbi.2016.02.020. Epub 2016 Feb 22.
4
RAM-PGK: Prediction of Lysine Phosphoglycerylation Based on Residue Adjacency Matrix.
Genes (Basel). 2020 Dec 20;11(12):1524. doi: 10.3390/genes11121524.
5
Predicting protein lysine phosphoglycerylation sites by hybridizing many sequence based features.
Mol Biosyst. 2017 May 2;13(5):874-882. doi: 10.1039/c6mb00875e.
6
Bigram-PGK: phosphoglycerylation prediction using the technique of bigram probabilities of position specific scoring matrix.
BMC Mol Cell Biol. 2019 Dec 20;20(Suppl 2):57. doi: 10.1186/s12860-019-0240-1.
7
EvolStruct-Phogly: incorporating structural properties and evolutionary information from profile bigrams for the phosphoglycerylation prediction.
BMC Genomics. 2019 Apr 18;19(Suppl 9):984. doi: 10.1186/s12864-018-5383-5.
8
predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance.
PLoS One. 2021 Apr 1;16(4):e0249396. doi: 10.1371/journal.pone.0249396. eCollection 2021.
9
Prediction of lysine propionylation sites using biased SVM and incorporating four different sequence features into Chou's PseAAC.
J Mol Graph Model. 2017 Sep;76:356-363. doi: 10.1016/j.jmgm.2017.07.022. Epub 2017 Jul 25.
10
Prediction of lysine formylation sites using support vector machine based on the sample selection from majority classes and synthetic minority over-sampling techniques.
Biochimie. 2022 Jan;192:125-135. doi: 10.1016/j.biochi.2021.10.001. Epub 2021 Oct 7.

Cited by

1
Advancing the Accuracy of Anti-MRSA Peptide Prediction Through Integrating Multi-Source Protein Language Models.
Interdiscip Sci. 2025 Mar 11. doi: 10.1007/s12539-025-00696-5.