• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iPhos-PseEn:通过将不同的伪组分融合到集成分类器中来识别蛋白质中的磷酸化位点。

iPhos-PseEn: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier.

作者信息

Qiu Wang-Ren, Xiao Xuan, Xu Zhao-Chun, Chou Kuo-Chen

机构信息

Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, China.

Department of Computer Science and Bond Life Science Center, University of Missouri, Columbia, MO, USA.

出版信息

Oncotarget. 2016 Aug 9;7(32):51270-51283. doi: 10.18632/oncotarget.9987.

DOI:10.18632/oncotarget.9987
PMID:27323404
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5239474/
Abstract

Protein phosphorylation is a posttranslational modification (PTM or PTLM), where a phosphoryl group is added to the residue(s) of a protein molecule. The most commonly phosphorylated amino acids occur at serine (S), threonine (T), and tyrosine (Y). Protein phosphorylation plays a significant role in a wide range of cellular processes; meanwhile its dysregulation is also involved with many diseases. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of S, T, or Y, which ones can be phosphorylated, and which ones cannot? To address this problem, we have developed a predictor called iPhos-PseEn by fusing four different pseudo component approaches (amino acids' disorder scores, nearest neighbor scores, occurrence frequencies, and position weights) into an ensemble classifier via a voting system. Rigorous cross-validations indicated that the proposed predictor remarkably outperformed its existing counterparts. For the convenience of most experimental scientists, a user-friendly web-server for iPhos-PseEn has been established at http://www.jci-bioinfo.cn/iPhos-PseEn, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved.

摘要

蛋白质磷酸化是一种翻译后修饰(PTM或PTLM),其中一个磷酰基被添加到蛋白质分子的残基上。最常发生磷酸化的氨基酸是丝氨酸(S)、苏氨酸(T)和酪氨酸(Y)。蛋白质磷酸化在广泛的细胞过程中发挥着重要作用;同时,其失调也与许多疾病有关。因此,从基础研究和药物开发的角度来看,我们面临一个具有挑战性的问题:对于一个含有许多S、T或Y残基的未表征蛋白质序列,哪些残基可以被磷酸化,哪些不能?为了解决这个问题,我们开发了一种名为iPhos-PseEn的预测器,通过投票系统将四种不同的伪组分方法(氨基酸的无序得分、最近邻得分、出现频率和位置权重)融合到一个集成分类器中。严格的交叉验证表明,所提出的预测器明显优于现有的同类预测器。为了方便大多数实验科学家,已在http://www.jci-bioinfo.cn/iPhos-PseEn建立了一个用户友好的iPhos-PseEn网络服务器,用户通过该服务器可以轻松获得所需结果,而无需处理所涉及的复杂数学方程。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f300/5239474/c46b172d173e/oncotarget-07-51270-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f300/5239474/2308a3088e25/oncotarget-07-51270-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f300/5239474/807932ad6574/oncotarget-07-51270-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f300/5239474/fdee39a305e5/oncotarget-07-51270-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f300/5239474/c46b172d173e/oncotarget-07-51270-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f300/5239474/2308a3088e25/oncotarget-07-51270-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f300/5239474/807932ad6574/oncotarget-07-51270-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f300/5239474/fdee39a305e5/oncotarget-07-51270-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f300/5239474/c46b172d173e/oncotarget-07-51270-g004.jpg

相似文献

1
iPhos-PseEn: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier.iPhos-PseEn:通过将不同的伪组分融合到集成分类器中来识别蛋白质中的磷酸化位点。
Oncotarget. 2016 Aug 9;7(32):51270-51283. doi: 10.18632/oncotarget.9987.
2
iPhos-PseEvo: Identifying Human Phosphorylated Proteins by Incorporating Evolutionary Information into General PseAAC via Grey System Theory.iPhos-PseEvo:通过灰色系统理论将进化信息纳入通用 PseAAC 来识别人类磷酸化蛋白质。
Mol Inform. 2017 May;36(5-6). doi: 10.1002/minf.201600010. Epub 2016 May 12.
3
iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC.iCar-PseCp:通过蒙特卡洛采样识别蛋白质中的羰基化位点,并将序列耦合效应纳入通用伪氨基酸组成中。
Oncotarget. 2016 Jun 7;7(23):34558-70. doi: 10.18632/oncotarget.9148.
4
pSuc-Lys: Predict lysine succinylation sites in proteins with PseAAC and ensemble random forest approach.pSuc-Lys:利用伪氨基酸组成和集成随机森林方法预测蛋白质中的赖氨酸琥珀酰化位点。
J Theor Biol. 2016 Apr 7;394:223-230. doi: 10.1016/j.jtbi.2016.01.020. Epub 2016 Jan 22.
5
iPTM-mLys: identifying multiple lysine PTM sites and their different types.iPTM-mLys:鉴定多个赖氨酸 PTM 位点及其不同类型。
Bioinformatics. 2016 Oct 15;32(20):3116-3123. doi: 10.1093/bioinformatics/btw380. Epub 2016 Jun 22.
6
iSuc-PseOpt: Identifying lysine succinylation sites in proteins by incorporating sequence-coupling effects into pseudo components and optimizing imbalanced training dataset.iSuc-PseOpt:通过将序列耦合效应纳入伪组件并优化不平衡训练数据集来识别蛋白质中的赖氨酸琥珀酰化位点。
Anal Biochem. 2016 Mar 15;497:48-56. doi: 10.1016/j.ab.2015.12.009. Epub 2015 Dec 23.
7
iHyd-PseCp: Identify hydroxyproline and hydroxylysine in proteins by incorporating sequence-coupled effects into general PseAAC.iHyd-PseCp:通过将序列耦合效应纳入通用伪氨基酸组成来鉴定蛋白质中的羟脯氨酸和羟赖氨酸。
Oncotarget. 2016 Jul 12;7(28):44310-44321. doi: 10.18632/oncotarget.10027.
8
iKcr-PseEns: Identify lysine crotonylation sites in histone proteins with pseudo components and ensemble classifier.iKcr-PseEns:使用伪组件和集成分类器鉴定组蛋白中的赖氨酸巴豆酰化位点。
Genomics. 2018 Sep;110(5):239-246. doi: 10.1016/j.ygeno.2017.10.008. Epub 2017 Nov 17.
9
iPGK-PseAAC: Identify Lysine Phosphoglycerylation Sites in Proteins by Incorporating Four Different Tiers of Amino Acid Pairwise Coupling Information into the General PseAAC.iPGK-PseAAC:通过将四种不同层次的氨基酸成对耦合信息整合到通用伪氨基酸组成中识别蛋白质中的赖氨酸磷酸甘油化位点。
Med Chem. 2017;13(6):552-559. doi: 10.2174/1573406413666170515120507.
10
Multi-iPPseEvo: A Multi-label Classifier for Identifying Human Phosphorylated Proteins by Incorporating Evolutionary Information into Chou's General PseAAC via Grey System Theory.多标签 iPPseEvo 模型:一种通过将进化信息纳入到 Chou 的通用 PseAAC 中,并利用灰色系统理论来识别人类磷酸化蛋白质的多标签分类器。
Mol Inform. 2017 Mar;36(3). doi: 10.1002/minf.201600085. Epub 2016 Sep 29.

引用本文的文献

1
GraphPhos: Predict Protein-Phosphorylation Sites Based on Graph Neural Networks.GraphPhos:基于图神经网络预测蛋白质磷酸化位点
Int J Mol Sci. 2025 Jan 23;26(3):941. doi: 10.3390/ijms26030941.
2
MoRF-FUNCpred: Molecular Recognition Feature Function Prediction Based on Multi-Label Learning and Ensemble Learning.MoRF-FUNCpred:基于多标签学习和集成学习的分子识别特征功能预测
Front Pharmacol. 2022 Mar 8;13:856417. doi: 10.3389/fphar.2022.856417. eCollection 2022.
3
Mal-Light: Enhancing Lysine Malonylation Sites Prediction Problem Using Evolutionary-based Features.

本文引用的文献

1
iPhos-PseEvo: Identifying Human Phosphorylated Proteins by Incorporating Evolutionary Information into General PseAAC via Grey System Theory.iPhos-PseEvo:通过灰色系统理论将进化信息纳入通用 PseAAC 来识别人类磷酸化蛋白质。
Mol Inform. 2017 May;36(5-6). doi: 10.1002/minf.201600010. Epub 2016 May 12.
2
iRSpot-EL: identify recombination spots with an ensemble learning approach.iRSpot-EL:基于集成学习方法识别重组热点。
Bioinformatics. 2017 Jan 1;33(1):35-41. doi: 10.1093/bioinformatics/btw539. Epub 2016 Aug 16.
3
PseDNA-Pro: DNA-Binding Protein Identification by Combining Chou's PseAAC and Physicochemical Distance Transformation.
Mal-Light:利用基于进化的特征增强赖氨酸丙二酰化位点预测问题
IEEE Access. 2020;8:77888-77902. doi: 10.1109/access.2020.2989713. Epub 2020 Apr 22.
4
Using phosphoproteomics data to understand cellular signaling: a comprehensive guide to bioinformatics resources.利用磷酸化蛋白质组学数据理解细胞信号传导:生物信息学资源综合指南
Clin Proteomics. 2020 Jul 11;17:27. doi: 10.1186/s12014-020-09290-x. eCollection 2020.
5
iSulfoTyr-PseAAC: Identify Tyrosine Sulfation Sites by Incorporating Statistical Moments Chou's 5-steps Rule and Pseudo Components.iSulfoTyr-PseAAC:通过结合统计矩、周氏五步法则和伪组分来识别酪氨酸硫酸化位点
Curr Genomics. 2019 May;20(4):306-320. doi: 10.2174/1389202920666190819091609.
6
iMethylK_pseAAC: Improving Accuracy of Lysine Methylation Sites Identification by Incorporating Statistical Moments and Position Relative Features into General PseAAC Chou's 5-steps Rule.iMethylK_pseAAC:通过将统计矩和位置相关特征纳入通用伪氨基酸组成的周氏五步法则来提高赖氨酸甲基化位点识别的准确性
Curr Genomics. 2019 May;20(4):275-292. doi: 10.2174/1389202920666190809095206.
7
Identifying Acetylation Protein by Fusing Its PseAAC and Functional Domain Annotation.通过融合乙酰化蛋白的伪氨基酸组成和功能域注释来识别该蛋白
Front Bioeng Biotechnol. 2019 Dec 6;7:311. doi: 10.3389/fbioe.2019.00311. eCollection 2019.
8
RAACBook: a web server of reduced amino acid alphabet for sequence-dependent inference by using Chou's five-step rule.RAACBook:一个基于简化氨基酸字母表的网络服务器,用于通过使用周保罗的五步法则进行序列相关推断。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz131.
9
iN6-methylat (5-step): identifying DNA N-methyladenine sites in rice genome using continuous bag of nucleobases via Chou's 5-step rule.iN6-methylat(5 步):使用 Chou 的 5 步规则通过连续核苷酸袋鉴定水稻基因组中的 DNA N6-甲基腺嘌呤位点。
Mol Genet Genomics. 2019 Oct;294(5):1173-1182. doi: 10.1007/s00438-019-01570-y. Epub 2019 May 4.
10
LAIPT: Lysine Acetylation Site Identification with Polynomial Tree.LAIPT:利用多项式树进行赖氨酸乙酰化位点鉴定。
Int J Mol Sci. 2018 Dec 29;20(1):113. doi: 10.3390/ijms20010113.
PseDNA-Pro:结合周氏伪氨基酸组成和物理化学距离变换的DNA结合蛋白鉴定方法
Mol Inform. 2015 Jan;34(1):8-17. doi: 10.1002/minf.201400025. Epub 2014 Sep 26.
4
iDHS-EL: identifying DNase I hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework.iDHS-EL:通过将三种不同模式的伪核苷酸组成融合到一个集成学习框架中,来识别 DNase I 超敏位点。
Bioinformatics. 2016 Aug 15;32(16):2411-8. doi: 10.1093/bioinformatics/btw186. Epub 2016 Apr 8.
5
iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC.iCar-PseCp:通过蒙特卡洛采样识别蛋白质中的羰基化位点,并将序列耦合效应纳入通用伪氨基酸组成中。
Oncotarget. 2016 Jun 7;7(23):34558-70. doi: 10.18632/oncotarget.9148.
6
iROS-gPseKNC: Predicting replication origin sites in DNA by incorporating dinucleotide position-specific propensity into general pseudo nucleotide composition.iROS-gPseKNC:通过将二核苷酸位置特异性倾向纳入通用伪核苷酸组成来预测DNA中的复制起始位点。
Oncotarget. 2016 Jun 7;7(23):34180-9. doi: 10.18632/oncotarget.9057.
7
iACP: a sequence-based tool for identifying anticancer peptides.iACP:一种用于鉴定抗癌肽的基于序列的工具。
Oncotarget. 2016 Mar 29;7(13):16895-909. doi: 10.18632/oncotarget.7815.
8
pSuc-Lys: Predict lysine succinylation sites in proteins with PseAAC and ensemble random forest approach.pSuc-Lys:利用伪氨基酸组成和集成随机森林方法预测蛋白质中的赖氨酸琥珀酰化位点。
J Theor Biol. 2016 Apr 7;394:223-230. doi: 10.1016/j.jtbi.2016.01.020. Epub 2016 Jan 22.
9
iPPBS-Opt: A Sequence-Based Ensemble Classifier for Identifying Protein-Protein Binding Sites by Optimizing Imbalanced Training Datasets.iPPBS-Opt:一种基于序列的集成分类器,通过优化不平衡训练数据集来识别蛋白质-蛋白质结合位点。
Molecules. 2016 Jan 19;21(1):E95. doi: 10.3390/molecules21010095.
10
pRNAm-PC: Predicting N(6)-methyladenosine sites in RNA sequences via physical-chemical properties.pRNAm-PC:通过物理化学性质预测RNA序列中的N6-甲基腺嘌呤位点。
Anal Biochem. 2016 Mar 15;497:60-7. doi: 10.1016/j.ab.2015.12.017. Epub 2015 Dec 31.