PlantPhos：使用最大依赖分解法鉴定具有底物特异性的植物磷酸化位点。

PlantPhos: using maximal dependence decomposition to identify plant phosphorylation sites with substrate site specificity.

机构信息

Department of Computer Science and Engineering, Yuan Ze University, Chungli 320, Taiwan.

出版信息

BMC Bioinformatics. 2011 Jun 26;12:261. doi: 10.1186/1471-2105-12-261.

DOI:10.1186/1471-2105-12-261

PMID:21703007

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3228547/

Abstract

BACKGROUND

Protein phosphorylation catalyzed by kinases plays crucial regulatory roles in intracellular signal transduction. Due to the difficulty in performing high-throughput mass spectrometry-based experiment, there is a desire to predict phosphorylation sites using computational methods. However, previous studies regarding in silico prediction of plant phosphorylation sites lack the consideration of kinase-specific phosphorylation data. Thus, we are motivated to propose a new method that investigates different substrate specificities in plant phosphorylation sites.

RESULTS

Experimentally verified phosphorylation data were extracted from TAIR9-a protein database containing 3006 phosphorylation data from the plant species Arabidopsis thaliana. In an attempt to investigate the various substrate motifs in plant phosphorylation, maximal dependence decomposition (MDD) is employed to cluster a large set of phosphorylation data into subgroups containing significantly conserved motifs. Profile hidden Markov model (HMM) is then applied to learn a predictive model for each subgroup. Cross-validation evaluation on the MDD-clustered HMMs yields an average accuracy of 82.4% for serine, 78.6% for threonine, and 89.0% for tyrosine models. Moreover, independent test results using Arabidopsis thaliana phosphorylation data from UniProtKB/Swiss-Prot show that the proposed models are able to correctly predict 81.4% phosphoserine, 77.1% phosphothreonine, and 83.7% phosphotyrosine sites. Interestingly, several MDD-clustered subgroups are observed to have similar amino acid conservation with the substrate motifs of well-known kinases from Phospho.ELM-a database containing kinase-specific phosphorylation data from multiple organisms.

CONCLUSIONS

This work presents a novel method for identifying plant phosphorylation sites with various substrate motifs. Based on cross-validation and independent testing, results show that the MDD-clustered models outperform models trained without using MDD. The proposed method has been implemented as a web-based plant phosphorylation prediction tool, PlantPhos http://csb.cse.yzu.edu.tw/PlantPhos/. Additionally, two case studies have been demonstrated to further evaluate the effectiveness of PlantPhos.

摘要

背景

激酶催化的蛋白质磷酸化在细胞内信号转导中发挥着至关重要的调节作用。由于高通量质谱实验的难度，人们希望使用计算方法来预测磷酸化位点。然而，以前关于植物磷酸化位点的计算机预测研究缺乏对激酶特异性磷酸化数据的考虑。因此，我们有动力提出一种新的方法来研究植物磷酸化位点的不同底物特异性。

结果

从包含拟南芥 3006 个磷酸化数据的 TAIR9 蛋白质数据库中提取了实验验证的磷酸化数据。为了研究植物磷酸化中的各种底物基序，我们采用最大依赖分解（MDD）将大量磷酸化数据聚类成包含显著保守基序的子组。然后应用轮廓隐马尔可夫模型（HMM）为每个子组学习预测模型。对 MDD 聚类的 HMM 进行交叉验证评估，得到丝氨酸模型的平均准确率为 82.4%，苏氨酸模型的准确率为 78.6%，酪氨酸模型的准确率为 89.0%。此外，使用 UniProtKB/Swiss-Prot 中的拟南芥磷酸化数据进行独立测试的结果表明，所提出的模型能够正确预测 81.4%的磷酸丝氨酸、77.1%的磷酸苏氨酸和 83.7%的磷酸酪氨酸位点。有趣的是，几个 MDD 聚类的子组被观察到与 Phospho.ELM-a 数据库中包含来自多个生物体的激酶特异性磷酸化数据的激酶特异性磷酸化数据的底物基序具有相似的氨基酸保守性。

结论

本研究提出了一种识别具有不同底物基序的植物磷酸化位点的新方法。基于交叉验证和独立测试，结果表明，使用 MDD 聚类的模型优于不使用 MDD 聚类的模型。该方法已被实现为一个基于网络的植物磷酸化预测工具，PlantPhos http://csb.cse.yzu.edu.tw/PlantPhos/。此外，还进行了两个案例研究，以进一步评估 PlantPhos 的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8155/3228547/97e158dd363b/1471-2105-12-261-1.jpg

相似文献

PlantPhos: using maximal dependence decomposition to identify plant phosphorylation sites with substrate site specificity.PlantPhos：使用最大依赖分解法鉴定具有底物特异性的植物磷酸化位点。

BMC Bioinformatics. 2011 Jun 26;12:261. doi: 10.1186/1471-2105-12-261.

Identifying protein phosphorylation sites with kinase substrate specificity on human viruses.鉴定人类病毒中具有激酶底物特异性的蛋白质磷酸化位点。

PLoS One. 2012;7(7):e40694. doi: 10.1371/journal.pone.0040694. Epub 2012 Jul 23.

Incorporating substrate sequence motifs and spatial amino acid composition to identify kinase-specific phosphorylation sites on protein three-dimensional structures.将底物序列基序和空间氨基酸组成纳入蛋白质三维结构中，以鉴定激酶特异性磷酸化位点。

BMC Bioinformatics. 2013;14 Suppl 16(Suppl 16):S2. doi: 10.1186/1471-2105-14-S16-S2. Epub 2013 Oct 22.

ViralPhos: incorporating a recursively statistical method to predict phosphorylation sites on virus proteins.ViralPhos：一种整合递归统计方法的病毒蛋白磷酸化位点预测工具。

BMC Bioinformatics. 2013;14 Suppl 16(Suppl 16):S10. doi: 10.1186/1471-2105-14-S16-S10. Epub 2013 Oct 22.

A two-layered machine learning method to identify protein O-GlcNAcylation sites with O-GlcNAc transferase substrate motifs.一种用于识别具有O-连接N-乙酰葡糖胺转移酶底物基序的蛋白质O-连接N-乙酰葡糖胺化位点的两层机器学习方法。

BMC Bioinformatics. 2015;16 Suppl 18(Suppl 18):S10. doi: 10.1186/1471-2105-16-S18-S10. Epub 2015 Dec 9.

MDD-carb: a combinatorial model for the identification of protein carbonylation sites with substrate motifs.MDD-carb：一种用于识别具有底物基序的蛋白质羰基化位点的组合模型。

BMC Syst Biol. 2017 Dec 21;11(Suppl 7):137. doi: 10.1186/s12918-017-0511-4.

Characterization and identification of lysine glutarylation based on intrinsic interdependence between positions in the substrate sites.基于底物结合位点中位置的内在相关性对赖氨酸瓜氨酸化的表征和鉴定。

BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):384. doi: 10.1186/s12859-018-2394-9.

MDD-SOH: exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs.MDD-SOH：利用最大依赖分解来识别具有底物基序的S-亚磺酰化位点。

Bioinformatics. 2016 Jan 15;32(2):165-72. doi: 10.1093/bioinformatics/btv558. Epub 2015 Sep 26.

MDD-Palm: Identification of protein S-palmitoylation sites with substrate motifs based on maximal dependence decomposition.MDD-Palm：基于最大依赖分解法识别具有底物基序的蛋白质S-棕榈酰化位点

PLoS One. 2017 Jun 29;12(6):e0179529. doi: 10.1371/journal.pone.0179529. eCollection 2017.

SNOSite: exploiting maximal dependence decomposition to identify cysteine S-nitrosylation with substrate site specificity.SNOSite：利用最大依赖分解鉴定具有底物特异性的半胱氨酸 S-亚硝酰化。

PLoS One. 2011;6(7):e21849. doi: 10.1371/journal.pone.0021849. Epub 2011 Jul 15.

引用本文的文献

Recent advances in proteomics and metabolomics in plants.植物蛋白质组学和代谢组学的最新进展。

Mol Hortic. 2022 Jul 23;2(1):17. doi: 10.1186/s43897-022-00038-9.

Regulation of PaRBOH1-mediated ROS production in Norway spruce by Ca binding and phosphorylation.通过钙结合和磷酸化对挪威云杉中PaRBOH1介导的活性氧产生的调控

Front Plant Sci. 2022 Oct 13;13:978586. doi: 10.3389/fpls.2022.978586. eCollection 2022.

dbPTM in 2022: an updated database for exploring regulatory networks and functional associations of protein post-translational modifications.dbPTM 在 2022 年：一个更新的数据库，用于探索蛋白质翻译后修饰的调控网络和功能关联。

Nucleic Acids Res. 2022 Jan 7;50(D1):D471-D479. doi: 10.1093/nar/gkab1017.

A Novel Putative Microtubule-Associated Protein Is Involved in Arbuscule Development during Arbuscular Mycorrhiza Formation.一种新型假定微管相关蛋白参与丛枝菌根形成过程中的丛枝发育。

Plant Cell Physiol. 2021 May 11;62(2):306-320. doi: 10.1093/pcp/pcaa159.

GasPhos: Protein Phosphorylation Site Prediction Using a New Feature Selection Approach with a GA-Aided Ant Colony System.GasPhos：一种使用新的特征选择方法和 GA 辅助蚁群系统进行蛋白质磷酸化位点预测。

Int J Mol Sci. 2020 Oct 24;21(21):7891. doi: 10.3390/ijms21217891.

BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):384. doi: 10.1186/s12859-018-2394-9.

dbPTM in 2019: exploring disease association and cross-talk of post-translational modifications.dbPTM 于 2019 年：探索翻译后修饰的疾病关联和串扰。

Nucleic Acids Res. 2019 Jan 8;47(D1):D298-D308. doi: 10.1093/nar/gky1074.

In silico insights on diverse interacting partners and phosphorylation sites of respiratory burst oxidase homolog (Rbohs) gene families from Arabidopsis and rice.基于计算机的拟南芥和水稻呼吸爆发氧化酶同源基因家族不同相互作用伙伴和磷酸化位点的研究进展。

BMC Plant Biol. 2018 Aug 10;18(1):161. doi: 10.1186/s12870-018-1378-2.

MDD-carb: a combinatorial model for the identification of protein carbonylation sites with substrate motifs.MDD-carb：一种用于识别具有底物基序的蛋白质羰基化位点的组合模型。

BMC Syst Biol. 2017 Dec 21;11(Suppl 7):137. doi: 10.1186/s12918-017-0511-4.

Investigation and identification of functional post-translational modification sites associated with drug binding and protein-protein interactions.与药物结合及蛋白质-蛋白质相互作用相关的功能性翻译后修饰位点的研究与鉴定。

BMC Syst Biol. 2017 Dec 21;11(Suppl 7):132. doi: 10.1186/s12918-017-0506-1.

本文引用的文献

Exploiting maximal dependence decomposition to identify conserved motifs from a group of aligned signal sequences.利用最大依赖分解从一组对齐的信号序列中识别保守基序。

Bioinformatics. 2011 Jul 1;27(13):1780-7. doi: 10.1093/bioinformatics/btr291. Epub 2011 May 6.

A comprehensive resource for integrating and displaying protein post-translational modifications.一个用于整合和展示蛋白质翻译后修饰的综合资源。

BMC Res Notes. 2009 Jun 23;2:111. doi: 10.1186/1756-0500-2-111.

Incorporating structural characteristics for identification of protein methylation sites.整合结构特征用于蛋白质甲基化位点的识别。

J Comput Chem. 2009 Jul 15;30(9):1532-43. doi: 10.1002/jcc.21232.

The UniProtKB/Swiss-Prot knowledgebase and its Plant Proteome Annotation Program.通用蛋白质资源知识库/瑞士蛋白质数据库及其植物蛋白质组注释计划。

J Proteomics. 2009 Apr 13;72(3):567-73. doi: 10.1016/j.jprot.2008.11.010. Epub 2008 Nov 24.

P3DB: a plant protein phosphorylation database.P3DB：一个植物蛋白质磷酸化数据库。

Nucleic Acids Res. 2009 Jan;37(Database issue):D960-2. doi: 10.1093/nar/gkn733. Epub 2008 Oct 17.

PHOSIDA (phosphorylation site database): management, structural and evolutionary investigation, and prediction of phosphosites.PHOSIDA（磷酸化位点数据库）：磷酸化位点的管理、结构与进化研究以及预测

Genome Biol. 2007;8(11):R250. doi: 10.1186/gb-2007-8-11-r250.

PhosPhAt: a database of phosphorylation sites in Arabidopsis thaliana and a plant-specific phosphorylation site predictor.PhosPhAt：拟南芥磷酸化位点数据库及植物特异性磷酸化位点预测工具

Nucleic Acids Res. 2008 Jan;36(Database issue):D1015-21. doi: 10.1093/nar/gkm812. Epub 2007 Nov 4.

Phospho.ELM: a database of phosphorylation sites--update 2008.磷酸化位点数据库Phospho.ELM：2008年更新版

Nucleic Acids Res. 2008 Jan;36(Database issue):D240-4. doi: 10.1093/nar/gkm772. Epub 2007 Oct 25.

Recent progress in protein subcellular location prediction.蛋白质亚细胞定位预测的最新进展。

Anal Biochem. 2007 Nov 1;370(1):1-16. doi: 10.1016/j.ab.2007.07.006. Epub 2007 Jul 12.

KinasePhos 2.0: a web server for identifying protein kinase-specific phosphorylation sites based on sequences and coupling patterns.激酶磷酸化位点预测工具2.0：一个基于序列和偶联模式识别蛋白激酶特异性磷酸化位点的网络服务器。

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W588-94. doi: 10.1093/nar/gkm322. Epub 2007 May 21.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

PlantPhos：使用最大依赖分解法鉴定具有底物特异性的植物磷酸化位点。

PlantPhos: using maximal dependence decomposition to identify plant phosphorylation sites with substrate site specificity.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献