• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过融合不同特征预测人类蛋白质中的磷酸苏氨酸位点。

Prediction of phosphothreonine sites in human proteins by fusing different features.

机构信息

Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China.

Department of Pathophysiology, Southwest Medical University, Luzhou 646000, China.

出版信息

Sci Rep. 2016 Oct 4;6:34817. doi: 10.1038/srep34817.

DOI:10.1038/srep34817
PMID:27698459
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5048138/
Abstract

Phosphorylation is one of the most important protein post-translation modifications. With the rapid development of high-throughput mass spectrometry, phosphorylation site data is rapidly accumulating, which provides us an opportunity to systematically investigate and predict phosphorylation in proteins. The phosphorylation of threonine is the addition of a phosphoryl group to its polar side chains group. In this work, we statistically analyzed the distribution of the different properties including position conservation, secondary structure, accessibility and some other physicochemical properties of the residues surrounding the phosphothreonine site and non-phosphothreonine site. We found that the distributions of those features are non-symmetrical. Based on the distribution of properties, we developed a new model by using optimal window size strategy and feature selection technique. The cross-validated results show that the area under receiver operating characteristic curve reaches to 0.847, suggesting that our model may play a complementary role to other existing methods for predicting phosphothreonine site in proteins.

摘要

磷酸化是最重要的蛋白质翻译后修饰之一。随着高通量质谱技术的快速发展,磷酸化位点数据正在迅速积累,这为我们系统地研究和预测蛋白质中的磷酸化提供了机会。苏氨酸的磷酸化是在其极性侧链基团上添加磷酸基团。在这项工作中,我们统计分析了围绕磷酸苏氨酸位点和非磷酸苏氨酸位点的残基的不同性质(包括位置保守性、二级结构、可及性和一些其他物理化学性质)的分布。我们发现这些特征的分布是不对称的。基于性质的分布,我们使用最优窗口大小策略和特征选择技术开发了一种新模型。交叉验证结果表明,接收者操作特征曲线下的面积达到 0.847,这表明我们的模型可能对其他现有方法在蛋白质中预测磷酸苏氨酸位点起到补充作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/0bf9451bd3c2/srep34817-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/f11bce829c9c/srep34817-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/b02401c2f68f/srep34817-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/5def89b4e725/srep34817-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/752fe4102a4a/srep34817-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/92173899833b/srep34817-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/9dabb0374d23/srep34817-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/0bf9451bd3c2/srep34817-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/f11bce829c9c/srep34817-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/b02401c2f68f/srep34817-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/5def89b4e725/srep34817-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/752fe4102a4a/srep34817-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/92173899833b/srep34817-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/9dabb0374d23/srep34817-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f63/5048138/0bf9451bd3c2/srep34817-f7.jpg

相似文献

1
Prediction of phosphothreonine sites in human proteins by fusing different features.通过融合不同特征预测人类蛋白质中的磷酸苏氨酸位点。
Sci Rep. 2016 Oct 4;6:34817. doi: 10.1038/srep34817.
2
iPhosT-PseAAC: Identify phosphothreonine sites by incorporating sequence statistical moments into PseAAC.iPhosT-PseAAC:通过将序列统计矩纳入伪氨基酸组成来识别磷酸苏氨酸位点。
Anal Biochem. 2018 Jun 1;550:109-116. doi: 10.1016/j.ab.2018.04.021. Epub 2018 Apr 25.
3
Using support vector machines to identify protein phosphorylation sites in viruses.使用支持向量机识别病毒中的蛋白质磷酸化位点。
J Mol Graph Model. 2015 Mar;56:84-90. doi: 10.1016/j.jmgm.2014.12.005. Epub 2014 Dec 24.
4
Identification of insulin receptor substrate 1 serine/threonine phosphorylation sites using mass spectrometry analysis: regulatory role of serine 1223.利用质谱分析鉴定胰岛素受体底物1的丝氨酸/苏氨酸磷酸化位点:丝氨酸1223的调节作用
Endocrinology. 2005 Oct;146(10):4410-6. doi: 10.1210/en.2005-0260. Epub 2005 Jul 14.
5
Characterization of serine and threonine phosphorylation sites in beta-elimination/ethanethiol addition-modified proteins by electrospray tandem mass spectrometry and database searching.通过电喷雾串联质谱法和数据库搜索对β-消除/乙硫醇加成修饰蛋白质中的丝氨酸和苏氨酸磷酸化位点进行表征。
Biochemistry. 1998 Nov 17;37(46):16211-24. doi: 10.1021/bi981264p.
6
Protein serine and threonine phosphorylation, hyperactivation and acrosome reaction in in vitro capacitated hamster spermatozoa.体外获能的仓鼠精子中的蛋白质丝氨酸和苏氨酸磷酸化、过度激活及顶体反应
Mol Reprod Dev. 2002 Sep;63(1):119-30. doi: 10.1002/mrd.10152.
7
A novel method for predicting post-translational modifications on serine and threonine sites by using site-modification network profiles.一种利用位点修饰网络图谱预测丝氨酸和苏氨酸位点翻译后修饰的新方法。
Mol Biosyst. 2015 Nov;11(11):3092-100. doi: 10.1039/c5mb00384a.
8
Prediction of protein methylation sites using conditional random field.使用条件随机场预测蛋白质甲基化位点。
Protein Pept Lett. 2013 Jan;20(1):71-7.
9
Prediction of tyrosine sulfation with mRMR feature selection and analysis.酪氨酸硫酸化的预测与 mRMR 特征选择和分析。
J Proteome Res. 2010 Dec 3;9(12):6490-7. doi: 10.1021/pr1007152. Epub 2010 Nov 11.
10
Analysis of phosphorylated peptides by double pseudoneutral loss extraction coupled with derivatization using N-(4-bromobenzoyl)aminoethanethiol.采用 N-(4-溴苯甲酰基)氨乙基硫醇衍生化结合双假性中性丢失提取分析磷酸化肽。
Anal Chem. 2009 Nov 15;81(22):9395-401. doi: 10.1021/ac9017988.

引用本文的文献

1
BBPpredict: A Web Service for Identifying Blood-Brain Barrier Penetrating Peptides.BBPpredict:一种用于识别血脑屏障穿透肽的网络服务。
Front Genet. 2022 May 17;13:845747. doi: 10.3389/fgene.2022.845747. eCollection 2022.
2
Computational identification of 4-carboxyglutamate sites to supplement physiological studies using deep learning.利用深度学习计算鉴定 4-羧基谷氨酸位点,以补充生理研究。
Sci Rep. 2022 Jan 7;12(1):128. doi: 10.1038/s41598-021-03895-4.
3
Pretraining model for biological sequence data.生物序列数据的预训练模型。

本文引用的文献

1
BP Neural Network Could Help Improve Pre-miRNA Identification in Various Species.BP神经网络有助于提高多种物种中前体微小RNA的识别。
Biomed Res Int. 2016;2016:9565689. doi: 10.1155/2016/9565689. Epub 2016 Aug 22.
2
dRHP-PseRA: detecting remote homology proteins using profile-based pseudo protein sequence and rank aggregation.dRHP-PseRA:基于轮廓的伪蛋白质序列和排序聚合检测远程同源蛋白质。
Sci Rep. 2016 Sep 1;6:32333. doi: 10.1038/srep32333.
3
Mechanism of APC/CCDC20 activation by mitotic phosphorylation.有丝分裂磷酸化激活APC/CCDC20的机制。
Brief Funct Genomics. 2021 Jun 9;20(3):181-195. doi: 10.1093/bfgp/elab025.
4
PhD7Faster 2.0: predicting clones propagating faster from the Ph.D.-7 phage display library by coupling PseAAC and tripeptide composition.PhD7Faster 2.0:通过结合伪氨基酸组成和三肽组成从Ph.D.-7噬菌体展示文库中预测增殖更快的克隆
PeerJ. 2019 Jun 17;7:e7131. doi: 10.7717/peerj.7131. eCollection 2019.
5
Large-scale in-silico statistical mutagenesis analysis sheds light on the deleteriousness landscape of the human proteome.大规模计算机统计诱变分析揭示了人类蛋白质组的有害性景观。
Sci Rep. 2018 Nov 19;8(1):16980. doi: 10.1038/s41598-018-34959-7.
6
Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework.利用综合机器学习框架中的信息特征对赖氨酸丙二酰化位点进行计算分析和预测。
Brief Bioinform. 2019 Nov 27;20(6):2185-2199. doi: 10.1093/bib/bby079.
7
A computational method for prediction of xylanase enzymes activity in strains of Bacillus subtilis based on pseudo amino acid composition features.基于伪氨基酸组成特征的枯草芽孢杆菌木聚糖酶活力预测的计算方法。
PLoS One. 2018 Oct 22;13(10):e0205796. doi: 10.1371/journal.pone.0205796. eCollection 2018.
8
HBPred: a tool to identify growth hormone-binding proteins.HBPred:一种识别生长激素结合蛋白的工具。
Int J Biol Sci. 2018 May 22;14(8):957-964. doi: 10.7150/ijbs.24174. eCollection 2018.
9
iRSpot-Pse6NC: Identifying recombination spots in by incorporating hexamer composition into general PseKNC.iRSpot-Pse6NC:通过将六聚体组成纳入通用 PseKNC 来识别 中的重组热点。
Int J Biol Sci. 2018 May 22;14(8):883-891. doi: 10.7150/ijbs.24616. eCollection 2018.
10
iRNA-3typeA: Identifying Three Types of Modification at RNA's Adenosine Sites.iRNA-3型A:鉴定RNA腺苷位点的三种修饰类型。
Mol Ther Nucleic Acids. 2018 Jun 1;11:468-474. doi: 10.1016/j.omtn.2018.03.012. Epub 2018 Mar 30.
Proc Natl Acad Sci U S A. 2016 May 10;113(19):E2570-8. doi: 10.1073/pnas.1604929113. Epub 2016 Apr 25.
4
Analysis and Identification of Aptamer-Compound Interactions with a Maximum Relevance Minimum Redundancy and Nearest Neighbor Algorithm.使用最大相关最小冗余和最近邻算法分析和鉴定适体-化合物相互作用
Biomed Res Int. 2016;2016:8351204. doi: 10.1155/2016/8351204. Epub 2016 Feb 3.
5
Identification of immunoglobulins using Chou's pseudo amino acid composition with feature selection technique.使用具有特征选择技术的周氏伪氨基酸组成鉴定免疫球蛋白。
Mol Biosyst. 2016 Apr;12(4):1269-75. doi: 10.1039/c5mb00883b. Epub 2016 Feb 17.
6
Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites.水稻磷酸化1.0:一种用于蛋白质磷酸化位点预测的新型水稻特异性支持向量机预测器。
Sci Rep. 2015 Jul 7;5:11940. doi: 10.1038/srep11940.
7
Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences.Pse-in-One:一个用于生成DNA、RNA和蛋白质序列各种伪组件模式的网络服务器。
Nucleic Acids Res. 2015 Jul 1;43(W1):W65-71. doi: 10.1093/nar/gkv458. Epub 2015 May 9.
8
iPPI-Esml: An ensemble classifier for identifying the interactions of proteins by incorporating their physicochemical properties and wavelet transforms into PseAAC.iPPI-Esml:一种通过将蛋白质的物理化学性质和小波变换纳入伪氨基酸组成来识别蛋白质相互作用的集成分类器。
J Theor Biol. 2015 Jul 21;377:47-56. doi: 10.1016/j.jtbi.2015.04.011. Epub 2015 Apr 20.
9
Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.结合周氏距离对伪氨基酸组成和主成分分析进行蛋白质远程同源性检测。
Mol Genet Genomics. 2015 Oct;290(5):1919-31. doi: 10.1007/s00438-015-1044-4. Epub 2015 Apr 21.
10
Predicting the types of J-proteins using clustered amino acids.利用聚类氨基酸预测J蛋白的类型。
Biomed Res Int. 2014;2014:935719. doi: 10.1155/2014/935719. Epub 2014 Apr 2.