• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用局部组成识别甘露糖相互作用残基。

Identification of mannose interacting residues using local composition.

机构信息

Institute of Microbial Technology, Chandigarh, India.

出版信息

PLoS One. 2011;6(9):e24039. doi: 10.1371/journal.pone.0024039. Epub 2011 Sep 13.

DOI:10.1371/journal.pone.0024039
PMID:21931639
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3172211/
Abstract

BACKGROUND

Mannose binding proteins (MBPs) play a vital role in several biological functions such as defense mechanisms. These proteins bind to mannose on the surface of a wide range of pathogens and help in eliminating these pathogens from our body. Thus, it is important to identify mannose interacting residues (MIRs) in order to understand mechanism of recognition of pathogens by MBPs.

RESULTS

This paper describes modules developed for predicting MIRs in a protein. Support vector machine (SVM) based models have been developed on 120 mannose binding protein chains, where no two chains have more than 25% sequence similarity. SVM models were developed on two types of datasets: 1) main dataset consists of 1029 mannose interacting and 1029 non-interacting residues, 2) realistic dataset consists of 1029 mannose interacting and 10320 non-interacting residues. In this study, firstly, we developed standard modules using binary and PSSM profile of patterns and got maximum MCC around 0.32. Secondly, we developed SVM modules using composition profile of patterns and achieved maximum MCC around 0.74 with accuracy 86.64% on main dataset. Thirdly, we developed a model on a realistic dataset and achieved maximum MCC of 0.62 with accuracy 93.08%. Based on this study, a standalone program and web server have been developed for predicting mannose interacting residues in proteins (http://www.imtech.res.in/raghava/premier/).

CONCLUSIONS

Compositional analysis of mannose interacting and non-interacting residues shows that certain types of residues are preferred in mannose interaction. It was also observed that residues around mannose interacting residues have a preference for certain types of residues. Composition of patterns/peptide/segment has been used for predicting MIRs and achieved reasonable high accuracy. It is possible that this novel strategy may be effective to predict other types of interacting residues. This study will be useful in annotating the function of protein as well as in understanding the role of mannose in the immune system.

摘要

背景

甘露糖结合蛋白(MBP)在多种生物学功能中发挥着重要作用,如防御机制。这些蛋白质与多种病原体表面的甘露糖结合,有助于清除体内的这些病原体。因此,识别甘露糖相互作用残基(MIR)对于了解 MBP 识别病原体的机制非常重要。

结果

本文描述了用于预测蛋白质中 MIR 的模块。基于支持向量机(SVM)的模型已经在 120 个甘露糖结合蛋白链上开发,其中没有两个链具有超过 25%的序列相似性。SVM 模型是在两种类型的数据集上开发的:1)主要数据集由 1029 个甘露糖相互作用残基和 1029 个非相互作用残基组成,2)实际数据集由 1029 个甘露糖相互作用残基和 10320 个非相互作用残基组成。在这项研究中,首先,我们使用二元和 PSSM 模式的图案以及最大 MCC 约为 0.32 来开发标准模块。其次,我们使用模式的组成图案来开发 SVM 模块,在主要数据集上实现了最大 MCC 约 0.74,准确率为 86.64%。第三,我们在实际数据集上开发了一个模型,最大 MCC 为 0.62,准确率为 93.08%。基于这项研究,我们开发了一个用于预测蛋白质中甘露糖相互作用残基的独立程序和网络服务器(http://www.imtech.res.in/raghava/premier/)。

结论

甘露糖相互作用和非相互作用残基的组成分析表明,某些类型的残基在甘露糖相互作用中是优先的。还观察到,甘露糖相互作用残基周围的残基对某些类型的残基有偏好。模式/肽/段的组成已被用于预测 MIR,并取得了相当高的准确性。这种新策略可能对预测其他类型的相互作用残基有效。这项研究将有助于注释蛋白质的功能以及理解甘露糖在免疫系统中的作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/4d79481c6686/pone.0024039.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/3e01cc700acf/pone.0024039.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/3c3b98b53547/pone.0024039.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/ab3e80aa9925/pone.0024039.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/66a38be8040b/pone.0024039.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/a8146dddf599/pone.0024039.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/e20d9db1b888/pone.0024039.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/4d79481c6686/pone.0024039.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/3e01cc700acf/pone.0024039.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/3c3b98b53547/pone.0024039.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/ab3e80aa9925/pone.0024039.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/66a38be8040b/pone.0024039.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/a8146dddf599/pone.0024039.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/e20d9db1b888/pone.0024039.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7547/3172211/4d79481c6686/pone.0024039.g007.jpg

相似文献

1
Identification of mannose interacting residues using local composition.使用局部组成识别甘露糖相互作用残基。
PLoS One. 2011;6(9):e24039. doi: 10.1371/journal.pone.0024039. Epub 2011 Sep 13.
2
Prediction of GTP interacting residues, dipeptides and tripeptides in a protein from its evolutionary information.从蛋白质的进化信息预测其 GTP 相互作用残基、二肽和三肽。
BMC Bioinformatics. 2010 Jun 3;11:301. doi: 10.1186/1471-2105-11-301.
3
Identification of ATP binding residues of a protein from its primary sequence.从蛋白质的一级序列鉴定其 ATP 结合残基。
BMC Bioinformatics. 2009 Dec 19;10:434. doi: 10.1186/1471-2105-10-434.
4
SVM based prediction of RNA-binding proteins using binding residues and evolutionary information.基于支持向量机的 RNA 结合蛋白结合残基和进化信息预测。
J Mol Recognit. 2011 Mar-Apr;24(2):303-13. doi: 10.1002/jmr.1061.
5
Identification of NAD interacting residues in proteins.鉴定蛋白质中与 NAD 相互作用的残基。
BMC Bioinformatics. 2010 Mar 30;11:160. doi: 10.1186/1471-2105-11-160.
6
Prediction of vitamin interacting residues in a vitamin binding protein using evolutionary information.利用进化信息预测维生素结合蛋白中的维生素相互作用残基。
BMC Bioinformatics. 2013 Feb 7;14:44. doi: 10.1186/1471-2105-14-44.
7
Prediction of RNA binding sites in a protein using SVM and PSSM profile.使用支持向量机和位置特异性得分矩阵预测蛋白质中的RNA结合位点。
Proteins. 2008 Apr;71(1):189-94. doi: 10.1002/prot.21677.
8
Prediction of nuclear proteins using SVM and HMM models.使用支持向量机和隐马尔可夫模型预测核蛋白。
BMC Bioinformatics. 2009 Jan 19;10:22. doi: 10.1186/1471-2105-10-22.
9
Predicting sub-cellular localization of tRNA synthetases from their primary structures.从一级结构预测 tRNA 合成酶的亚细胞定位。
Amino Acids. 2012 May;42(5):1703-13. doi: 10.1007/s00726-011-0872-8. Epub 2011 Mar 13.
10
GlycoPP: a webserver for prediction of N- and O-glycosites in prokaryotic protein sequences.GlycoPP:一个用于预测原核蛋白序列中 N-和 O-糖基化位点的网络服务器。
PLoS One. 2012;7(7):e40155. doi: 10.1371/journal.pone.0040155. Epub 2012 Jul 9.

引用本文的文献

1
A hybrid approach for predicting transcription factors.一种预测转录因子的混合方法。
Front Bioinform. 2024 Jul 25;4:1425419. doi: 10.3389/fbinf.2024.1425419. eCollection 2024.
2
NeuroPIpred: a tool to predict, design and scan insect neuropeptides.神经肽预测:一种用于预测、设计和扫描昆虫神经肽的工具。
Sci Rep. 2019 Mar 26;9(1):5129. doi: 10.1038/s41598-019-41538-x.
3
Harnessing the evolutionary information on oxygen binding proteins through Support Vector Machines based modules.通过基于支持向量机的模块利用氧结合蛋白的进化信息。

本文引用的文献

1
Identification of conformational B-cell Epitopes in an antigen from its primary sequence.从抗原的一级序列中鉴定其构象性B细胞表位。
Immunome Res. 2010 Oct 20;6:6. doi: 10.1186/1745-7580-6-6.
2
A simple approach for predicting protein-protein interactions.一种预测蛋白质-蛋白质相互作用的简单方法。
Curr Protein Pept Sci. 2010 Nov;11(7):589-600. doi: 10.2174/138920310794109120.
3
PROCARB: A Database of Known and Modelled Carbohydrate-Binding Protein Structures with Sequence-Based Prediction Tools.PROCARB:一个包含已知和建模的碳水化合物结合蛋白结构以及基于序列的预测工具的数据库。
BMC Res Notes. 2018 May 11;11(1):290. doi: 10.1186/s13104-018-3383-9.
4
BacHbpred: Support Vector Machine Methods for the Prediction of Bacterial Hemoglobin-Like Proteins.BacHbpred:用于预测细菌类血红蛋白蛋白的支持向量机方法。
Adv Bioinformatics. 2016;2016:8150784. doi: 10.1155/2016/8150784. Epub 2016 Feb 29.
5
A web server for analysis, comparison and prediction of protein ligand binding sites.一个用于分析、比较和预测蛋白质配体结合位点的网络服务器。
Biol Direct. 2016 Mar 25;11(1):14. doi: 10.1186/s13062-016-0118-5.
6
Prediction of membrane transport proteins and their substrate specificities using primary sequence information.利用一级序列信息预测膜转运蛋白及其底物特异性。
PLoS One. 2014 Jun 26;9(6):e100278. doi: 10.1371/journal.pone.0100278. eCollection 2014.
7
Community-based network study of protein-carbohydrate interactions in plant lectins using glycan array data.基于社区网络的利用聚糖阵列数据对植物凝集素中蛋白质-碳水化合物相互作用的研究。
PLoS One. 2014 Apr 22;9(4):e95480. doi: 10.1371/journal.pone.0095480. eCollection 2014.
8
Improved method for linear B-cell epitope prediction using antigen's primary sequence.利用抗原一级序列预测线性 B 细胞表位的改良方法。
PLoS One. 2013 May 7;8(5):e62216. doi: 10.1371/journal.pone.0062216. Print 2013.
9
Prediction of vitamin interacting residues in a vitamin binding protein using evolutionary information.利用进化信息预测维生素结合蛋白中的维生素相互作用残基。
BMC Bioinformatics. 2013 Feb 7;14:44. doi: 10.1186/1471-2105-14-44.
10
GlycoPP: a webserver for prediction of N- and O-glycosites in prokaryotic protein sequences.GlycoPP:一个用于预测原核蛋白序列中 N-和 O-糖基化位点的网络服务器。
PLoS One. 2012;7(7):e40155. doi: 10.1371/journal.pone.0040155. Epub 2012 Jul 9.
Adv Bioinformatics. 2010;2010:436036. doi: 10.1155/2010/436036. Epub 2010 Jun 29.
4
Prediction of FAD interacting residues in a protein from its primary sequence using evolutionary information.利用进化信息从蛋白质的一级序列预测 FAD 相互作用残基。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S48. doi: 10.1186/1471-2105-11-S1-S48.
5
Identification of ATP binding residues of a protein from its primary sequence.从蛋白质的一级序列鉴定其 ATP 结合残基。
BMC Bioinformatics. 2009 Dec 19;10:434. doi: 10.1186/1471-2105-10-434.
6
InCa-SiteFinder: a method for structure-based prediction of inositol and carbohydrate binding sites on proteins.InCa-位点发现者:一种基于结构预测蛋白质上肌醇和碳水化合物结合位点的方法。
J Mol Graph Model. 2009 Oct;28(3):297-303. doi: 10.1016/j.jmgm.2009.08.009. Epub 2009 Aug 27.
7
Prediction of protein-glucose binding sites using support vector machines.使用支持向量机预测蛋白质-葡萄糖结合位点。
Proteins. 2009 Oct;77(1):121-32. doi: 10.1002/prot.22424.
8
ESLpred2: improved method for predicting subcellular localization of eukaryotic proteins.ESLpred2:预测真核生物蛋白质亚细胞定位的改进方法。
BMC Bioinformatics. 2008 Nov 28;9:503. doi: 10.1186/1471-2105-9-503.
9
COPid: composition based protein identification.COPid:基于成分的蛋白质鉴定
In Silico Biol. 2008;8(2):121-8.
10
SuperSite: dictionary of metabolite and drug binding sites in proteins.超级位点:蛋白质中代谢物和药物结合位点词典
Nucleic Acids Res. 2009 Jan;37(Database issue):D195-200. doi: 10.1093/nar/gkn618. Epub 2008 Oct 8.