• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于支持向量机的方法,利用进化信息和基序预测分枝杆菌蛋白质的亚细胞定位

Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs.

作者信息

Rashid Mamoon, Saha Sudipto, Raghava Gajendra Ps

机构信息

Bioinformatics Centre, Institute of Microbial Technology, Sector-39A, Chandigarh, India.

出版信息

BMC Bioinformatics. 2007 Sep 13;8:337. doi: 10.1186/1471-2105-8-337.

DOI:10.1186/1471-2105-8-337
PMID:17854501
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2147037/
Abstract

BACKGROUND

In past number of methods have been developed for predicting subcellular location of eukaryotic, prokaryotic (Gram-negative and Gram-positive bacteria) and human proteins but no method has been developed for mycobacterial proteins which may represent repertoire of potent immunogens of this dreaded pathogen. In this study, attempt has been made to develop method for predicting subcellular location of mycobacterial proteins.

RESULTS

The models were trained and tested on 852 mycobacterial proteins and evaluated using five-fold cross-validation technique. First SVM (Support Vector Machine) model was developed using amino acid composition and overall accuracy of 82.51% was achieved with average accuracy (mean of class-wise accuracy) of 68.47%. In order to utilize evolutionary information, a SVM model was developed using PSSM (Position-Specific Scoring Matrix) profiles obtained from PSI-BLAST (Position-Specific Iterated BLAST) and overall accuracy achieved was of 86.62% with average accuracy of 73.71%. In addition, HMM (Hidden Markov Model), MEME/MAST (Multiple Em for Motif Elicitation/Motif Alignment and Search Tool) and hybrid model that combined two or more models were also developed. We achieved maximum overall accuracy of 86.8% with average accuracy of 89.00% using combination of PSSM based SVM model and MEME/MAST. Performance of our method was compared with that of the existing methods developed for predicting subcellular locations of Gram-positive bacterial proteins.

CONCLUSION

A highly accurate method has been developed for predicting subcellular location of mycobacterial proteins. This method also predicts very important class of proteins that is membrane-attached proteins. This method will be useful in annotating newly sequenced or hypothetical mycobacterial proteins. Based on above study, a freely accessible web server TBpred http://www.imtech.res.in/raghava/tbpred/ has been developed.

摘要

背景

过去已经开发了多种方法来预测真核生物、原核生物(革兰氏阴性和革兰氏阳性细菌)以及人类蛋白质的亚细胞定位,但尚未开发出针对分枝杆菌蛋白质的预测方法,而分枝杆菌蛋白质可能代表这种可怕病原体的有效免疫原库。在本研究中,尝试开发预测分枝杆菌蛋白质亚细胞定位的方法。

结果

在852种分枝杆菌蛋白质上对模型进行训练和测试,并使用五折交叉验证技术进行评估。首先使用氨基酸组成开发了支持向量机(SVM)模型,总体准确率达到82.51%,平均准确率(类准确率的平均值)为68.47%。为了利用进化信息,使用从位置特异性迭代比对工具(PSI-BLAST)获得的位置特异性得分矩阵(PSSM)谱开发了一个SVM模型,总体准确率达到86.62%,平均准确率为73.71%。此外,还开发了隐马尔可夫模型(HMM)、多模体引出/模体比对与搜索工具(MEME/MAST)以及结合两个或更多模型的混合模型。使用基于PSSM的SVM模型和MEME/MAST的组合,我们实现了86.8%的最大总体准确率和89.00%的平均准确率。将我们方法的性能与为预测革兰氏阳性细菌蛋白质亚细胞定位而开发的现有方法进行了比较。

结论

已开发出一种高度准确的方法来预测分枝杆菌蛋白质的亚细胞定位。该方法还能预测非常重要的一类蛋白质,即膜附着蛋白。此方法将有助于注释新测序的或假设的分枝杆菌蛋白质。基于上述研究,已开发了一个可免费访问的网络服务器TBpred http://www.imtech.res.in/raghava/tbpred/ 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1807/2147037/beabd383c073/1471-2105-8-337-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1807/2147037/beabd383c073/1471-2105-8-337-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1807/2147037/beabd383c073/1471-2105-8-337-1.jpg

相似文献

1
Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs.基于支持向量机的方法,利用进化信息和基序预测分枝杆菌蛋白质的亚细胞定位
BMC Bioinformatics. 2007 Sep 13;8:337. doi: 10.1186/1471-2105-8-337.
2
PSLpred: prediction of subcellular localization of bacterial proteins.PSLpred:细菌蛋白质亚细胞定位预测
Bioinformatics. 2005 May 15;21(10):2522-4. doi: 10.1093/bioinformatics/bti309. Epub 2005 Feb 4.
3
BTXpred: prediction of bacterial toxins.BTXpred:细菌毒素预测
In Silico Biol. 2007;7(4-5):405-12.
4
Prediction of nuclear proteins using SVM and HMM models.使用支持向量机和隐马尔可夫模型预测核蛋白。
BMC Bioinformatics. 2009 Jan 19;10:22. doi: 10.1186/1471-2105-10-22.
5
ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST.ESLpred:基于支持向量机的方法,利用二肽组成和PSI-BLAST对真核蛋白质进行亚细胞定位。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W414-9. doi: 10.1093/nar/gkh350.
6
Identification of DNA-binding proteins using support vector machines and evolutionary profiles.利用支持向量机和进化谱鉴定DNA结合蛋白。
BMC Bioinformatics. 2007 Nov 27;8:463. doi: 10.1186/1471-2105-8-463.
7
Prediction of subcellular location of mycobacterial protein using feature selection techniques.利用特征选择技术预测分枝杆菌蛋白的亚细胞定位。
Mol Divers. 2010 Nov;14(4):667-71. doi: 10.1007/s11030-009-9205-1. Epub 2009 Nov 12.
8
PairProSVM: protein subcellular localization based on local pairwise profile alignment and SVM.PairProSVM:基于局部两两轮廓比对和支持向量机的蛋白质亚细胞定位
IEEE/ACM Trans Comput Biol Bioinform. 2008 Jul-Sep;5(3):416-22. doi: 10.1109/TCBB.2007.70256.
9
Use of Chou's 5-steps rule to predict the subcellular localization of gram-negative and gram-positive bacterial proteins by multi-label learning based on gene ontology annotation and profile alignment.利用 Chou 的 5 步规则,通过基于基因本体论注释和序列比对的多标签学习,预测革兰氏阴性和革兰氏阳性细菌蛋白质的亚细胞定位。
J Integr Bioinform. 2020 Jun 29;18(1):51-79. doi: 10.1515/jib-2019-0091.
10
Identification of proteins secreted by malaria parasite into erythrocyte using SVM and PSSM profiles.使用支持向量机和位置特异性得分矩阵概况鉴定疟原虫分泌到红细胞中的蛋白质。
BMC Bioinformatics. 2008 Apr 16;9:201. doi: 10.1186/1471-2105-9-201.

引用本文的文献

1
DeePhafier: a phage lifestyle classifier using a multilayer self-attention neural network combining protein information.DeePhafier:一种使用结合蛋白质信息的多层自注意力神经网络的噬菌体生活方式分类器。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae377.
2
MERITS: a web-based integrated PE/PPE protein database.优点:一个基于网络的整合型PE/PPE蛋白数据库。
Bioinform Adv. 2024 Mar 2;4(1):vbae035. doi: 10.1093/bioadv/vbae035. eCollection 2024.
3
Mechanistic Insight into the Enzymatic Inhibition of β-Amyrin against Mycobacterial Rv1636: In Silico and In Vitro Approaches.

本文引用的文献

1
BTXpred: prediction of bacterial toxins.BTXpred:细菌毒素预测
In Silico Biol. 2007;7(4-5):405-12.
2
Prediction of neurotoxins based on their function and source.基于神经毒素的功能和来源对其进行预测。
In Silico Biol. 2007;7(4-5):369-87.
3
Analysis and prediction of antibacterial peptides.抗菌肽的分析与预测
羽扇豆醇对结核分枝杆菌Rv1636酶抑制作用的机制洞察:计算机模拟和体外实验方法
Biology (Basel). 2022 Aug 12;11(8):1214. doi: 10.3390/biology11081214.
4
Potential Efficacy of -Amyrin Targeting Mycobacterial Universal Stress Protein by In Vitro and In Silico Approach.体外和计算机模拟方法靶向分枝杆菌普遍应激蛋白的 - 香树脂醇的潜在功效。
Molecules. 2022 Jul 18;27(14):4581. doi: 10.3390/molecules27144581.
5
An Update on "Reverse Vaccinology": The Pathway from Genomes and Epitope Predictions to Tailored, Recombinant Vaccines.“反向疫苗学”最新进展:从基因组和表位预测到定制的重组疫苗。
Methods Mol Biol. 2022;2412:45-71. doi: 10.1007/978-1-0716-1892-9_4.
6
Identification of genotypic variants and its proteomic mutations of Brazilian SARS-CoV-2 isolates.巴西 SARS-CoV-2 分离株的基因型变异及其蛋白质突变体的鉴定。
Virus Res. 2022 Jan 2;307:198618. doi: 10.1016/j.virusres.2021.198618. Epub 2021 Nov 3.
7
Recognition of Mitochondrial Proteins in Plasmodium Based on the Tripeptide Composition.基于三肽组成对疟原虫线粒体蛋白质的识别
Front Cell Dev Biol. 2020 Sep 16;8:578901. doi: 10.3389/fcell.2020.578901. eCollection 2020.
8
Dataset of complete genome assembly and analysis of mycobacterium tuberculosis strain SIT745/EAI1-MYS.结核分枝杆菌菌株SIT745/EAI1-MYS的全基因组组装与分析数据集。
Data Brief. 2020 Jun 30;31:105949. doi: 10.1016/j.dib.2020.105949. eCollection 2020 Aug.
9
Specific Binding Peptides from Rv3632: A Strategy for Blocking Entry to Target Cells?来自 Rv3632 的特异性结合肽:阻止进入靶细胞的策略?
Biomed Res Int. 2019 Apr 14;2019:8680935. doi: 10.1155/2019/8680935. eCollection 2019.
10
Integrated proteomics, genomics, metabolomics approaches reveal oxalic acid as pathogenicity factor in Tilletia indica inciting Karnal bunt disease of wheat.综合蛋白质组学、基因组学和代谢组学方法揭示,草酸是引起小麦印度腥黑穗病菌致病的因素。
Sci Rep. 2018 May 18;8(1):7826. doi: 10.1038/s41598-018-26257-z.
BMC Bioinformatics. 2007 Jul 23;8:263. doi: 10.1186/1471-2105-8-263.
4
MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM.MemType-2L:一个通过伪位置特异性得分矩阵整合进化信息来预测膜蛋白及其类型的网络服务器。
Biochem Biophys Res Commun. 2007 Aug 24;360(2):339-45. doi: 10.1016/j.bbrc.2007.06.027. Epub 2007 Jun 15.
5
A hybrid approach for predicting promiscuous MHC class I restricted T cell epitopes.一种预测混杂的主要组织相容性复合体I类限制性T细胞表位的混合方法。
J Biosci. 2007 Jan;32(1):31-42. doi: 10.1007/s12038-007-0004-5.
6
Euk-mPLoc: a fusion classifier for large-scale eukaryotic protein subcellular location prediction by incorporating multiple sites.Euk-mPLoc:一种通过整合多个位点进行大规模真核生物蛋白质亚细胞定位预测的融合分类器。
J Proteome Res. 2007 May;6(5):1728-34. doi: 10.1021/pr060635i. Epub 2007 Mar 31.
7
Hum-mPLoc: an ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites.Hum-mPLoc:一种通过纳入具有多个位点的样本进行大规模人类蛋白质亚细胞定位预测的集成分类器。
Biochem Biophys Res Commun. 2007 Apr 20;355(4):1006-11. doi: 10.1016/j.bbrc.2007.02.071. Epub 2007 Feb 23.
8
Gpos-PLoc: an ensemble classifier for predicting subcellular localization of Gram-positive bacterial proteins.Gpos-PLoc:一种用于预测革兰氏阳性细菌蛋白质亚细胞定位的集成分类器。
Protein Eng Des Sel. 2007 Jan;20(1):39-46. doi: 10.1093/protein/gzl053. Epub 2007 Jan 23.
9
Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction.Euk-PLoc:一种用于大规模真核生物蛋白质亚细胞定位预测的集成分类器。
Amino Acids. 2007 Jul;33(1):57-67. doi: 10.1007/s00726-006-0478-8. Epub 2007 Jan 19.
10
Large-scale predictions of gram-negative bacterial protein subcellular locations.革兰氏阴性细菌蛋白质亚细胞定位的大规模预测。
J Proteome Res. 2006 Dec;5(12):3420-8. doi: 10.1021/pr060404b.