NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins.

Affiliation

School of Medicine and Health Sciences, Universidad del Rosario, Carrera 24 No. 63C-69, Bogotá DC, Colombia.

Publication information

BMC Bioinformatics. 2011 Jan 14;12:21. doi: 10.1186/1471-2105-12-21.

DOI:10.1186/1471-2105-12-21
PMID:21235786
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3025837/
Abstract

BACKGROUND

Most predictive methods currently available for the identification of protein secretion mechanisms have focused on classically secreted proteins. In fact, only two methods have been reported for predicting non-classically secreted proteins of Gram-positive bacteria. This study describes the implementation of a sequence-based classifier, denoted as NClassG+, for identifying non-classically secreted Gram-positive bacterial proteins.

RESULTS

Several feature-based classifiers were trained using different sequence transformation vectors (frequencies, dipeptides, physicochemical factors and PSSM) and Support Vector Machines (SVMs) with linear, polynomial and Gaussian kernel functions. Nested k-fold cross-validation (CV) was applied to select the best models, using the inner CV loop to tune the model parameters and the outer CV loop to compute the error. Grid search was used to optimize the parameters, the kernel functions and the combinations of all possible feature vectors.
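
The nested cross-validation scheme described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the learner is a toy 1-D nearest-neighbour classifier standing in for the SVM, the "grid" is just a list of candidate neighbour counts, and the helper names (`k_folds`, `knn_predict`, `error_rate`, `nested_cv`) and the data are hypothetical. What it shows is the separation of roles: hyperparameters are chosen only on inner folds, and the error is measured only on outer folds the tuning never saw.

```python
from collections import Counter


def k_folds(n, k):
    """Yield (train_idx, test_idx) pairs for k interleaved folds over range(n)."""
    folds = [list(range(i, n, k)) for i in range(k)]
    for i, test in enumerate(folds):
        train = [j for m, fold in enumerate(folds) if m != i for j in fold]
        yield train, test


def knn_predict(train_X, train_y, x, k):
    """Toy stand-in learner (assumption): majority vote among the k nearest 1-D points."""
    nearest = sorted(range(len(train_X)), key=lambda i: abs(train_X[i] - x))[:k]
    return Counter(train_y[i] for i in nearest).most_common(1)[0][0]


def error_rate(train_X, train_y, test_X, test_y, k):
    """Fraction of test points misclassified by the toy learner."""
    wrong = sum(knn_predict(train_X, train_y, x, k) != t
                for x, t in zip(test_X, test_y))
    return wrong / len(test_X)


def nested_cv(X, y, grid, outer_k=4, inner_k=3):
    """Mean outer-fold error; the hyperparameter is tuned on inner folds only."""
    outer_errors = []
    for train, test in k_folds(len(X), outer_k):
        tr_X = [X[i] for i in train]
        tr_y = [y[i] for i in train]

        def inner_error(param):
            # Inner loop: estimate the error of one candidate value,
            # using nothing but the outer training portion.
            errs = []
            for itr, iva in k_folds(len(tr_X), inner_k):
                errs.append(error_rate(
                    [tr_X[i] for i in itr], [tr_y[i] for i in itr],
                    [tr_X[i] for i in iva], [tr_y[i] for i in iva],
                    param))
            return sum(errs) / len(errs)

        best = min(grid, key=inner_error)  # grid search over candidate values
        # Outer loop: refit on the full outer training portion, score on held-out fold.
        outer_errors.append(error_rate(
            tr_X, tr_y, [X[i] for i in test], [y[i] for i in test], best))
    return sum(outer_errors) / len(outer_errors)


# Toy data: two well-separated 1-D classes (purely illustrative).
X = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5]
y = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1]
mean_error = nested_cv(X, y, grid=[1, 3, 5])
print(mean_error)
```

Because the outer test fold plays no part in choosing the hyperparameter, the averaged outer error is not biased by the selection step, which is the reason for nesting the two loops rather than reusing a single CV for both tuning and reporting.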

CONCLUSIONS

The final model was tested against an independent set not previously seen by the model and achieved better predictive performance than SecretomeP V2.0 and SecretP V2.0 for the identification of non-classically secreted proteins. NClassG+ is freely available on the web at http://www.biolisi.unal.edu.co/web-servers/nclassgpositive/.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e34/3025837/ee72af43bfe1/1471-2105-12-21-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e34/3025837/06e33f1b6bfd/1471-2105-12-21-1.jpg
