LocateP：用于细菌蛋白质的基因组规模亚细胞定位预测工具。

LocateP: genome-scale subcellular-location predictor for bacterial proteins.

作者信息

Zhou Miaomiao, Boekhorst Jos, Francke Christof, Siezen Roland J

机构信息

Centre for Molecular and Biomolecular Informatics, Radboud University Nijmegen Medical Centre, PO Box 9101, 6500 HB Nijmegen, The Netherlands.

出版信息

BMC Bioinformatics. 2008 Mar 27;9:173. doi: 10.1186/1471-2105-9-173.

DOI:10.1186/1471-2105-9-173

PMID:18371216

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2375117/

Abstract

BACKGROUND

In the past decades, various protein subcellular-location (SCL) predictors have been developed. Most of these predictors, like TMHMM 2.0, SignalP 3.0, PrediSi and Phobius, aim at the identification of one or a few SCLs, whereas others such as CELLO and Psortb.v.2.0 aim at a broader classification. Although these tools and pipelines can achieve a high precision in the accurate prediction of signal peptides and transmembrane helices, they have a much lower accuracy when other sequence characteristics are concerned. For instance, it proved notoriously difficult to identify the fate of proteins carrying a putative type I signal peptidase (SPIase) cleavage site, as many of those proteins are retained in the cell membrane as N-terminally anchored membrane proteins. Moreover, most of the SCL classifiers are based on the classification of the Swiss-Prot database and consequently inherited the inconsistency of that SCL classification. As accurate and detailed SCL prediction on a genome scale is highly desired by experimental researchers, we decided to construct a new SCL prediction pipeline: LocateP.

RESULTS

LocateP combines many of the existing high-precision SCL identifiers with our own newly developed identifiers for specific SCLs. The LocateP pipeline was designed such that it mimics protein targeting and secretion processes. It distinguishes 7 different SCLs within Gram-positive bacteria: intracellular, multi-transmembrane, N-terminally membrane anchored, C-terminally membrane anchored, lipid-anchored, LPxTG-type cell-wall anchored, and secreted/released proteins. Moreover, it distinguishes pathways for Sec- or Tat-dependent secretion and alternative secretion of bacteriocin-like proteins. The pipeline was tested on data sets extracted from literature, including experimental proteomics studies. The tests showed that LocateP performs as well as, or even slightly better than other SCL predictors for some locations and outperforms current tools especially where the N-terminally anchored and the SPIase-cleaved secreted proteins are concerned. Overall, the accuracy of LocateP was always higher than 90%. LocateP was then used to predict the SCLs of all proteins encoded by completed Gram-positive bacterial genomes. The results are stored in the database LocateP-DB http://www.cmbi.ru.nl/locatep-db1.

CONCLUSION

LocateP is by far the most accurate and detailed protein SCL predictor for Gram-positive bacteria currently available.

摘要

背景

在过去几十年中，已经开发了各种蛋白质亚细胞定位（SCL）预测工具。这些预测工具中的大多数，如TMHMM 2.0、SignalP 3.0、PrediSi和Phobius，旨在识别一种或几种SCL，而其他工具，如CELLO和Psortb.v.2.0，则旨在进行更广泛的分类。尽管这些工具和流程在准确预测信号肽和跨膜螺旋方面可以达到很高的精度，但在涉及其他序列特征时，它们的准确性要低得多。例如，事实证明，识别携带假定的I型信号肽酶（SPIase）切割位点的蛋白质的命运非常困难，因为许多此类蛋白质作为N端锚定膜蛋白保留在细胞膜中。此外，大多数SCL分类器基于Swiss-Prot数据库的分类，因此继承了该SCL分类的不一致性。由于实验研究人员非常希望在基因组规模上进行准确而详细的SCL预测，我们决定构建一个新的SCL预测流程：LocateP。

结果

LocateP将许多现有的高精度SCL识别器与我们自己新开发的特定SCL识别器结合在一起。LocateP流程的设计模仿了蛋白质靶向和分泌过程。它区分革兰氏阳性菌中的7种不同SCL：细胞内、多跨膜、N端膜锚定、C端膜锚定、脂锚定、LPxTG型细胞壁锚定和分泌/释放蛋白。此外，它区分Sec或Tat依赖性分泌途径以及类细菌素蛋白的替代分泌途径。该流程在从文献中提取的数据集上进行了测试，包括实验蛋白质组学研究。测试表明，对于某些定位，LocateP的表现与其他SCL预测器相当，甚至略好，尤其在涉及N端锚定和SPIase切割的分泌蛋白方面，其性能优于当前工具。总体而言，LocateP的准确率始终高于90%。然后，LocateP被用于预测已完成的革兰氏阳性菌基因组编码的所有蛋白质的SCL。结果存储在数据库LocateP-DB http://www.cmbi.ru.nl/locatep-db1中。

结论

LocateP是目前可用的用于革兰氏阳性菌的最准确、最详细的蛋白质SCL预测器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8272/2375117/6c6d6afa15ef/1471-2105-9-173-1.jpg

相似文献

LocateP: genome-scale subcellular-location predictor for bacterial proteins.

BMC Bioinformatics. 2008 Mar 27;9:173. doi: 10.1186/1471-2105-9-173.

PSORTb v.2.0: expanded prediction of bacterial protein subcellular localization and insights gained from comparative proteome analysis.

Bioinformatics. 2005 Mar 1;21(5):617-23. doi: 10.1093/bioinformatics/bti057. Epub 2004 Oct 22.

Use of Chou's 5-steps rule to predict the subcellular localization of gram-negative and gram-positive bacterial proteins by multi-label learning based on gene ontology annotation and profile alignment.

J Integr Bioinform. 2020 Jun 29;18(1):51-79. doi: 10.1515/jib-2019-0091.

Hum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization.

Biochem Biophys Res Commun. 2006 Aug 18;347(1):150-7. doi: 10.1016/j.bbrc.2006.06.059. Epub 2006 Jun 21.

Subcellular localization of extracytoplasmic proteins in monoderm bacteria: rational secretomics-based strategy for genomic and proteomic analyses.

PLoS One. 2012;7(8):e42982. doi: 10.1371/journal.pone.0042982. Epub 2012 Aug 9.

Large-scale predictions of gram-negative bacterial protein subcellular locations.

J Proteome Res. 2006 Dec;5(12):3420-8. doi: 10.1021/pr060404b.

Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs.

Bioinformatics. 2003 Sep 1;19(13):1656-63. doi: 10.1093/bioinformatics/btg222.

Protein subcellular localization prediction based on compartment-specific features and structure conservation.

BMC Bioinformatics. 2007 Sep 8;8:330. doi: 10.1186/1471-2105-8-330.

SCL-Epred: a generalised de novo eukaryotic protein subcellular localisation predictor.

Amino Acids. 2013 Aug;45(2):291-9. doi: 10.1007/s00726-013-1491-3. Epub 2013 Apr 9.

pTARGET [corrected] a new method for predicting protein subcellular localization in eukaryotes.

Bioinformatics. 2005 Nov 1;21(21):3963-9. doi: 10.1093/bioinformatics/bti650. Epub 2005 Sep 6.

引用本文的文献

Glycogen-Degrading Activities of Catalytic Domains of α-Amylase and α-Amylase-Pullulanase Enzymes Conserved in spp. from the Vaginal Microbiome.

J Bacteriol. 2023 Feb 22;205(2):e0039322. doi: 10.1128/jb.00393-22. Epub 2023 Feb 6.

Evolutionary Relationships and Divergence of Gene Family Involved in Development and Stress in Cotton ( L.).

Genes (Basel). 2022 Dec 8;13(12):2313. doi: 10.3390/genes13122313.

Enterococcus faecalis V583 cell membrane protein expression to alkaline stress.

FEMS Microbiol Lett. 2022 Sep 20;369(1). doi: 10.1093/femsle/fnac082.

Genome-Scale Mining of Novel Anchor Proteins of .

Front Microbiol. 2022 Feb 4;12:677702. doi: 10.3389/fmicb.2021.677702. eCollection 2021.

The Membrane Proteome of Spores and Vegetative Cells of the Food-Borne Pathogen .

Int J Mol Sci. 2021 Nov 19;22(22):12475. doi: 10.3390/ijms222212475.

Approaching In Vivo Models of Pneumococcus-Host Interaction: Insights into Surface Proteins, Capsule Production, and Extracellular Vesicles.

Pathogens. 2021 Aug 28;10(9):1098. doi: 10.3390/pathogens10091098.

Exploring the Bile Stress Response of LM1 through Exoproteome Analysis.

Molecules. 2021 Sep 20;26(18):5695. doi: 10.3390/molecules26185695.

Three Distinct Proteases Are Responsible for Overall Cell Surface Proteolysis in Streptococcus thermophilus.

Appl Environ Microbiol. 2021 Nov 10;87(23):e0129221. doi: 10.1128/AEM.01292-21. Epub 2021 Sep 22.

Within-Host Adaptation of in a Bovine Mastitis Infection Is Associated with Increased Cytotoxicity.

Int J Mol Sci. 2021 Aug 17;22(16):8840. doi: 10.3390/ijms22168840.

Comparative Exoproteome Analysis of Human Isolates.

Microorganisms. 2021 Jun 12;9(6):1287. doi: 10.3390/microorganisms9061287.

本文引用的文献

Signal-3L: A 3-layer approach for predicting signal peptides.

Biochem Biophys Res Commun. 2007 Nov 16;363(2):297-303. doi: 10.1016/j.bbrc.2007.08.140. Epub 2007 Aug 31.

Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs.

BMC Bioinformatics. 2007 Sep 13;8:337. doi: 10.1186/1471-2105-8-337.

Multiple interactions among the competence proteins of Bacillus subtilis.

Mol Microbiol. 2007 Jul;65(2):454-64. doi: 10.1111/j.1365-2958.2007.05799.x.

Combining algorithms to predict bacterial protein sub-cellular location: Parallel versus concurrent implementations.

Bioinformation. 2006 Dec 6;1(8):285-9. doi: 10.6026/97320630001285.

Toward bacterial protein sub-cellular location prediction: single-class discrimminant models for all gram- and gram+ compartments.

Bioinformation. 2006 Dec 2;1(8):276-80. doi: 10.6026/97320630001276.

TATPred: a Bayesian method for the identification of twin arginine translocation pathway signal sequences.

Bioinformation. 2006 Jul 25;1(5):184-7. doi: 10.6026/97320630001184.

MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM.

Biochem Biophys Res Commun. 2007 Aug 24;360(2):339-45. doi: 10.1016/j.bbrc.2007.06.027. Epub 2007 Jun 15.

Accurate prediction of protein secondary structure and solvent accessibility by consensus combiners of sequence and structure information.

BMC Bioinformatics. 2007 Jun 14;8:201. doi: 10.1186/1471-2105-8-201.

Locating proteins in the cell using TargetP, SignalP and related tools.

Nat Protoc. 2007;2(4):953-71. doi: 10.1038/nprot.2007.131.

Signal-CF: a subsite-coupled and window-fusing approach for predicting signal peptides.

Biochem Biophys Res Commun. 2007 Jun 8;357(3):633-40. doi: 10.1016/j.bbrc.2007.03.162. Epub 2007 Apr 5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

LocateP：用于细菌蛋白质的基因组规模亚细胞定位预测工具。

LocateP: genome-scale subcellular-location predictor for bacterial proteins.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献