• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ngLOC:用于预测原核生物和真核生物中蛋白质亚细胞定位的软件和网络服务器。

ngLOC: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes.

作者信息

King Brian R, Vural Suleyman, Pandey Sanjit, Barteau Alex, Guda Chittibabu

机构信息

Department of Computer Science, Bucknell University, One Dent Drive, Lewisburg, PA 17837, USA.

出版信息

BMC Res Notes. 2012 Jul 10;5:351. doi: 10.1186/1756-0500-5-351.

DOI:10.1186/1756-0500-5-351
PMID:22780965
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3532370/
Abstract

BACKGROUND

Understanding protein subcellular localization is a necessary component toward understanding the overall function of a protein. Numerous computational methods have been published over the past decade, with varying degrees of success. Despite the large number of published methods in this area, only a small fraction of them are available for researchers to use in their own studies. Of those that are available, many are limited by predicting only a small number of organelles in the cell. Additionally, the majority of methods predict only a single location for a sequence, even though it is known that a large fraction of the proteins in eukaryotic species shuttle between locations to carry out their function.

FINDINGS

We present a software package and a web server for predicting the subcellular localization of protein sequences based on the ngLOC method. ngLOC is an n-gram-based Bayesian classifier that predicts subcellular localization of proteins both in prokaryotes and eukaryotes. The overall prediction accuracy varies from 89.8% to 91.4% across species. This program can predict 11 distinct locations each in plant and animal species. ngLOC also predicts 4 and 5 distinct locations on gram-positive and gram-negative bacterial datasets, respectively.

CONCLUSIONS

ngLOC is a generic method that can be trained by data from a variety of species or classes for predicting protein subcellular localization. The standalone software is freely available for academic use under GNU GPL, and the ngLOC web server is also accessible at http://ngloc.unmc.edu.

摘要

背景

了解蛋白质的亚细胞定位是理解蛋白质整体功能的必要组成部分。在过去十年中,已经发表了许多计算方法,其成功程度各不相同。尽管该领域已发表的方法众多,但可供研究人员在自己的研究中使用的却很少。在那些可用的方法中,许多方法仅限于预测细胞中的少数细胞器。此外,大多数方法只为一个序列预测单一位置,尽管已知真核生物中的很大一部分蛋白质会在不同位置之间穿梭以执行其功能。

研究结果

我们展示了一个基于ngLOC方法预测蛋白质序列亚细胞定位的软件包和一个网络服务器。ngLOC是一种基于n元语法的贝叶斯分类器,可预测原核生物和真核生物中蛋白质的亚细胞定位。跨物种的总体预测准确率在89.8%至91.4%之间。该程序可以分别预测植物和动物物种中的11个不同位置。ngLOC还分别在革兰氏阳性和革兰氏阴性细菌数据集上预测4个和5个不同位置。

结论

ngLOC是一种通用方法,可以通过来自各种物种或类别的数据进行训练,以预测蛋白质的亚细胞定位。独立软件在GNU GPL许可下可免费用于学术用途,ngLOC网络服务器也可通过http://ngloc.unmc.edu访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dd7/3532370/19c4d049a937/1756-0500-5-351-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dd7/3532370/19c4d049a937/1756-0500-5-351-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dd7/3532370/19c4d049a937/1756-0500-5-351-1.jpg

相似文献

1
ngLOC: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes.ngLOC:用于预测原核生物和真核生物中蛋白质亚细胞定位的软件和网络服务器。
BMC Res Notes. 2012 Jul 10;5:351. doi: 10.1186/1756-0500-5-351.
2
ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes.ngLOC:一种基于n元语法的贝叶斯方法,用于估计真核生物的亚细胞蛋白质组。
Genome Biol. 2007;8(5):R68. doi: 10.1186/gb-2007-8-5-r68.
3
pTARGET [corrected] a new method for predicting protein subcellular localization in eukaryotes.pTARGET [已修正] 一种预测真核生物中蛋白质亚细胞定位的新方法。
Bioinformatics. 2005 Nov 1;21(21):3963-9. doi: 10.1093/bioinformatics/bti650. Epub 2005 Sep 6.
4
Protein subcellular localization prediction of eukaryotes using a knowledge-based approach.基于知识的真核生物蛋白质亚细胞定位预测。
BMC Bioinformatics. 2009 Dec 3;10 Suppl 15(Suppl 15):S8. doi: 10.1186/1471-2105-10-S15-S8.
5
Ranking Gene Ontology terms for predicting non-classical secretory proteins in eukaryotes and prokaryotes.对真核生物和原核生物中非经典分泌蛋白进行预测的基因本体论术语排序。
J Theor Biol. 2012 Nov 7;312:105-13. doi: 10.1016/j.jtbi.2012.07.027. Epub 2012 Aug 8.
6
Using AdaBoost for the prediction of subcellular location of prokaryotic and eukaryotic proteins.使用AdaBoost预测原核生物和真核生物蛋白质的亚细胞定位。
Mol Divers. 2008 Feb;12(1):41-5. doi: 10.1007/s11030-008-9073-0. Epub 2008 May 28.
7
Support vector machine approach for protein subcellular localization prediction.用于蛋白质亚细胞定位预测的支持向量机方法
Bioinformatics. 2001 Aug;17(8):721-8. doi: 10.1093/bioinformatics/17.8.721.
8
Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms.Cell-PLoc:用于预测多种生物体中蛋白质亚细胞定位的一组网络服务器程序包。
Nat Protoc. 2008;3(2):153-62. doi: 10.1038/nprot.2007.494.
9
Large-scale predictions of gram-negative bacterial protein subcellular locations.革兰氏阴性细菌蛋白质亚细胞定位的大规模预测。
J Proteome Res. 2006 Dec;5(12):3420-8. doi: 10.1021/pr060404b.
10
A multi-label classifier for predicting the subcellular localization of gram-negative bacterial proteins with both single and multiple sites.一种用于预测具有单一位点和多个位点的革兰氏阴性细菌蛋白亚细胞定位的多标签分类器。
PLoS One. 2011;6(6):e20592. doi: 10.1371/journal.pone.0020592. Epub 2011 Jun 17.

引用本文的文献

1
Screening for genetic variability in photosynthetic regulation provides insights into salt performance traits in forage sorghum under salt stress.筛选光合作用调节中的遗传变异性可深入了解盐胁迫下饲用高粱的耐盐性能特征。
BMC Plant Biol. 2024 Jul 19;24(1):690. doi: 10.1186/s12870-024-05406-9.
2
Computational Strategies in Drug Discovery: Leveraging Subtractive Genomic Analysis for Target Identification.药物发现中的计算策略:利用减法基因组分析进行靶点识别。
Curr Pharm Biotechnol. 2025;26(9):1271-1286. doi: 10.2174/0113892010295441240515074348.
3
A Review for Artificial Intelligence Based Protein Subcellular Localization.

本文引用的文献

1
WegoLoc: accurate prediction of protein subcellular localization using weighted Gene Ontology terms.WegoLoc:使用加权基因本体论术语准确预测蛋白质亚细胞定位
Bioinformatics. 2012 Apr 1;28(7):1028-30. doi: 10.1093/bioinformatics/bts062. Epub 2012 Jan 31.
2
N-gram analysis of 970 microbial organisms reveals presence of biological language models.对 970 种微生物的 N 元组分析揭示了生物语言模型的存在。
BMC Bioinformatics. 2011 Jan 10;12:12. doi: 10.1186/1471-2105-12-12.
3
Prediction of subcellular locations of proteins: where to proceed?
基于人工智能的蛋白质亚细胞定位研究综述
Biomolecules. 2024 Mar 27;14(4):409. doi: 10.3390/biom14040409.
4
Proteome Exploration of To Identify Novel Therapeutics: a Hierarchical Subtractive Genomics and Reverse Vaccinology Approach.蛋白质组学探索:一种分层消减基因组学和反向疫苗学方法,用于鉴定新型治疗方法。
Microbiol Spectr. 2022 Aug 31;10(4):e0037322. doi: 10.1128/spectrum.00373-22. Epub 2022 Jul 12.
5
Evolutionarily stable gene clusters shed light on the common grounds of pathogenicity in the Acinetobacter calcoaceticus-baumannii complex.进化稳定的基因簇揭示了鲍曼不动杆菌-醋酸钙不动杆菌复合体在致病性方面的共同基础。
PLoS Genet. 2022 Jun 2;18(6):e1010020. doi: 10.1371/journal.pgen.1010020. eCollection 2022 Jun.
6
Membrane Proteins and Proteomics of Cronobacter sakazakii Cells: Reliable Method for Identification and Subcellular Localization.蜡样芽胞杆菌细胞膜蛋白与蛋白组学研究:鉴定和亚细胞定位的可靠方法。
Appl Environ Microbiol. 2022 May 10;88(9):e0250821. doi: 10.1128/aem.02508-21. Epub 2022 Apr 18.
7
Two Piwis with Ago-like functions silence somatic genes at the chromatin level.两个具有 Ago 样功能的 Piwi 蛋白在染色质水平上沉默体细胞基因。
RNA Biol. 2021 Nov 12;18(sup2):757-769. doi: 10.1080/15476286.2021.1991114. Epub 2021 Oct 18.
8
Ensemble of Multiple Classifiers for Multilabel Classification of Plant Protein Subcellular Localization.用于植物蛋白质亚细胞定位多标签分类的多个分类器集成
Life (Basel). 2021 Mar 30;11(4):293. doi: 10.3390/life11040293.
9
Tip60 Phosphorylation at Ser 99 Is Essential for Autophagy Induction in .Tip60 丝氨酸 99 位的磷酸化对于. 中的自噬诱导是必不可少的。
Int J Mol Sci. 2020 Sep 20;21(18):6893. doi: 10.3390/ijms21186893.
10
A Genome-Scale Metabolic Model of CCMP 1335 for a Systems-Level Understanding of Its Metabolism and Biotechnological Potential.用于系统层面理解其代谢和生物技术潜力的CCMP 1335基因组规模代谢模型。
Microorganisms. 2020 Sep 11;8(9):1396. doi: 10.3390/microorganisms8091396.
蛋白质亚细胞定位预测:路在何方?
Proteomics. 2010 Nov;10(22):3970-83. doi: 10.1002/pmic.201000274. Epub 2010 Nov 2.
4
SherLoc2: a high-accuracy hybrid method for predicting subcellular localization of proteins.SherLoc2:一种高精度的蛋白质亚细胞定位混合预测方法。
J Proteome Res. 2009 Nov;8(11):5363-6. doi: 10.1021/pr900665y.
5
Protein subcellular localization prediction using artificial intelligence technology.利用人工智能技术进行蛋白质亚细胞定位预测。
Methods Mol Biol. 2008;484:435-63. doi: 10.1007/978-1-59745-398-1_27.
6
WoLF PSORT: protein localization predictor.WoLF PSORT:蛋白质定位预测工具。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W585-7. doi: 10.1093/nar/gkm259. Epub 2007 May 21.
7
ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes.ngLOC:一种基于n元语法的贝叶斯方法,用于估计真核生物的亚细胞蛋白质组。
Genome Biol. 2007;8(5):R68. doi: 10.1186/gb-2007-8-5-r68.
8
MultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition.MultiLoc:利用N端靶向序列、序列基序和氨基酸组成预测蛋白质亚细胞定位
Bioinformatics. 2006 May 15;22(10):1158-65. doi: 10.1093/bioinformatics/btl002. Epub 2006 Jan 20.