• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BaCelLo:一种平衡的亚细胞定位预测器。

BaCelLo: a balanced subcellular localization predictor.

作者信息

Pierleoni Andrea, Martelli Pier Luigi, Fariselli Piero, Casadio Rita

机构信息

Biocomputing Group, Dept. of Biology University of Bologna, via Irnerio 42, 40126 Bologna, Italy.

出版信息

Bioinformatics. 2006 Jul 15;22(14):e408-16. doi: 10.1093/bioinformatics/btl222.

DOI:10.1093/bioinformatics/btl222
PMID:16873501
Abstract

MOTIVATION

The knowledge of the subcellular localization of a protein is fundamental for elucidating its function. It is difficult to determine the subcellular location for eukaryotic cells with experimental high-throughput procedures. Computational procedures are then needed for annotating the subcellular location of proteins in large scale genomic projects.

RESULTS

BaCelLo is a predictor for five classes of subcellular localization (secretory pathway, cytoplasm, nucleus, mitochondrion and chloroplast) and it is based on different SVMs organized in a decision tree. The system exploits the information derived from the residue sequence and from the evolutionary information contained in alignment profiles. It analyzes the whole sequence composition and the compositions of both the N- and C-termini. The training set is curated in order to avoid redundancy. For the first time a balancing procedure is introduced in order to mitigate the effect of biased training sets. Three kingdom-specific predictors are implemented: for animals, plants and fungi, respectively. When distributing the proteins from animals and fungi into four classes, accuracy of BaCelLo reach 74% and 76%, respectively; a score of 67% is obtained when proteins from plants are distributed into five classes. BaCelLo outperforms the other presently available methods for the same task and gives more balanced accuracy and coverage values for each class. We also predict the subcellular localization of five whole proteomes, Homo sapiens, Mus musculus, Caenorhabditis elegans, Saccharomyces cerevisiae and Arabidopsis thaliana, comparing the protein content in each different compartment.

AVAILABILITY

BaCelLo can be accessed at http://www.biocomp.unibo.it/bacello/.

摘要

动机

了解蛋白质的亚细胞定位是阐明其功能的基础。通过实验高通量方法确定真核细胞的亚细胞定位很困难。因此,在大规模基因组计划中需要计算程序来注释蛋白质的亚细胞定位。

结果

BaCelLo是一种用于预测五类亚细胞定位(分泌途径、细胞质、细胞核、线粒体和叶绿体)的预测工具,它基于组织在决策树中的不同支持向量机。该系统利用从残基序列以及比对图谱中包含的进化信息中获得的信息。它分析整个序列组成以及N端和C端的组成。训练集经过精心策划以避免冗余。首次引入了一种平衡程序以减轻有偏差训练集的影响。实现了三种特定于生物界的预测工具:分别用于动物、植物和真菌。当将动物和真菌的蛋白质分为四类时,BaCelLo的准确率分别达到74%和76%;当将植物的蛋白质分为五类时,准确率为67%。BaCelLo在相同任务上优于其他现有方法,并且为每个类别提供了更平衡的准确率和覆盖率值。我们还预测了五个完整蛋白质组(智人、小家鼠、秀丽隐杆线虫、酿酒酵母和拟南芥)的亚细胞定位,比较了每个不同区室中的蛋白质含量。

可用性

可通过http://www.biocomp.unibo.it/bacello/访问BaCelLo。

相似文献

1
BaCelLo: a balanced subcellular localization predictor.BaCelLo:一种平衡的亚细胞定位预测器。
Bioinformatics. 2006 Jul 15;22(14):e408-16. doi: 10.1093/bioinformatics/btl222.
2
Hum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization.Hum-PLoc:一种用于预测人类蛋白质亚细胞定位的新型集成分类器。
Biochem Biophys Res Commun. 2006 Aug 18;347(1):150-7. doi: 10.1016/j.bbrc.2006.06.059. Epub 2006 Jun 21.
3
Implicit motif distribution based hybrid computational kernel for sequence classification.基于隐式基序分布的混合计算内核用于序列分类。
Bioinformatics. 2005 Apr 15;21(8):1429-36. doi: 10.1093/bioinformatics/bti212. Epub 2004 Dec 14.
4
Prediction of subcellular protein localization based on functional domain composition.基于功能域组成预测亚细胞蛋白质定位
Biochem Biophys Res Commun. 2007 Jun 1;357(2):366-70. doi: 10.1016/j.bbrc.2007.03.139. Epub 2007 Apr 2.
5
MultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition.MultiLoc:利用N端靶向序列、序列基序和氨基酸组成预测蛋白质亚细胞定位
Bioinformatics. 2006 May 15;22(10):1158-65. doi: 10.1093/bioinformatics/btl002. Epub 2006 Jan 20.
6
SherLoc: high-accuracy prediction of protein subcellular localization by integrating text and protein sequence data.SherLoc:通过整合文本和蛋白质序列数据对蛋白质亚细胞定位进行高精度预测。
Bioinformatics. 2007 Jun 1;23(11):1410-7. doi: 10.1093/bioinformatics/btm115. Epub 2007 Mar 28.
7
Predicting protein stability changes from sequences using support vector machines.使用支持向量机从序列预测蛋白质稳定性变化。
Bioinformatics. 2005 Sep 1;21 Suppl 2:ii54-8. doi: 10.1093/bioinformatics/bti1109.
8
Prediction of subcellular localization using sequence-biased recurrent networks.使用序列偏向递归网络预测亚细胞定位。
Bioinformatics. 2005 May 15;21(10):2279-86. doi: 10.1093/bioinformatics/bti372. Epub 2005 Mar 3.
9
Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms.Cell-PLoc:用于预测多种生物体中蛋白质亚细胞定位的一组网络服务器程序包。
Nat Protoc. 2008;3(2):153-62. doi: 10.1038/nprot.2007.494.
10
pTARGET [corrected] a new method for predicting protein subcellular localization in eukaryotes.pTARGET [已修正] 一种预测真核生物中蛋白质亚细胞定位的新方法。
Bioinformatics. 2005 Nov 1;21(21):3963-9. doi: 10.1093/bioinformatics/bti650. Epub 2005 Sep 6.

引用本文的文献

1
Genome-Wide Analysis of the Maize LBD Gene Family Reveals a Role for in the Development of Lateral Roots.玉米 LBD 基因家族的全基因组分析揭示了其在侧根发育中的作用。
Plants (Basel). 2025 Aug 21;14(16):2600. doi: 10.3390/plants14162600.
2
Genome-Wide Profiling of bZIP Transcription Factors and FocbZIP11's Impact on Fusarium TR4 Pathogenicity.bZIP转录因子的全基因组分析以及FocbZIP11对尖孢镰刀菌TR4致病性的影响
Int J Mol Sci. 2025 Feb 9;26(4):1452. doi: 10.3390/ijms26041452.
3
Reliability of plastid and mitochondrial localisation prediction declines rapidly with the evolutionary distance to the training set increasing.
质体和线粒体定位预测的可靠性随着与训练集的进化距离的增加而迅速下降。
PLoS Comput Biol. 2024 Nov 11;20(11):e1012575. doi: 10.1371/journal.pcbi.1012575. eCollection 2024 Nov.
4
Comprehensive analysis of transcription factor binding sites and expression profiling of rice pathogenesis related genes ().水稻病程相关基因转录因子结合位点及表达谱的综合分析()。 (注:原文括号部分内容缺失,翻译只能到此程度)
Front Plant Sci. 2024 Oct 25;15:1463147. doi: 10.3389/fpls.2024.1463147. eCollection 2024.
5
Genome-wide characterization of DNA methyltransferase family genes implies GhDMT6 improving tolerance of salt and drought on cotton.DNA甲基转移酶家族基因的全基因组特征表明GhDMT6提高了棉花对盐和干旱的耐受性。
BMC Plant Biol. 2024 Apr 23;24(1):312. doi: 10.1186/s12870-024-04985-x.
6
Genome-Wide Analysis of Gene Family Associated with Stress Responses in Cotton ( spp.).棉花(棉属)中与胁迫反应相关基因家族的全基因组分析。
Curr Issues Mol Biol. 2024 Mar 11;46(3):2278-2300. doi: 10.3390/cimb46030146.
7
Genome-wide analysis of bZIP gene family members in Pleurotus ostreatus, and potential roles of PobZIP3 in development and the heat stress response.平菇bZIP基因家族成员的全基因组分析以及PobZIP3在发育和热应激反应中的潜在作用。
Microb Biotechnol. 2024 Feb;17(2):e14413. doi: 10.1111/1751-7915.14413.
8
UMAMIT44 is a key player in glutamate export from Arabidopsis chloroplasts.UMAMIT44 是拟南芥叶绿体谷氨酸外排的关键因子。
Plant Cell. 2024 Mar 29;36(4):1119-1139. doi: 10.1093/plcell/koad310.
9
Deep Learning for Genomics: From Early Neural Nets to Modern Large Language Models.深度学习在基因组学中的应用:从早期神经网络到现代大型语言模型。
Int J Mol Sci. 2023 Nov 1;24(21):15858. doi: 10.3390/ijms242115858.
10
Genomic Analysis, Evolution and Characterization of E3 Ubiquitin Protein Ligase (TRIM) Gene Family in Common Carp ().基因组分析、进化与鲤鱼 E3 泛素蛋白连接酶(TRIM)基因家族的特征
Genes (Basel). 2023 Mar 7;14(3):667. doi: 10.3390/genes14030667.