mPLR-Loc: an adaptive decision multi-label classifier based on penalized logistic regression for protein subcellular localization prediction.

Authors

Wan Shibiao, Mak Man-Wai, Kung Sun-Yuan

Affiliations

Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong SAR, China.

Publication information

Anal Biochem. 2015 Mar 15;473:14-27. doi: 10.1016/j.ab.2014.10.014. Epub 2014 Oct 31.

DOI: 10.1016/j.ab.2014.10.014
PMID: 25449328

Abstract

Proteins located in appropriate cellular compartments are of paramount importance to exert their biological functions. Prediction of protein subcellular localization by computational methods is required in the post-genomic era. Recent studies have been focusing on predicting not only single-location proteins but also multi-location proteins. However, most of the existing predictors are far from effective for tackling the challenges of multi-label proteins. This article proposes an efficient multi-label predictor, namely mPLR-Loc, based on penalized logistic regression and adaptive decisions for predicting both single- and multi-location proteins. Specifically, for each query protein, mPLR-Loc exploits the information from the Gene Ontology (GO) database by using its accession number (AC) or the ACs of its homologs obtained via BLAST. The frequencies of GO occurrences are used to construct feature vectors, which are then classified by an adaptive decision-based multi-label penalized logistic regression classifier. Experimental results based on two recent stringent benchmark datasets (virus and plant) show that mPLR-Loc remarkably outperforms existing state-of-the-art multi-label predictors. In addition to being able to rapidly and accurately predict subcellular localization of single- and multi-label proteins, mPLR-Loc can also provide probabilistic confidence scores for the prediction decisions. For readers' convenience, the mPLR-Loc server is available online (http://bioinfo.eie.polyu.edu.hk/mPLRLocServer).
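
The abstract describes the pipeline only at a high level: GO term frequencies become feature vectors, which a multi-label penalized logistic regression classifier then scores before an adaptive decision step. The following is a minimal Python sketch of that idea, not the authors' implementation. Several parts are assumptions made for illustration: the L2 penalty, training by plain gradient descent, the one-vs-rest setup, the specific decision rule (keep every location whose score is at least `min(0.5, theta * top score)`), the parameter names `penalty` and `theta`, and the placeholder GO terms. Retrieval of GO terms via accession numbers or BLAST homologs is not shown.

```python
import math
from collections import Counter


def go_frequency_vector(go_terms, vocabulary):
    """Turn one protein's GO annotations into a vector of term frequencies."""
    counts = Counter(go_terms)
    return [float(counts[term]) for term in vocabulary]


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def score(model, x):
    """Probability-like score of one location for feature vector x."""
    w, b = model
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


def train_plr(X, y, penalty=0.1, lr=0.1, epochs=500):
    """Fit one L2-penalized logistic regression by batch gradient descent."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w = [penalty * wj for wj in w]  # gradient of (penalty / 2) * ||w||^2
        grad_b = 0.0
        for xi, yi in zip(X, y):
            err = score((w, b), xi) - yi
            for j in range(d):
                grad_w[j] += err * xi[j] / n
            grad_b += err / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def train_multilabel(X, label_sets, n_locations, **kwargs):
    """Train one binary penalized classifier per location (one-vs-rest)."""
    models = []
    for m in range(n_locations):
        y = [1.0 if m in labels else 0.0 for labels in label_sets]
        models.append(train_plr(X, y, **kwargs))
    return models


def adaptive_decision(scores, theta=0.5):
    """Assumed rule: keep locations scoring at least min(0.5, theta * top score)."""
    threshold = min(0.5, theta * max(scores))
    return [m for m, s in enumerate(scores) if s >= threshold]


# Toy data: placeholder GO terms and two locations (0 and 1), not real annotations.
vocabulary = ["GO:a", "GO:b", "GO:c", "GO:d"]
proteins = [
    ["GO:a", "GO:a", "GO:b"],
    ["GO:a"],
    ["GO:c", "GO:c"],
    ["GO:c", "GO:d"],
    ["GO:a", "GO:c"],
    ["GO:a", "GO:a", "GO:c", "GO:c"],
]
label_sets = [{0}, {0}, {1}, {1}, {0, 1}, {0, 1}]

X = [go_frequency_vector(p, vocabulary) for p in proteins]
models = train_multilabel(X, label_sets, n_locations=2)

query = go_frequency_vector(["GO:a", "GO:a"], vocabulary)
scores = [score(m, query) for m in models]
print("scores:", scores, "predicted locations:", adaptive_decision(scores))
```

Because the threshold in this sketch never exceeds the top score, at least one location is always returned, and the sigmoid outputs double as the kind of probabilistic confidence score the abstract mentions. A smaller `theta` admits more locations per protein; `theta = 1.0` keeps only the top-scoring location whenever no score reaches 0.5.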

Similar articles

1. mPLR-Loc: an adaptive decision multi-label classifier based on penalized logistic regression for protein subcellular localization prediction.
   Anal Biochem. 2015 Mar 15;473:14-27. doi: 10.1016/j.ab.2014.10.014. Epub 2014 Oct 31.
2. HybridGO-Loc: mining hybrid features on gene ontology for predicting subcellular localization of multi-location proteins.
   PLoS One. 2014 Mar 19;9(3):e89545. doi: 10.1371/journal.pone.0089545. eCollection 2014.
3. mGOASVM: Multi-label protein subcellular localization based on gene ontology and support vector machines.
   BMC Bioinformatics. 2012 Nov 6;13:290. doi: 10.1186/1471-2105-13-290.
4. Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins.
   BMC Bioinformatics. 2016 Feb 24;17:97. doi: 10.1186/s12859-016-0940-x.
5. R3P-Loc: a compact multi-label predictor using ridge regression and random projection for protein subcellular localization.
   J Theor Biol. 2014 Nov 7;360:34-45. doi: 10.1016/j.jtbi.2014.06.031. Epub 2014 Jul 2.
6. mLASSO-Hum: A LASSO-based interpretable human-protein subcellular localization predictor.
   J Theor Biol. 2015 Oct 7;382:223-34. doi: 10.1016/j.jtbi.2015.06.042. Epub 2015 Jul 9.
7. Multi-location gram-positive and gram-negative bacterial protein subcellular localization using gene ontology and multi-label classifier ensemble.
   BMC Bioinformatics. 2015;16 Suppl 12(Suppl 12):S1. doi: 10.1186/1471-2105-16-S12-S1. Epub 2015 Aug 25.
8. Mem-ADSVM: A two-layer multi-label predictor for identifying multi-functional types of membrane proteins.
   J Theor Biol. 2016 Jun 7;398:32-42. doi: 10.1016/j.jtbi.2016.03.013. Epub 2016 Mar 19.
9. Ensemble Linear Neighborhood Propagation for Predicting Subchloroplast Localization of Multi-Location Proteins.
   J Proteome Res. 2016 Dec 2;15(12):4755-4762. doi: 10.1021/acs.jproteome.6b00686. Epub 2016 Nov 3.
10. pLoc-mPlant: predict subcellular localization of multi-location plant proteins by incorporating the optimal GO information into general PseAAC.
    Mol Biosyst. 2017 Aug 22;13(9):1722-1727. doi: 10.1039/c7mb00267j.

Cited by

1. A Comprehensive Review on RNA Subcellular Localization Prediction.
   ArXiv. 2025 Apr 24:arXiv:2504.17162v1.
2. Protein subcellular localization prediction tools.
   Comput Struct Biotechnol J. 2024 Apr 15;23:1796-1807. doi: 10.1016/j.csbj.2024.04.032. eCollection 2024 Dec.
3. A Review for Artificial Intelligence Based Protein Subcellular Localization.
   Biomolecules. 2024 Mar 27;14(4):409. doi: 10.3390/biom14040409.
4. A novel deep learning-assisted hybrid network for plasmodium falciparum parasite mitochondrial proteins classification.
   PLoS One. 2022 Oct 6;17(10):e0275195. doi: 10.1371/journal.pone.0275195. eCollection 2022.
5. BERT-m7G: A Transformer Architecture Based on BERT and Stacking Ensemble to Identify RNA N7-Methylguanosine Sites from Sequence Information.
   Comput Math Methods Med. 2021 Aug 25;2021:7764764. doi: 10.1155/2021/7764764. eCollection 2021.
6. HumDLoc: Human Protein Subcellular Localization Prediction Using Deep Neural Network.
   Curr Genomics. 2020 Nov;21(7):546-557. doi: 10.2174/1389202921999200528160534.
7. Recognition of Mitochondrial Proteins in Plasmodium Based on the Tripeptide Composition.
   Front Cell Dev Biol. 2020 Sep 16;8:578901. doi: 10.3389/fcell.2020.578901. eCollection 2020.
8. Metabolic pathway inference using multi-label classification with rich pathway features.
   PLoS Comput Biol. 2020 Oct 1;16(10):e1008174. doi: 10.1371/journal.pcbi.1008174. eCollection 2020 Oct.
9. Use of Chou's 5-steps rule to predict the subcellular localization of gram-negative and gram-positive bacterial proteins by multi-label learning based on gene ontology annotation and profile alignment.
   J Integr Bioinform. 2020 Jun 29;18(1):51-79. doi: 10.1515/jib-2019-0091.
10. Subcellular location prediction of apoptosis proteins using two novel feature extraction methods based on evolutionary information and LDA.
    BMC Bioinformatics. 2020 May 24;21(1):212. doi: 10.1186/s12859-020-3539-1.