• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于具有单一位点和多个位点的人类蛋白质亚细胞定位预测的不平衡多模态多标签学习

Imbalanced multi-modal multi-label learning for subcellular localization prediction of human proteins with both single and multiple sites.

作者信息

He Jianjun, Gu Hong, Liu Wenqi

机构信息

School of Control Science and Engineering, Dalian University of Technology, Dalian, Liaoning, China.

出版信息

PLoS One. 2012;7(6):e37155. doi: 10.1371/journal.pone.0037155. Epub 2012 Jun 8.

DOI:10.1371/journal.pone.0037155
PMID:22715364
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3371015/
Abstract

It is well known that an important step toward understanding the functions of a protein is to determine its subcellular location. Although numerous prediction algorithms have been developed, most of them typically focused on the proteins with only one location. In recent years, researchers have begun to pay attention to the subcellular localization prediction of the proteins with multiple sites. However, almost all the existing approaches have failed to take into account the correlations among the locations caused by the proteins with multiple sites, which may be the important information for improving the prediction accuracy of the proteins with multiple sites. In this paper, a new algorithm which can effectively exploit the correlations among the locations is proposed by using gaussian process model. Besides, the algorithm also can realize optimal linear combination of various feature extraction technologies and could be robust to the imbalanced data set. Experimental results on a human protein data set show that the proposed algorithm is valid and can achieve better performance than the existing approaches.

摘要

众所周知,理解蛋白质功能的一个重要步骤是确定其亚细胞定位。尽管已经开发了许多预测算法,但它们大多通常只关注单一位置的蛋白质。近年来,研究人员开始关注多位点蛋白质的亚细胞定位预测。然而,几乎所有现有的方法都未能考虑多位点蛋白质所导致的不同定位之间的相关性,而这些相关性可能是提高多位点蛋白质预测准确性的重要信息。本文提出了一种利用高斯过程模型有效利用这些定位相关性的新算法。此外,该算法还能实现各种特征提取技术的最优线性组合,并且对不平衡数据集具有鲁棒性。在一个人类蛋白质数据集上的实验结果表明,所提出的算法是有效的,并且能够比现有方法取得更好的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ad4/3371015/588e90f4492b/pone.0037155.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ad4/3371015/017add877384/pone.0037155.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ad4/3371015/588e90f4492b/pone.0037155.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ad4/3371015/017add877384/pone.0037155.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ad4/3371015/588e90f4492b/pone.0037155.g002.jpg

相似文献

1
Imbalanced multi-modal multi-label learning for subcellular localization prediction of human proteins with both single and multiple sites.用于具有单一位点和多个位点的人类蛋白质亚细胞定位预测的不平衡多模态多标签学习
PLoS One. 2012;7(6):e37155. doi: 10.1371/journal.pone.0037155. Epub 2012 Jun 8.
2
Multilabel learning for protein subcellular location prediction.多标签学习在蛋白质亚细胞定位预测中的应用。
IEEE Trans Nanobioscience. 2012 Sep;11(3):237-43. doi: 10.1109/TNB.2012.2212249.
3
Enhancing membrane protein subcellular localization prediction by parallel fusion of multi-view features.通过多视角特征的并行融合提高膜蛋白亚细胞定位预测。
IEEE Trans Nanobioscience. 2012 Dec;11(4):375-85. doi: 10.1109/TNB.2012.2208473. Epub 2012 Aug 3.
4
MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy.MIC_Locator:一种新颖的基于图像的蛋白质亚细胞位置多标签预测模型,基于多尺度单基因信号表示和强度编码策略。
BMC Bioinformatics. 2019 Oct 26;20(1):522. doi: 10.1186/s12859-019-3136-3.
5
Multilabel learning via random label selection for protein subcellular multilocations prediction.通过随机标签选择进行多标签学习,用于预测蛋白质亚细胞多重位置。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):436-46. doi: 10.1109/TCBB.2013.21.
6
Multi-scale deep learning for the imbalanced multi-label protein subcellular localization prediction based on immunohistochemistry images.基于免疫组化图像的不平衡多标签蛋白质亚细胞定位预测的多尺度深度学习。
Bioinformatics. 2022 Apr 28;38(9):2602-2611. doi: 10.1093/bioinformatics/btac123.
7
HPSLPred: An Ensemble Multi-Label Classifier for Human Protein Subcellular Location Prediction with Imbalanced Source.HPSLPred:一种用于人类蛋白质亚细胞定位预测的集成多标签分类器,源数据不均衡。
Proteomics. 2017 Sep;17(17-18). doi: 10.1002/pmic.201700262.
8
Protein subcellular localization prediction using multiple kernel learning based support vector machine.基于多核学习支持向量机的蛋白质亚细胞定位预测
Mol Biosyst. 2017 Mar 28;13(4):785-795. doi: 10.1039/c6mb00860g.
9
Plant-mSubP: a computational framework for the prediction of single- and multi-target protein subcellular localization using integrated machine-learning approaches.植物微小肽:一种使用集成机器学习方法预测单靶点和多靶点蛋白质亚细胞定位的计算框架。
AoB Plants. 2019 Oct 17;12(3):plz068. doi: 10.1093/aobpla/plz068. eCollection 2020 Jun.
10
Hum-mPLoc 3.0: prediction enhancement of human protein subcellular localization through modeling the hidden correlations of gene ontology and functional domain features.Hum-mPLoc 3.0:通过对基因本体和功能域特征的隐藏相关性进行建模来增强人类蛋白质亚细胞定位预测
Bioinformatics. 2017 Mar 15;33(6):843-853. doi: 10.1093/bioinformatics/btw723.

引用本文的文献

1
Protein subcellular localization prediction tools.蛋白质亚细胞定位预测工具。
Comput Struct Biotechnol J. 2024 Apr 15;23:1796-1807. doi: 10.1016/j.csbj.2024.04.032. eCollection 2024 Dec.
2
Gm-PLoc: A Subcellular Localization Model of Multi-Label Protein Based on GAN and DeepFM.Gm-PLoc:一种基于生成对抗网络(GAN)和深度因子分解机(DeepFM)的多标签蛋白质亚细胞定位模型
Front Genet. 2022 Jun 15;13:912614. doi: 10.3389/fgene.2022.912614. eCollection 2022.
3
Predicting subcellular location of protein with evolution information and sequence-based deep learning.

本文引用的文献

1
iLoc-Hum: using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites.iLoc-Hum:利用累积标签量表预测具有单一位点和多个位点的人类蛋白质的亚细胞定位。
Mol Biosyst. 2012 Feb;8(2):629-41. doi: 10.1039/c1mb05420a. Epub 2011 Dec 1.
2
Multi-kernel transfer learning based on Chou's PseAAC formulation for protein submitochondria localization.基于 Chou 的 PseAAC 构建的多核转移学习在蛋白质亚线粒体定位中的应用。
J Theor Biol. 2012 Jan 21;293:121-30. doi: 10.1016/j.jtbi.2011.10.015. Epub 2011 Oct 21.
3
iLoc-Plant: a multi-label classifier for predicting the subcellular localization of plant proteins with both single and multiple sites.
利用进化信息和基于序列的深度学习预测蛋白质的亚细胞定位。
BMC Bioinformatics. 2021 Oct 22;22(Suppl 10):515. doi: 10.1186/s12859-021-04404-0.
4
Machine and Deep Learning for Prediction of Subcellular Localization.机器和深度学习在预测亚细胞定位中的应用。
Methods Mol Biol. 2021;2361:249-261. doi: 10.1007/978-1-0716-1641-3_15.
5
Predicting human protein subcellular localization by heterogeneous and comprehensive approaches.通过异构和综合方法预测人类蛋白质亚细胞定位。
PLoS One. 2017 Jun 28;12(6):e0178832. doi: 10.1371/journal.pone.0178832. eCollection 2017.
6
Imbalanced multi-label learning for identifying antimicrobial peptides and their functional types.用于识别抗菌肽及其功能类型的不平衡多标签学习
Bioinformatics. 2016 Dec 15;32(24):3745-3752. doi: 10.1093/bioinformatics/btw560. Epub 2016 Aug 26.
7
Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier.基于综合数据源和多标签集成分类器的人类蛋白质亚细胞定位
Sci Rep. 2016 Jun 21;6:28087. doi: 10.1038/srep28087.
8
Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins.用于预测和解释多标签蛋白质亚细胞定位的稀疏回归
BMC Bioinformatics. 2016 Feb 24;17:97. doi: 10.1186/s12859-016-0940-x.
9
Protein (multi-)location prediction: utilizing interdependencies via a generative model.蛋白质(多)定位预测:通过生成模型利用相互依赖性。
Bioinformatics. 2015 Jun 15;31(12):i365-74. doi: 10.1093/bioinformatics/btv264.
10
HybridGO-Loc: mining hybrid features on gene ontology for predicting subcellular localization of multi-location proteins.HybridGO-Loc:在基因本体论上挖掘混合特征以预测多定位蛋白质的亚细胞定位。
PLoS One. 2014 Mar 19;9(3):e89545. doi: 10.1371/journal.pone.0089545. eCollection 2014.
iLoc-Plant:一种用于预测具有单一位点和多个位点的植物蛋白亚细胞定位的多标签分类器。
Mol Biosyst. 2011 Dec;7(12):3287-97. doi: 10.1039/c1mb05232b. Epub 2011 Oct 10.
4
Classification and analysis of regulatory pathways using graph property, biochemical and physicochemical property, and functional property.利用图属性、生化和物理化学性质以及功能性质对调控途径进行分类和分析。
PLoS One. 2011;6(9):e25297. doi: 10.1371/journal.pone.0025297. Epub 2011 Sep 28.
5
iDNA-Prot: identification of DNA binding proteins using random forest with grey model.iDNA-Prot:基于随机森林和灰色模型识别 DNA 结合蛋白。
PLoS One. 2011;6(9):e24756. doi: 10.1371/journal.pone.0024756. Epub 2011 Sep 15.
6
iLoc-Gpos: a multi-layer classifier for predicting the subcellular localization of singleplex and multiplex Gram-positive bacterial proteins.iLoc-Gpos:一种用于预测单重和多重革兰氏阳性细菌蛋白质亚细胞定位的多层分类器。
Protein Pept Lett. 2012 Jan;19(1):4-14. doi: 10.2174/092986612798472839.
7
NR-2L: a two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived features.NR-2L:一种基于序列衍生特征识别核受体亚家族的两级预测器。
PLoS One. 2011;6(8):e23505. doi: 10.1371/journal.pone.0023505. Epub 2011 Aug 15.
8
Predicting transcriptional activity of multiple site p53 mutants based on hybrid properties.基于混合特性预测多个 p53 突变位点的转录活性。
PLoS One. 2011;6(8):e22940. doi: 10.1371/journal.pone.0022940. Epub 2011 Aug 8.
9
Prediction of body fluids where proteins are secreted into based on protein interaction network.基于蛋白质相互作用网络预测蛋白质分泌的体液。
PLoS One. 2011;6(7):e22989. doi: 10.1371/journal.pone.0022989. Epub 2011 Jul 29.
10
Robust prediction of protein subcellular localization combining PCA and WSVMs.基于 PCA 和 WSVMs 的蛋白质亚细胞定位的稳健预测。
Comput Biol Med. 2011 Aug;41(8):648-52. doi: 10.1016/j.compbiomed.2011.05.016. Epub 2011 Jun 30.