Multilabel learning via random label selection for protein subcellular multilocations prediction.

Affiliation

Key Laboratory of Embedded System and Service Computing, Ministry of Education, Department of Control Science and Engineering, Tongji University, Shanghai 201804, China.

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):436-46. doi: 10.1109/TCBB.2013.21.

DOI:10.1109/TCBB.2013.21
PMID:23929867
Abstract

Prediction of protein subcellular localization is an important but challenging problem, particularly when proteins may simultaneously exist at, or move between, two or more different subcellular location sites. Most existing protein subcellular localization methods handle only single-location proteins. In the past few years, only a few methods have been proposed to tackle proteins with multiple locations. However, they adopt only a simple strategy, namely transforming each multilocation protein into multiple single-location proteins, which does not take correlations among different subcellular locations into account. In this paper, a novel method named random label selection (RALS) (multilabel learning via RALS), which extends the simple binary relevance (BR) method, is proposed to learn from multilocation proteins in an effective and efficient way. RALS does not explicitly find the correlations among labels, but rather implicitly attempts to learn the label correlations from data by augmenting the original feature space with randomly selected labels as additional input features. Through a fivefold cross-validation test on a benchmark data set, we demonstrate that our proposed method, which considers label correlations, clearly outperforms the baseline BR method, which does not, indicating that correlations among different subcellular locations really exist and contribute to improved prediction performance. Experimental results on two benchmark data sets also show that our proposed methods achieve significantly higher performance than some other state-of-the-art methods in predicting subcellular multilocations of proteins. The prediction web server is available at http://levis.tongji.edu.cn:8080/bioinfo/MLPred-Euk/ for public use.
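
The idea of extending binary relevance by appending randomly selected labels to the feature vector can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`fit_rals`, `predict_rals`), the parameter `k`, and the nearest-centroid base classifier are all hypothetical choices made for illustration. The abstract also does not say how the selected labels are obtained at prediction time, when true labels are unknown; the sketch assumes they are filled in with first-stage plain-BR predictions.

```python
import random

def fit_centroid(X, y):
    """Toy binary classifier: store the centroid of each class (None if a class is absent)."""
    cents = []
    for cls in (0, 1):
        rows = [x for x, t in zip(X, y) if t == cls]
        cents.append([sum(col) / len(rows) for col in zip(*rows)] if rows else None)
    return cents

def predict_centroid(cents, x):
    """Predict 1 if x is strictly closer to the positive centroid, else 0."""
    neg, pos = cents
    if pos is None:
        return 0
    if neg is None:
        return 1
    def dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return 1 if dist(pos) < dist(neg) else 0

def fit_br(X, Y):
    """Plain binary relevance: one independent classifier per label."""
    return [fit_centroid(X, [row[j] for row in Y]) for j in range(len(Y[0]))]

def predict_br(models, x):
    return [predict_centroid(m, x) for m in models]

def fit_rals(X, Y, k, seed=0):
    """For each label, append k randomly chosen *other* labels as extra features."""
    rng = random.Random(seed)
    n_labels = len(Y[0])
    base = fit_br(X, Y)  # assumption: used only to fill in labels at prediction time
    chosen, models = [], []
    for j in range(n_labels):
        others = [l for l in range(n_labels) if l != j]
        picked = rng.sample(others, min(k, len(others)))
        # Training uses the true values of the picked labels as extra features.
        X_aug = [list(x) + [row[l] for l in picked] for x, row in zip(X, Y)]
        chosen.append(picked)
        models.append(fit_centroid(X_aug, [row[j] for row in Y]))
    return base, chosen, models

def predict_rals(model, x):
    """Assumption: picked labels are replaced by first-stage BR predictions."""
    base, chosen, models = model
    first = predict_br(base, x)
    return [predict_centroid(m, list(x) + [first[l] for l in picked])
            for picked, m in zip(chosen, models)]

# Toy data: labels 0 and 1 always co-occur; label 2 depends on the second feature.
X = [[0.0, 0.0], [0.1, 1.0], [1.0, 0.0], [0.9, 1.0]]
Y = [[0, 0, 0], [0, 0, 1], [1, 1, 0], [1, 1, 1]]
model = fit_rals(X, Y, k=1)
print(predict_rals(model, [0.95, 0.05]))
```

In this sketch each per-label classifier sees a different random subset of the other labels, so any correlation (such as labels 0 and 1 co-occurring in the toy data) can be picked up by the base learner without being modeled explicitly.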

Similar articles

1
Multilabel learning via random label selection for protein subcellular multilocations prediction.
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):436-46. doi: 10.1109/TCBB.2013.21.
2
Multilabel learning for protein subcellular location prediction.
IEEE Trans Nanobioscience. 2012 Sep;11(3):237-43. doi: 10.1109/TNB.2012.2212249.
3
Predict subcellular locations of singleplex and multiplex proteins by semi-supervised learning and dimension-reducing general mode of Chou's PseAAC.
IEEE Trans Nanobioscience. 2013 Dec;12(4):311-20. doi: 10.1109/TNB.2013.2272014.
4
A multi-label predictor for identifying the subcellular locations of singleplex and multiplex eukaryotic proteins.
PLoS One. 2012;7(5):e36317. doi: 10.1371/journal.pone.0036317. Epub 2012 May 22.
5
MSLoc-DT: a new method for predicting the protein subcellular location of multispecies based on decision templates.
Anal Biochem. 2014 Mar 15;449:164-71. doi: 10.1016/j.ab.2013.12.013. Epub 2013 Dec 21.
6
A new method for predicting the subcellular localization of eukaryotic proteins with both single and multiple sites: Euk-mPLoc 2.0.
PLoS One. 2010 Apr 1;5(4):e9931. doi: 10.1371/journal.pone.0009931.
7
Euk-mPLoc: a fusion classifier for large-scale eukaryotic protein subcellular location prediction by incorporating multiple sites.
J Proteome Res. 2007 May;6(5):1728-34. doi: 10.1021/pr060635i. Epub 2007 Mar 31.
8
HPSLPred: An Ensemble Multi-Label Classifier for Human Protein Subcellular Location Prediction with Imbalanced Source.
Proteomics. 2017 Sep;17(17-18). doi: 10.1002/pmic.201700262.
9
Virus-ECC-mPLoc: a multi-label predictor for predicting the subcellular localization of virus proteins with both single and multiple sites based on a general form of Chou's pseudo amino acid composition.
Protein Pept Lett. 2013 Mar;20(3):309-17. doi: 10.2174/0929866511320030009.
10
MultiP-Apo: A Multilabel Predictor for Identifying Subcellular Locations of Apoptosis Proteins.
Comput Intell Neurosci. 2017;2017:9183796. doi: 10.1155/2017/9183796. Epub 2017 Jul 4.

Articles citing this paper

1
PScL-HDeep: image-based prediction of protein subcellular location in human tissue using ensemble learning of handcrafted and deep learned features with two-layer feature selection.
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab278.
2
MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy.
BMC Bioinformatics. 2019 Oct 26;20(1):522. doi: 10.1186/s12859-019-3136-3.
3
Bastion3: a two-layer ensemble predictor of type III secreted effectors.
Bioinformatics. 2019 Jun 1;35(12):2017-2028. doi: 10.1093/bioinformatics/bty914.
4
Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN Classifier.
Biomed Res Int. 2016;2016:1793272. doi: 10.1155/2016/1793272. Epub 2016 Apr 24.
5
Syndrome Differentiation Analysis on Mars500 Data of Traditional Chinese Medicine.
ScientificWorldJournal. 2015;2015:125736. doi: 10.1155/2015/125736. Epub 2015 Oct 1.
6
Patient classification of hypertension in Traditional Chinese Medicine using multi-label learning techniques.
BMC Med Genomics. 2015;8 Suppl 3(Suppl 3):S4. doi: 10.1186/1755-8794-8-S3-S4. Epub 2015 Sep 23.
7
Qualitative and quantitative analysis for facial complexion in traditional Chinese medicine.
Biomed Res Int. 2014;2014:207589. doi: 10.1155/2014/207589. Epub 2014 May 22.
8
Augmenting multi-instance multilabel learning with sparse bayesian models for skin biopsy image analysis.
Biomed Res Int. 2014;2014:305629. doi: 10.1155/2014/305629. Epub 2014 Apr 7.