• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用一种新颖的AdaBoost算法和周氏伪氨基酸组成来预测蛋白质亚细胞定位。

Using a novel AdaBoost algorithm and Chou's Pseudo amino acid composition for predicting protein subcellular localization.

作者信息

Lin Jie, Wang Yan

机构信息

Department of information management and information system, College of Economic and Management.4800 Cao An Road. Shang Hai, China, 216000.

出版信息

Protein Pept Lett. 2011 Dec;18(12):1219-25. doi: 10.2174/092986611797642797.

DOI:10.2174/092986611797642797
PMID:21728988
Abstract

For a protein, an important characteristic is its location or compartment in a cell. This is because a protein has to be located in its proper position in a cell to perform its biological functions. Therefore, predicting protein subcellular location is an important and challenging task in current molecular and cellular biology. In this paper, based on AdaBoost.ME algorithm and Chou's PseAAC (pseudo amino acid composition), a new computational method was developed to identify protein subcellular location. AdaBoost.ME is an improved version of AdaBoost algorithm that can directly extend the original AdaBoost algorithm to deal with multi-class cases without the need to reduce it to multiple two-class problems. In some previous studies the conventional amino acid composition was applied to represent protein samples. In order to take into account the sequence order effects, in this study we use Chou's PseAAC to represent protein samples. To demonstrate that AdaBoost.ME is a robust and efficient model in predicting protein subcellular locations, the same protein dataset used by Cedano et al. (Journal of Molecular Biology, 1997, 266: 594-600) is adopted in this paper. It can be seen from the computed results that the accuracy achieved by our method is better than those by the methods developed by the previous investigators.

摘要

对于一种蛋白质而言,一个重要的特性是其在细胞中的位置或区室。这是因为蛋白质必须位于细胞内的适当位置才能发挥其生物学功能。因此,预测蛋白质亚细胞定位是当前分子和细胞生物学中一项重要且具有挑战性的任务。在本文中,基于AdaBoost.ME算法和周的伪氨基酸组成(PseAAC),开发了一种新的计算方法来识别蛋白质亚细胞定位。AdaBoost.ME是AdaBoost算法的改进版本,它可以直接扩展原始的AdaBoost算法以处理多类情况,而无需将其简化为多个二类问题。在一些先前的研究中,使用传统的氨基酸组成来表示蛋白质样本。为了考虑序列顺序效应,在本研究中我们使用周的伪氨基酸组成来表示蛋白质样本。为了证明AdaBoost.ME在预测蛋白质亚细胞定位方面是一个稳健且高效的模型,本文采用了Cedano等人(《分子生物学杂志》,1997年,266: 594 - 600)使用的相同蛋白质数据集。从计算结果可以看出,我们的方法所达到的准确率优于先前研究者开发的方法。

相似文献

1
Using a novel AdaBoost algorithm and Chou's Pseudo amino acid composition for predicting protein subcellular localization.使用一种新颖的AdaBoost算法和周氏伪氨基酸组成来预测蛋白质亚细胞定位。
Protein Pept Lett. 2011 Dec;18(12):1219-25. doi: 10.2174/092986611797642797.
2
pLoc_bal-mVirus: Predict Subcellular Localization of Multi-Label Virus Proteins by Chou's General PseAAC and IHTS Treatment to Balance Training Dataset.pLoc_bal-mVirus:基于周式广义伪氨基酸组成和用于平衡训练数据集的迭代启发式阈值选择处理预测多标签病毒蛋白的亚细胞定位
Med Chem. 2019;15(5):496-509. doi: 10.2174/1573406415666181217114710.
3
Prediction of protein subcellular localization with oversampling approach and Chou's general PseAAC.基于过采样方法和周式广义伪氨基酸组成预测蛋白质亚细胞定位
J Theor Biol. 2018 Jan 21;437:239-250. doi: 10.1016/j.jtbi.2017.10.030. Epub 2017 Oct 31.
4
Predicting viral protein subcellular localization with Chou's pseudo amino acid composition and imbalance-weighted multi-label K-nearest neighbor algorithm.利用周氏伪氨基酸组成和不平衡加权多标签K近邻算法预测病毒蛋白亚细胞定位
Protein Pept Lett. 2012 Nov;19(11):1163-9. doi: 10.2174/092986612803216999.
5
Predicting protein subcellular location using Chou's pseudo amino acid composition and improved hybrid approach.利用周氏伪氨基酸组成和改进的混合方法预测蛋白质亚细胞定位。
Protein Pept Lett. 2008;15(6):612-6. doi: 10.2174/092986608784966930.
6
Prediction of Golgi-resident protein types using general form of Chou's pseudo-amino acid compositions: Approaches with minimal redundancy maximal relevance feature selection.基于周氏伪氨基酸组成的一般形式预测高尔基体驻留蛋白类型:采用最小冗余最大相关特征选择的方法
J Theor Biol. 2016 Aug 7;402:38-44. doi: 10.1016/j.jtbi.2016.04.032. Epub 2016 May 4.
7
Using the concept of Chou's pseudo amino acid composition to predict protein subcellular localization: an approach by incorporating evolutionary information and von Neumann entropies.利用周氏伪氨基酸组成概念预测蛋白质亚细胞定位:一种融合进化信息和冯·诺依曼熵的方法
Amino Acids. 2008 May;34(4):565-72. doi: 10.1007/s00726-007-0010-9. Epub 2007 Dec 11.
8
Using Chou's pseudo amino acid composition based on approximate entropy and an ensemble of AdaBoost classifiers to predict protein subnuclear location.基于近似熵的周氏伪氨基酸组成和AdaBoost分类器集成来预测蛋白质亚核定位。
Amino Acids. 2008 May;34(4):669-75. doi: 10.1007/s00726-008-0034-9. Epub 2008 Feb 7.
9
A new signal characterization and signal-based Chou's PseAAC representation of protein sequences.一种新的蛋白质序列信号表征及基于信号的周氏伪氨基酸组成表示法。
J Bioinform Comput Biol. 2015 Oct;13(5):1550024. doi: 10.1142/S0219720015500249. Epub 2015 Aug 21.
10
Prediction of subcellular localization of apoptosis protein using Chou's pseudo amino acid composition.利用周氏伪氨基酸组成预测凋亡蛋白的亚细胞定位
Acta Biotheor. 2009 Sep;57(3):321-30. doi: 10.1007/s10441-008-9067-4. Epub 2009 Jan 24.

引用本文的文献

1
Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix.基于改进的伪位置特异性评分矩阵的整合不同特征预测细胞凋亡蛋白
Biomed Res Int. 2020 Jan 14;2020:4071508. doi: 10.1155/2020/4071508. eCollection 2020.
2
Some illuminating remarks on molecular genetics and genomics as well as drug development.关于分子遗传学和基因组学以及药物开发的一些有启发性的观点。
Mol Genet Genomics. 2020 Mar;295(2):261-274. doi: 10.1007/s00438-019-01634-z. Epub 2020 Jan 1.
3
MultiP-Apo: A Multilabel Predictor for Identifying Subcellular Locations of Apoptosis Proteins.
MultiP-Apo:一种用于识别凋亡蛋白亚细胞定位的多标签预测器。
Comput Intell Neurosci. 2017;2017:9183796. doi: 10.1155/2017/9183796. Epub 2017 Jul 4.
4
Learning from Heterogeneous Data Sources: An Application in Spatial Proteomics.从异构数据源学习:空间蛋白质组学中的应用
PLoS Comput Biol. 2016 May 13;12(5):e1004920. doi: 10.1371/journal.pcbi.1004920. eCollection 2016 May.
5
PseAAC-General: fast building various modes of general form of Chou's pseudo-amino acid composition for large-scale protein datasets.PseAAC-General:快速构建用于大规模蛋白质数据集的周氏伪氨基酸组成通用形式的各种模式。
Int J Mol Sci. 2014 Feb 26;15(3):3495-506. doi: 10.3390/ijms15033495.
6
Comprehensive comparative analysis and identification of RNA-binding protein domains: multi-class classification and feature selection.RNA结合蛋白结构域的综合比较分析与鉴定:多类分类与特征选择
J Theor Biol. 2012 Nov 7;312:65-75. doi: 10.1016/j.jtbi.2012.07.013. Epub 2012 Aug 3.
7
Prediction of protein domain with mRMR feature selection and analysis.基于 mRMR 特征选择的蛋白质结构域预测及分析。
PLoS One. 2012;7(6):e39308. doi: 10.1371/journal.pone.0039308. Epub 2012 Jun 15.
8
A multi-label predictor for identifying the subcellular locations of singleplex and multiplex eukaryotic proteins.一种用于识别单plex 和 multiplex 真核蛋白质亚细胞位置的多标签预测器。
PLoS One. 2012;7(5):e36317. doi: 10.1371/journal.pone.0036317. Epub 2012 May 22.
9
Predicting Anatomical Therapeutic Chemical (ATC) classification of drugs by integrating chemical-chemical interactions and similarities.通过整合化学-化学相互作用和相似性来预测药物的解剖治疗化学 (ATC) 分类。
PLoS One. 2012;7(4):e35254. doi: 10.1371/journal.pone.0035254. Epub 2012 Apr 13.