
Using the concept of Chou's pseudo amino acid composition to predict protein subcellular localization: an approach by incorporating evolutionary information and von Neumann entropies.

Authors

Zhang Shao-Wu, Zhang Yun-Long, Yang Hui-Fang, Zhao Chun-Hui, Pan Quan

Affiliation

College of Automation, Northwestern Polytechnical University, No. 127 Youyi West Road, Xi'an 710072, China.

Publication information

Amino Acids. 2008 May;34(4):565-72. doi: 10.1007/s00726-007-0010-9. Epub 2007 Dec 11.

DOI: 10.1007/s00726-007-0010-9
PMID: 18074191
Abstract

The rapidly increasing number of sequences entering genome databanks calls for automated methods to analyze them. Information on the subcellular localization of newly found protein sequences is important for revealing their functions in a timely manner and for studying systems biology at the cellular level. Based on the concept of Chou's pseudo-amino acid composition, a series of useful information and techniques, such as residue conservation scores, von Neumann entropies, multi-scale energy, and a weighted auto-correlation function, were used to generate the pseudo-amino acid components that represent the protein samples. On this basis, a hybrid predictor was developed for assigning uncharacterized proteins to one of the following 12 subcellular localizations: chloroplast, cytoplasm, cytoskeleton, endoplasmic reticulum, extracellular, Golgi apparatus, lysosome, mitochondria, nucleus, peroxisome, plasma membrane, and vacuole. Compared with the results reported by previous investigators, higher success rates were obtained, suggesting that the current approach is quite promising and may become a useful high-throughput tool in the relevant areas.
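As a rough illustration of the pseudo-amino acid composition idea the abstract builds on, the sketch below computes a classic sequence-order-correlated feature vector: 20 normalized amino acid frequencies followed by `lam` correlation factors. It is not the authors' predictor. The abstract does not give their formulas, so the conservation scores, von Neumann entropies, and multi-scale energy terms are left out. The function name `pseaac`, the use of the Kyte-Doolittle hydropathy scale as the single residue property, and the defaults `lam=3` and `weight=0.05` are all assumptions made for this example.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy index. Using this scale (and only this one
# property) is an assumption of the sketch, not a detail from the paper.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def standardize(scale):
    """Rescale a property to zero mean and unit variance over the 20 residues."""
    values = [scale[a] for a in AMINO_ACIDS]
    mean = sum(values) / len(values)
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))
    return {a: (scale[a] - mean) / std for a in AMINO_ACIDS}


def pseaac(sequence, lam=3, weight=0.05, scale=KYTE_DOOLITTLE):
    """Return a (20 + lam)-dimensional pseudo-amino acid composition vector."""
    seq = sequence.upper()
    if any(residue not in scale for residue in seq):
        raise ValueError("sequence contains a non-standard residue")
    if len(seq) <= lam:
        raise ValueError("sequence must be longer than lam")

    h = standardize(scale)
    n = len(seq)

    # Conventional amino acid composition: occurrence frequencies.
    freqs = [seq.count(a) / n for a in AMINO_ACIDS]

    # k-th tier correlation factor: mean squared property difference
    # between residues that are k positions apart in the sequence.
    thetas = [
        sum((h[seq[i]] - h[seq[i + k]]) ** 2 for i in range(n - k)) / (n - k)
        for k in range(1, lam + 1)
    ]

    # Joint normalization so that all components sum to 1.
    denom = sum(freqs) + weight * sum(thetas)
    return [f / denom for f in freqs] + [weight * t / denom for t in thetas]


if __name__ == "__main__":
    vector = pseaac("MKTAYIAKQRQISFVKSHFSRQ", lam=3)
    print(len(vector))
```

A vector of this kind would then be passed to a classifier that assigns one of the 12 localization labels. The example sequence is arbitrary and serves only to show the call.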


Similar articles

1. Using the concept of Chou's pseudo amino acid composition to predict protein subcellular localization: an approach by incorporating evolutionary information and von Neumann entropies.
   Amino Acids. 2008 May;34(4):565-72. doi: 10.1007/s00726-007-0010-9. Epub 2007 Dec 11.
2. Using Chou's pseudo amino acid composition based on approximate entropy and an ensemble of AdaBoost classifiers to predict protein subnuclear location.
   Amino Acids. 2008 May;34(4):669-75. doi: 10.1007/s00726-008-0034-9. Epub 2008 Feb 7.
3. Using pseudo amino acid composition to predict protein subcellular location: approached with amino acid composition distribution.
   Amino Acids. 2008 Aug;35(2):321-7. doi: 10.1007/s00726-007-0623-z. Epub 2008 Jan 22.
4. Using Chou's pseudo amino acid composition to predict protein quaternary structure: a sequence-segmented PseAAC approach.
   Amino Acids. 2008 Oct;35(3):591-8. doi: 10.1007/s00726-008-0086-x. Epub 2008 Apr 22.
5. Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.
   Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23.
6. Prediction of protein subcellular localization by support vector machines using multi-scale energy and pseudo amino acid composition.
   Amino Acids. 2007 Jul;33(1):69-74. doi: 10.1007/s00726-006-0475-y. Epub 2007 Jan 19.
7. Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes.
   Amino Acids. 2007 Nov;33(4):623-9. doi: 10.1007/s00726-007-0496-1. Epub 2007 Feb 19.
8. Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction.
   Amino Acids. 2007 Jul;33(1):57-67. doi: 10.1007/s00726-006-0478-8. Epub 2007 Jan 19.
9. Prediction and classification of protein subcellular location-sequence-order effect and pseudo amino acid composition.
   J Cell Biochem. 2003 Dec 15;90(6):1250-60. doi: 10.1002/jcb.10719.
10. Predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-Nearest Neighbor classifiers.
    J Proteome Res. 2006 Aug;5(8):1888-97. doi: 10.1021/pr060167c.

Cited by

1. Some illuminating remarks on molecular genetics and genomics as well as drug development.
   Mol Genet Genomics. 2020 Mar;295(2):261-274. doi: 10.1007/s00438-019-01634-z. Epub 2020 Jan 1.
2. Gene Prediction in Metagenomic Fragments with Deep Learning.
   Biomed Res Int. 2017;2017:4740354. doi: 10.1155/2017/4740354. Epub 2017 Nov 8.
3. MultiP-Apo: A Multilabel Predictor for Identifying Subcellular Locations of Apoptosis Proteins.
   Comput Intell Neurosci. 2017;2017:9183796. doi: 10.1155/2017/9183796. Epub 2017 Jul 4.
4. Prediction of protein-protein interactions with clustered amino acids and weighted sparse representation.
   Int J Mol Sci. 2015 May 13;16(5):10855-69. doi: 10.3390/ijms160510855.
5. Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.
   Mol Genet Genomics. 2015 Oct;290(5):1919-31. doi: 10.1007/s00438-015-1044-4. Epub 2015 Apr 21.
6. iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition.
   PLoS One. 2014 Sep 3;9(9):e106691. doi: 10.1371/journal.pone.0106691. eCollection 2014.
7. iMethyl-PseAAC: identification of protein methylation sites via a pseudo amino acid composition approach.
   Biomed Res Int. 2014;2014:947416. doi: 10.1155/2014/947416. Epub 2014 May 22.
8. iSS-PseDNC: identifying splicing sites using pseudo dinucleotide composition.
   Biomed Res Int. 2014;2014:623149. doi: 10.1155/2014/623149. Epub 2014 May 21.
9. Prediction of protein S-nitrosylation sites based on adapted normal distribution bi-profile Bayes and Chou's pseudo amino acid composition.
   Int J Mol Sci. 2014 Jun 10;15(6):10410-23. doi: 10.3390/ijms150610410.
10. Sequence-specific flexibility organization of splicing flanking sequence and prediction of splice sites in the human genome.
    Chromosome Res. 2014 Sep;22(3):321-34. doi: 10.1007/s10577-014-9414-z. Epub 2014 Apr 12.