• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用位置特异性图谱和带加权输入的神经网络预测真核生物蛋白质的亚细胞定位

Prediction of subcellular localization of eukaryotic proteins using position-specific profiles and neural network with weighted inputs.

作者信息

Zou Lingyun, Wang Zhengzhi, Huang Jiaomin

机构信息

College of Mechatronics and Automation, National University of Defense Technology, Changsha 410073, China.

出版信息

J Genet Genomics. 2007 Dec;34(12):1080-7. doi: 10.1016/S1673-8527(07)60123-4.

DOI:10.1016/S1673-8527(07)60123-4
PMID:18155620
Abstract

Subcellular location is one of the key biological characteristics of proteins. Position-specific profiles (PSP) have been introduced as important characteristics of proteins in this article. In this study, to obtain position-specific profiles, the Position Specific Iterative-Basic Local Alignment Search Tool (PSI-BLAST) has been used to search for protein sequences in a database. Position-specific scoring matrices are extracted from the profiles as one class of characteristics. Four-part amino acid compositions and 1st-7th order dipeptide compositions have also been calculated as the other two classes of characteristics. Therefore, twelve characteristic vectors are extracted from each of the protein sequences. Next, the characteristic vectors are weighed by a simple weighing function and inputted into a BP neural network predictor named PSP-Weighted Neural Network (PSP-WNN). The Levenberg-Marquardt algorithm is employed to adjust the weight matrices and thresholds during the network training instead of the error back propagation algorithm. With a jackknife test on the RH2427 dataset, PSP-WNN has achieved a higher overall prediction accuracy of 88.4% rather than the prediction results by the general BP neural network, Markov model, and fuzzy k-nearest neighbors algorithm on this dataset. In addition, the prediction performance of PSP-WNN has been evaluated with a five-fold cross validation test on the PK7579 dataset and the prediction results have been consistently better than those of the previous method on the basis of several support vector machines, using compositions of both amino acids and amino acid pairs. These results indicate that PSP-WNN is a powerful tool for subcellular localization prediction. At the end of the article, influences on prediction accuracy using different weighting proportions among three characteristic vector categories have been discussed. An appropriate proportion is considered by increasing the prediction accuracy.

摘要

亚细胞定位是蛋白质的关键生物学特性之一。本文引入了位置特异性谱(PSP)作为蛋白质的重要特性。在本研究中,为了获得位置特异性谱,使用位置特异性迭代基本局部比对搜索工具(PSI-BLAST)在数据库中搜索蛋白质序列。从这些谱中提取位置特异性评分矩阵作为一类特征。还计算了四部分氨基酸组成和一阶至七阶二肽组成作为另外两类特征。因此,从每个蛋白质序列中提取十二个特征向量。接下来,通过一个简单的加权函数对特征向量进行加权,并将其输入到一个名为PSP加权神经网络(PSP-WNN)的BP神经网络预测器中。在网络训练过程中,采用Levenberg-Marquardt算法来调整权重矩阵和阈值,而不是误差反向传播算法。通过对RH2427数据集进行留一法检验,PSP-WNN在该数据集上取得了88.4%的更高总体预测准确率,高于普通BP神经网络、马尔可夫模型和模糊k近邻算法的预测结果。此外,通过对PK7579数据集进行五折交叉验证测试来评估PSP-WNN的预测性能,基于氨基酸和氨基酸对组成的几种支持向量机的预测结果始终优于先前方法。这些结果表明,PSP-WNN是亚细胞定位预测的有力工具。在文章结尾,讨论了在三类特征向量中使用不同加权比例对预测准确率的影响。通过提高预测准确率来考虑合适的比例。

相似文献

1
Prediction of subcellular localization of eukaryotic proteins using position-specific profiles and neural network with weighted inputs.利用位置特异性图谱和带加权输入的神经网络预测真核生物蛋白质的亚细胞定位
J Genet Genomics. 2007 Dec;34(12):1080-7. doi: 10.1016/S1673-8527(07)60123-4.
2
LOCSVMPSI: a web server for subcellular localization of eukaryotic proteins using SVM and profile of PSI-BLAST.LOCSVMPSI:一个利用支持向量机和PSI-BLAST序列谱进行真核生物蛋白质亚细胞定位的网络服务器。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W105-10. doi: 10.1093/nar/gki359.
3
Prediction of ubiquitin proteins using artificial neural networks, hidden markov model and support vector machines.使用人工神经网络、隐马尔可夫模型和支持向量机对泛素蛋白进行预测。
In Silico Biol. 2007;7(6):559-68.
4
Prediction of protein subcellular location using a combined feature of sequence.利用序列的组合特征预测蛋白质亚细胞定位。
FEBS Lett. 2005 Jun 20;579(16):3444-8. doi: 10.1016/j.febslet.2005.05.021.
5
A neural network method for prediction of beta-turn types in proteins using evolutionary information.一种利用进化信息预测蛋白质中β-转角类型的神经网络方法。
Bioinformatics. 2004 Nov 1;20(16):2751-8. doi: 10.1093/bioinformatics/bth322. Epub 2004 May 14.
6
Protein subcellular location prediction using optimally weighted fuzzy k-NN algorithm.使用最优加权模糊k近邻算法进行蛋白质亚细胞定位预测。
Comput Biol Chem. 2008 Dec;32(6):448-51. doi: 10.1016/j.compbiolchem.2008.07.011. Epub 2008 Jul 16.
7
Prediction of protein subcellular localization.蛋白质亚细胞定位预测
Proteins. 2006 Aug 15;64(3):643-51. doi: 10.1002/prot.21018.
8
Fast model-based protein homology detection without alignment.基于快速模型的无需比对的蛋白质同源性检测。
Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.
9
Using Chou's pseudo amino acid composition based on approximate entropy and an ensemble of AdaBoost classifiers to predict protein subnuclear location.基于近似熵的周氏伪氨基酸组成和AdaBoost分类器集成来预测蛋白质亚核定位。
Amino Acids. 2008 May;34(4):669-75. doi: 10.1007/s00726-008-0034-9. Epub 2008 Feb 7.
10
Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers.基于统计几何学,使用随机森林和神经模糊分类器预测非同义单核苷酸多态性的功能效应
Proteins. 2008 Jun;71(4):1930-9. doi: 10.1002/prot.21838.

引用本文的文献

1
An ensemble classifier for eukaryotic protein subcellular location prediction using gene ontology categories and amino acid hydrophobicity.基于基因本体论类别和氨基酸疏水性的真核蛋白质亚细胞定位预测的集成分类器。
PLoS One. 2012;7(1):e31057. doi: 10.1371/journal.pone.0031057. Epub 2012 Jan 30.