• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于创建基于周氏伪氨基酸特征以进行亚线粒体定位的遗传编程。

Genetic programming for creating Chou's pseudo amino acid based features for submitochondria localization.

作者信息

Nanni Loris, Lumini Alessandra

机构信息

DEIS, IEIIT-CNR, Università di Bologna, Viale Risorgimento 2, 40136 Bologna, Italy.

出版信息

Amino Acids. 2008 May;34(4):653-60. doi: 10.1007/s00726-007-0018-1. Epub 2008 Jan 4.

DOI:10.1007/s00726-007-0018-1
PMID:18175047
Abstract

Given a protein that is localized in the mitochondria it is very important to know the submitochondria localization of that protein to understand its function. In this work, we propose a submitochondria localizer whose feature extraction method is based on the Chou's pseudo-amino acid composition. The pseudo-amino acid based features are obtained by combining pseudo-amino acid compositions with hundreds of amino-acid indices and amino-acid substitution matrices, then from this huge set of features a small set of 15 "artificial" features is created. The feature creation is performed by genetic programming combining one or more "original" features by means of some mathematical operators. Finally, the set of combined features are used to train a radial basis function support vector machine. This method is named GP-Loc. Moreover, we also propose a very few parameterized method, named ALL-Loc, where all the "original" features are used to train a linear support vector machine. The overall prediction accuracy obtained by GP-Loc is 89% when the jackknife cross-validation is used, this result outperforms the performance obtained in the literature (85.2%) using the same dataset. While the overall prediction accuracy obtained by ALL-Loc is 83.9%.

摘要

对于一种定位于线粒体的蛋白质而言,了解该蛋白质在线粒体内亚结构的定位对于理解其功能非常重要。在这项工作中,我们提出了一种线粒体内亚结构定位器,其特征提取方法基于周(Chou)的伪氨基酸组成。基于伪氨基酸的特征是通过将伪氨基酸组成与数百种氨基酸指数和氨基酸替换矩阵相结合而获得的,然后从这一庞大的特征集中创建一小套15个“人工”特征。特征创建是通过遗传编程借助一些数学运算符将一个或多个“原始”特征组合起来进行的。最后,使用组合特征集训练径向基函数支持向量机。这种方法被命名为GP-Loc。此外,我们还提出了一种参数极少的方法,名为ALL-Loc,其中所有“原始”特征都用于训练线性支持向量机。当使用留一法交叉验证时,GP-Loc获得的总体预测准确率为89%,这一结果优于使用相同数据集在文献中获得的性能(85.2%)。而ALL-Loc获得的总体预测准确率为83.9%。

相似文献

1
Genetic programming for creating Chou's pseudo amino acid based features for submitochondria localization.用于创建基于周氏伪氨基酸特征以进行亚线粒体定位的遗传编程。
Amino Acids. 2008 May;34(4):653-60. doi: 10.1007/s00726-007-0018-1. Epub 2008 Jan 4.
2
Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.基于周式伪氨基酸组成预测蛋白质结构类别:采用连续小波变换和主成分分析方法
Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23.
3
Combing ontologies and dipeptide composition for predicting DNA-binding proteins.结合本体论和二肽组成来预测DNA结合蛋白。
Amino Acids. 2008 May;34(4):635-41. doi: 10.1007/s00726-007-0016-3. Epub 2008 Jan 4.
4
An ensemble of reduced alphabets with protein encoding based on grouped weight for predicting DNA-binding proteins.一种基于分组权重进行蛋白质编码的简化字母表集合,用于预测DNA结合蛋白。
Amino Acids. 2009 Feb;36(2):167-75. doi: 10.1007/s00726-008-0044-7. Epub 2008 Feb 21.
5
Using Chou's pseudo amino acid composition based on approximate entropy and an ensemble of AdaBoost classifiers to predict protein subnuclear location.基于近似熵的周氏伪氨基酸组成和AdaBoost分类器集成来预测蛋白质亚核定位。
Amino Acids. 2008 May;34(4):669-75. doi: 10.1007/s00726-008-0034-9. Epub 2008 Feb 7.
6
Using the augmented Chou's pseudo amino acid composition for predicting protein submitochondria locations based on auto covariance approach.基于自协方差方法,使用增强的周氏伪氨基酸组成预测蛋白质亚线粒体定位。
J Theor Biol. 2009 Jul 21;259(2):366-72. doi: 10.1016/j.jtbi.2009.03.028. Epub 2009 Mar 31.
7
Prediction of protein homo-oligomer types by pseudo amino acid composition: Approached with an improved feature extraction and Naive Bayes Feature Fusion.基于伪氨基酸组成预测蛋白质同源寡聚体类型:采用改进的特征提取和朴素贝叶斯特征融合方法
Amino Acids. 2006 Jun;30(4):461-8. doi: 10.1007/s00726-006-0263-8. Epub 2006 May 15.
8
Using Chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes.利用周氏两亲性伪氨基酸组成和支持向量机预测酶亚家族类别。
J Theor Biol. 2007 Oct 7;248(3):546-51. doi: 10.1016/j.jtbi.2007.06.001. Epub 2007 Jun 9.
9
Prediction of protein submitochondria locations based on data fusion of various features of sequences.基于序列多种特征数据融合的蛋白质亚线粒体位置预测。
J Theor Biol. 2011 Jan 21;269(1):208-16. doi: 10.1016/j.jtbi.2010.10.026. Epub 2010 Oct 30.
10
Prediction of subcellular localization of apoptosis protein using Chou's pseudo amino acid composition.利用周氏伪氨基酸组成预测凋亡蛋白的亚细胞定位
Acta Biotheor. 2009 Sep;57(3):321-30. doi: 10.1007/s10441-008-9067-4. Epub 2009 Jan 24.

引用本文的文献

1
Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction.Circ-LocNet:一种用于环状 RNA 亚细胞定位预测的计算框架。
Int J Mol Sci. 2022 Jul 26;23(15):8221. doi: 10.3390/ijms23158221.
2
Large-scale prediction and analysis of protein sub-mitochondrial localization with DeepMito.DeepMito 实现大规模预测和分析蛋白质亚线粒体定位
BMC Bioinformatics. 2020 Sep 16;21(Suppl 8):266. doi: 10.1186/s12859-020-03617-z.
3
SubmitoLoc: Identification of mitochondrial sub cellular locations of proteins using support vector machine.
SubmitoLoc:使用支持向量机鉴定蛋白质的线粒体亚细胞定位
Bioinformation. 2019 Dec 31;15(12):863-868. doi: 10.6026/97320630015863. eCollection 2019.
4
Some illuminating remarks on molecular genetics and genomics as well as drug development.关于分子遗传学和基因组学以及药物开发的一些有启发性的观点。
Mol Genet Genomics. 2020 Mar;295(2):261-274. doi: 10.1007/s00438-019-01634-z. Epub 2020 Jan 1.
5
Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks.利用蛋白质相互作用网络预测蛋白质亚线粒体定位
Iran J Biotechnol. 2018 Aug 11;16(3):e1933. doi: 10.15171/ijb.1933. eCollection 2018 Aug.
6
DeepMito: accurate prediction of protein sub-mitochondrial localization using convolutional neural networks.DeepMito:使用卷积神经网络准确预测蛋白质亚线粒体定位
Bioinformatics. 2020 Jan 1;36(1):56-64. doi: 10.1093/bioinformatics/btz512.
7
Predicting Protein Submitochondrial Locations: The 10th Anniversary.预测蛋白质的亚线粒体定位:十周年纪念
Curr Genomics. 2017 Aug;18(4):316-321. doi: 10.2174/1389202918666170228143256.
8
Mathematical Modeling of Intestinal Iron Absorption Using Genetic Programming.使用遗传编程对肠道铁吸收进行数学建模
PLoS One. 2017 Jan 10;12(1):e0169601. doi: 10.1371/journal.pone.0169601. eCollection 2017.
9
Recombination spot identification Based on gapped k-mers.基于缺口 k- -mer 的重组位点识别。
Sci Rep. 2016 Mar 31;6:23934. doi: 10.1038/srep23934.
10
Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition.通过将二肽组成纳入周氏广义伪氨基酸组成来预测蛋白质的亚线粒体定位
J Membr Biol. 2016 Jun;249(3):293-304. doi: 10.1007/s00232-015-9868-8. Epub 2016 Jan 8.