• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用支持向量机和隐马尔可夫模型预测线粒体蛋白质。

Prediction of mitochondrial proteins using support vector machine and hidden Markov model.

作者信息

Kumar Manish, Verma Ruchi, Raghava Gajendra P S

机构信息

Institute of Microbial Technology, Sector 39-A, Chandigarh 160036, India.

出版信息

J Biol Chem. 2006 Mar 3;281(9):5357-63. doi: 10.1074/jbc.M511061200. Epub 2005 Dec 8.

DOI:10.1074/jbc.M511061200
PMID:16339140
Abstract

Mitochondria are considered as one of the core organelles of eukaryotic cells hence prediction of mitochondrial proteins is one of the major challenges in the field of genome annotation. This study describes a method, MitPred, developed for predicting mitochondrial proteins with high accuracy. The data set used in this study was obtained from Guda, C., Fahy, E. & Subramaniam, S. (2004) Bioinformatics 20, 1785-1794. First support vector machine-based modules/methods were developed using amino acid and dipeptide composition of proteins and achieved accuracy of 78.37 and 79.38%, respectively. The accuracy of prediction further improved to 83.74% when split amino acid composition (25 N-terminal, 25 C-terminal, and remaining residues) of proteins was used. Then BLAST search and support vector machine-based method were combined to get 88.22% accuracy. Finally we developed a hybrid approach that combined hidden Markov model profiles of domains (exclusively found in mitochondrial proteins) and the support vector machine-based method. We were able to predict mitochondrial protein with 100% specificity at a 56.36% sensitivity rate and with 80.50% specificity at 98.95% sensitivity. The method estimated 9.01, 6.35, 4.84, 3.95, and 4.25% of proteins as mitochondrial in Saccharomyces cerevisiae, Drosophila melanogaster, Caenorhabditis elegans, mouse, and human proteomes, respectively. MitPred was developed on the above hybrid approach.

摘要

线粒体被认为是真核细胞的核心细胞器之一,因此预测线粒体蛋白是基因组注释领域的主要挑战之一。本研究描述了一种名为MitPred的方法,该方法用于高精度预测线粒体蛋白。本研究中使用的数据集来自古达、法希和苏布拉马尼亚姆(2004年)发表于《生物信息学》20卷第1785 - 1794页的文章。首先,利用蛋白质的氨基酸和二肽组成开发了基于支持向量机的模块/方法,其准确率分别达到了78.37%和79.38%。当使用蛋白质的拆分氨基酸组成(25个N端、25个C端和其余残基)时,预测准确率进一步提高到83.74%。然后,将BLAST搜索与基于支持向量机的方法相结合,准确率达到了88.22%。最后,我们开发了一种混合方法,该方法结合了(仅在线粒体蛋白中发现的)结构域的隐马尔可夫模型概况和基于支持向量机的方法。我们能够以100%的特异性在56.36%的灵敏度下预测线粒体蛋白,以及在98.95%的灵敏度下以80.50%的特异性进行预测。该方法分别估计酿酒酵母、黑腹果蝇、秀丽隐杆线虫、小鼠和人类蛋白质组中有9.01%、6.35%、4.84%、3.95%和4.25%的蛋白质为线粒体蛋白。MitPred就是基于上述混合方法开发的。

相似文献

1
Prediction of mitochondrial proteins using support vector machine and hidden Markov model.使用支持向量机和隐马尔可夫模型预测线粒体蛋白质。
J Biol Chem. 2006 Mar 3;281(9):5357-63. doi: 10.1074/jbc.M511061200. Epub 2005 Dec 8.
2
Prediction of nuclear proteins using SVM and HMM models.使用支持向量机和隐马尔可夫模型预测核蛋白。
BMC Bioinformatics. 2009 Jan 19;10:22. doi: 10.1186/1471-2105-10-22.
3
A machine learning based method for the prediction of secretory proteins using amino acid composition, their order and similarity-search.一种基于机器学习的方法,利用氨基酸组成、顺序和相似性搜索来预测分泌蛋白。
In Silico Biol. 2008;8(2):129-40.
4
Predicting sub-cellular localization of tRNA synthetases from their primary structures.从一级结构预测 tRNA 合成酶的亚细胞定位。
Amino Acids. 2012 May;42(5):1703-13. doi: 10.1007/s00726-011-0872-8. Epub 2011 Mar 13.
5
Mito-GSAAC: mitochondria prediction using genetic ensemble classifier and split amino acid composition.Mito-GSAAC:基于遗传集成分类器和分裂氨基酸组成的线粒体预测。
Amino Acids. 2012 Apr;42(4):1443-54. doi: 10.1007/s00726-011-0888-0. Epub 2011 Mar 29.
6
Prediction of protein binding sites in protein structures using hidden Markov support vector machine.利用隐马尔可夫支持向量机预测蛋白质结构中的蛋白质结合位点。
BMC Bioinformatics. 2009 Nov 20;10:381. doi: 10.1186/1471-2105-10-381.
7
Prediction of mitochondrial proteins based on genetic algorithm - partial least squares and support vector machine.基于遗传算法-偏最小二乘法和支持向量机的线粒体蛋白质预测
Amino Acids. 2007 Nov;33(4):669-75. doi: 10.1007/s00726-006-0465-0. Epub 2007 Aug 15.
8
Proteome-wide prediction and annotation of mitochondrial and sub-mitochondrial proteins by incorporating domain information.通过整合结构域信息,对线粒体和亚线粒体蛋白质进行蛋白质组范围的预测和注释。
Mitochondrion. 2018 Sep;42:11-22. doi: 10.1016/j.mito.2017.10.004. Epub 2017 Oct 12.
9
The mitochondrial proteome database: MitoP2.线粒体蛋白质组数据库:MitoP2
Methods Enzymol. 2009;457:3-20. doi: 10.1016/S0076-6879(09)05001-0.
10
Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile.利用氨基酸组成拆分和 PSSM 图谱预测疟原虫的线粒体蛋白。
Amino Acids. 2010 Jun;39(1):101-10. doi: 10.1007/s00726-009-0381-1. Epub 2009 Nov 12.

引用本文的文献

1
AMPGAN v2: Machine Learning-Guided Design of Antimicrobial Peptides.AMPGAN v2:基于机器学习的抗菌肽设计。
J Chem Inf Model. 2021 May 24;61(5):2198-2207. doi: 10.1021/acs.jcim.0c01441. Epub 2021 Mar 31.
2
Recognition of Mitochondrial Proteins in Plasmodium Based on the Tripeptide Composition.基于三肽组成对疟原虫线粒体蛋白质的识别
Front Cell Dev Biol. 2020 Sep 16;8:578901. doi: 10.3389/fcell.2020.578901. eCollection 2020.
3
SubmitoLoc: Identification of mitochondrial sub cellular locations of proteins using support vector machine.
SubmitoLoc:使用支持向量机鉴定蛋白质的线粒体亚细胞定位
Bioinformation. 2019 Dec 31;15(12):863-868. doi: 10.6026/97320630015863. eCollection 2019.
4
Prediction of endoplasmic reticulum resident proteins using fragmented amino acid composition and support vector machine.利用片段氨基酸组成和支持向量机预测内质网驻留蛋白
PeerJ. 2017 Sep 4;5:e3561. doi: 10.7717/peerj.3561. eCollection 2017.
5
The Slo(w) path to identifying the mitochondrial channels responsible for ischemic protection.识别负责缺血保护的线粒体通道的缓慢之路。
Biochem J. 2017 Jun 9;474(12):2067-2094. doi: 10.1042/BCJ20160623.
6
PredHSP: Sequence Based Proteome-Wide Heat Shock Protein Prediction and Classification Tool to Unlock the Stress Biology.PredHSP:基于序列的全蛋白质组热休克蛋白预测与分类工具,用于揭示应激生物学
PLoS One. 2016 May 19;11(5):e0155872. doi: 10.1371/journal.pone.0155872. eCollection 2016.
7
Development of Antimicrobial Peptide Prediction Tool for Aquaculture Industries.水产养殖行业抗菌肽预测工具的开发。
Probiotics Antimicrob Proteins. 2016 Sep;8(3):141-9. doi: 10.1007/s12602-016-9215-0.
8
JPPRED: Prediction of Types of J-Proteins from Imbalanced Data Using an Ensemble Learning Method.JPPRED:使用集成学习方法从不平衡数据预测J蛋白类型
Biomed Res Int. 2015;2015:705156. doi: 10.1155/2015/705156. Epub 2015 Oct 26.
9
BEAN 2.0: an integrated web resource for the identification and functional analysis of type III secreted effectors.BEAN 2.0:用于III型分泌效应子鉴定和功能分析的综合网络资源。
Database (Oxford). 2015 Jun 27;2015:bav064. doi: 10.1093/database/bav064. Print 2015.
10
NRfamPred: a proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families.NRfamPred:一种用于预测核受体蛋白及其亚家族的蛋白质组规模的两级方法。
Sci Rep. 2014 Oct 29;4:6810. doi: 10.1038/srep06810.