• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Mem-PHybrid:一种基于混合特征的膜蛋白类型分类预测系统。

Mem-PHybrid: hybrid features-based prediction system for classifying membrane protein types.

机构信息

Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences, Nilore, Islamabad, Pakistan.

出版信息

Anal Biochem. 2012 May 1;424(1):35-44. doi: 10.1016/j.ab.2012.02.007. Epub 2012 Feb 14.

DOI:10.1016/j.ab.2012.02.007
PMID:22342883
Abstract

Membrane proteins are a major class of proteins and encoded by approximately 20% to 30% of genes in most organisms. In this work, a two-layer novel membrane protein prediction system, called Mem-PHybrid, is proposed. It is able to first identify the protein query as a membrane or nonmembrane protein. In the second level, it further identifies the type of membrane protein. The proposed Mem-PHybrid prediction system is based on hybrid features, whereby a fusion of both the physicochemical and split amino acid composition-based features is performed. This enables the proposed Mem-PHybrid to exploit the discrimination capabilities of both types of feature extraction strategy. In addition, minimum redundancy and maximum relevance has also been applied to reduce the dimensionality of a feature vector. We employ random forest, evidence-theoretic K-nearest neighbor, and support vector machine (SVM) as classifiers and analyze their performance on two datasets. SVM using hybrid features yields the highest accuracy of 89.6% and 97.3% on dataset1 and 91.5% and 95.5% on dataset2 for jackknife and independent dataset tests, respectively. The enhanced prediction performance of Mem-PHybrid is largely attributed to the exploitation of the discrimination power of the hybrid features and of the learning capability of SVM. Mem-PHybrid is accessible at http://www.111.68.99.218/Mem-PHybrid.

摘要

膜蛋白是一大类蛋白质,约占大多数生物体中 20%至 30%的基因编码。在这项工作中,提出了一种两层新型膜蛋白预测系统,称为 Mem-PHybrid。它首先能够识别蛋白质查询是膜蛋白还是非膜蛋白。在第二级,它进一步识别膜蛋白的类型。所提出的 Mem-PHybrid 预测系统基于混合特征,即融合了物理化学和分裂氨基酸组成特征。这使得所提出的 Mem-PHybrid 能够利用这两种特征提取策略的区分能力。此外,还应用了最小冗余和最大相关性来降低特征向量的维数。我们使用随机森林、证据理论 K-最近邻和支持向量机(SVM)作为分类器,并在两个数据集上分析它们的性能。SVM 使用混合特征在数据集 1 上的 jackknife 和独立数据集测试中分别产生了 89.6%和 97.3%的最高精度,在数据集 2 上分别产生了 91.5%和 95.5%的最高精度。Mem-PHybrid 的增强预测性能主要归因于混合特征的区分能力和 SVM 的学习能力的利用。Mem-PHybrid 可在 http://www.111.68.99.218/Mem-PHybrid 上访问。

相似文献

1
Mem-PHybrid: hybrid features-based prediction system for classifying membrane protein types.Mem-PHybrid:一种基于混合特征的膜蛋白类型分类预测系统。
Anal Biochem. 2012 May 1;424(1):35-44. doi: 10.1016/j.ab.2012.02.007. Epub 2012 Feb 14.
2
Predicting membrane protein types by fusing composite protein sequence features into pseudo amino acid composition.通过将复合蛋白质序列特征融合到伪氨基酸组成中来预测膜蛋白类型。
J Theor Biol. 2011 Feb 21;271(1):10-7. doi: 10.1016/j.jtbi.2010.11.017. Epub 2010 Nov 24.
3
Prediction of membrane proteins using split amino acid and ensemble classification.基于氨基酸分割和集成分类的膜蛋白预测。
Amino Acids. 2012 Jun;42(6):2447-60. doi: 10.1007/s00726-011-1053-5. Epub 2011 Aug 18.
4
Mito-GSAAC: mitochondria prediction using genetic ensemble classifier and split amino acid composition.Mito-GSAAC:基于遗传集成分类器和分裂氨基酸组成的线粒体预测。
Amino Acids. 2012 Apr;42(4):1443-54. doi: 10.1007/s00726-011-0888-0. Epub 2011 Mar 29.
5
A two-stage SVM method to predict membrane protein types by incorporating amino acid classifications and physicochemical properties into a general form of Chou's PseAAC.一种两阶段支持向量机方法,通过将氨基酸分类和物理化学性质纳入到 Chou 的 PseAAC 的一般形式中,来预测膜蛋白类型。
J Theor Biol. 2014 Mar 7;344:31-9. doi: 10.1016/j.jtbi.2013.11.017. Epub 2013 Dec 4.
6
A machine learning based method for the prediction of secretory proteins using amino acid composition, their order and similarity-search.一种基于机器学习的方法,利用氨基酸组成、顺序和相似性搜索来预测分泌蛋白。
In Silico Biol. 2008;8(2):129-40.
7
Discriminating outer membrane proteins with Fuzzy K-nearest Neighbor algorithms based on the general form of Chou's PseAAC.基于周式伪氨基酸组成通用形式,运用模糊K近邻算法鉴别外膜蛋白。
Protein Pept Lett. 2012 Apr;19(4):411-21. doi: 10.2174/092986612799789387.
8
Weighted-support vector machines for predicting membrane protein types based on pseudo-amino acid composition.基于伪氨基酸组成的用于预测膜蛋白类型的加权支持向量机
Protein Eng Des Sel. 2004 Jun;17(6):509-16. doi: 10.1093/protein/gzh061. Epub 2004 Aug 16.
9
CE-PLoc: an ensemble classifier for predicting protein subcellular locations by fusing different modes of pseudo amino acid composition.CE-PLoc:一种通过融合不同模式的伪氨基酸组成来预测蛋白质亚细胞位置的集成分类器。
Comput Biol Chem. 2011 Aug 10;35(4):218-29. doi: 10.1016/j.compbiolchem.2011.05.003. Epub 2011 May 27.
10
Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition.通过将二肽组成纳入周氏广义伪氨基酸组成来预测蛋白质的亚线粒体定位
J Membr Biol. 2016 Jun;249(3):293-304. doi: 10.1007/s00232-015-9868-8. Epub 2016 Jan 8.

引用本文的文献

1
StackedEnC-AOP: prediction of antioxidant proteins using transform evolutionary and sequential features based multi-scale vector with stacked ensemble learning.StackedEnC-AOP:基于多尺度向量的转换进化和序列特征与堆叠集成学习预测抗氧化蛋白。
BMC Bioinformatics. 2024 Aug 4;25(1):256. doi: 10.1186/s12859-024-05884-6.
2
Accurate classification of membrane protein types based on sequence and evolutionary information using deep learning.基于序列和进化信息的膜蛋白类型的深度学习精确分类。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):700. doi: 10.1186/s12859-019-3275-6.
3
A Novel Hybrid Feature Extraction Model for Classification on Pulmonary Nodules.
一种用于肺结节分类的新型混合特征提取模型。
Asian Pac J Cancer Prev. 2019 Feb 26;20(2):457-468. doi: 10.31557/APJCP.2019.20.2.457.
4
Employing a novel 2-gram subgroup intra pattern (2GSIP) with stacked auto encoder for membrane protein classification.采用带有堆叠自动编码器的新型二元子组内部模式(2GSIP)进行膜蛋白分类。
Mol Biol Rep. 2019 Apr;46(2):2259-2272. doi: 10.1007/s11033-019-04680-3. Epub 2019 Feb 18.
5
A Treatise to Computational Approaches Towards Prediction of Membrane Protein and Its Subtypes.关于膜蛋白及其亚型预测的计算方法的论文
J Membr Biol. 2017 Feb;250(1):55-76. doi: 10.1007/s00232-016-9937-7. Epub 2016 Nov 19.
6
iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou's PseAAC to formulate DNA samples.iRSpot-GAEnsC:通过集成分类器识别重组位点并扩展周氏伪氨基酸组成概念以构建DNA样本
Mol Genet Genomics. 2016 Feb;291(1):285-96. doi: 10.1007/s00438-015-1108-5. Epub 2015 Aug 30.
7
Customised fragments libraries for protein structure prediction based on structural class annotations.基于结构类注释的用于蛋白质结构预测的定制片段文库。
BMC Bioinformatics. 2015 Apr 29;16(1):136. doi: 10.1186/s12859-015-0576-2.
8
An ensemble method with hybrid features to identify extracellular matrix proteins.一种具有混合特征的集成方法用于识别细胞外基质蛋白。
PLoS One. 2015 Feb 13;10(2):e0117804. doi: 10.1371/journal.pone.0117804. eCollection 2015.
9
Protein inter-domain linker prediction using Random Forest and amino acid physiochemical properties.利用随机森林和氨基酸理化性质进行蛋白质结构域间连接子预测。
BMC Bioinformatics. 2014;15 Suppl 16(Suppl 16):S8. doi: 10.1186/1471-2105-15-S16-S8. Epub 2014 Dec 8.
10
A multi-label classifier for prediction membrane protein functional types in animal.一种用于预测动物膜蛋白功能类型的多标签分类器。
J Membr Biol. 2014 Nov;247(11):1141-8. doi: 10.1007/s00232-014-9708-2. Epub 2014 Aug 9.