Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences, Nilore, Islamabad, Pakistan.
Anal Biochem. 2012 May 1;424(1):35-44. doi: 10.1016/j.ab.2012.02.007. Epub 2012 Feb 14.
Membrane proteins are a major class of proteins and encoded by approximately 20% to 30% of genes in most organisms. In this work, a two-layer novel membrane protein prediction system, called Mem-PHybrid, is proposed. It is able to first identify the protein query as a membrane or nonmembrane protein. In the second level, it further identifies the type of membrane protein. The proposed Mem-PHybrid prediction system is based on hybrid features, whereby a fusion of both the physicochemical and split amino acid composition-based features is performed. This enables the proposed Mem-PHybrid to exploit the discrimination capabilities of both types of feature extraction strategy. In addition, minimum redundancy and maximum relevance has also been applied to reduce the dimensionality of a feature vector. We employ random forest, evidence-theoretic K-nearest neighbor, and support vector machine (SVM) as classifiers and analyze their performance on two datasets. SVM using hybrid features yields the highest accuracy of 89.6% and 97.3% on dataset1 and 91.5% and 95.5% on dataset2 for jackknife and independent dataset tests, respectively. The enhanced prediction performance of Mem-PHybrid is largely attributed to the exploitation of the discrimination power of the hybrid features and of the learning capability of SVM. Mem-PHybrid is accessible at http://www.111.68.99.218/Mem-PHybrid.
膜蛋白是一大类蛋白质,约占大多数生物体中 20%至 30%的基因编码。在这项工作中,提出了一种两层新型膜蛋白预测系统,称为 Mem-PHybrid。它首先能够识别蛋白质查询是膜蛋白还是非膜蛋白。在第二级,它进一步识别膜蛋白的类型。所提出的 Mem-PHybrid 预测系统基于混合特征,即融合了物理化学和分裂氨基酸组成特征。这使得所提出的 Mem-PHybrid 能够利用这两种特征提取策略的区分能力。此外,还应用了最小冗余和最大相关性来降低特征向量的维数。我们使用随机森林、证据理论 K-最近邻和支持向量机(SVM)作为分类器,并在两个数据集上分析它们的性能。SVM 使用混合特征在数据集 1 上的 jackknife 和独立数据集测试中分别产生了 89.6%和 97.3%的最高精度,在数据集 2 上分别产生了 91.5%和 95.5%的最高精度。Mem-PHybrid 的增强预测性能主要归因于混合特征的区分能力和 SVM 的学习能力的利用。Mem-PHybrid 可在 http://www.111.68.99.218/Mem-PHybrid 上访问。