• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过将复合蛋白质序列特征融合到伪氨基酸组成中来预测膜蛋白类型。

Predicting membrane protein types by fusing composite protein sequence features into pseudo amino acid composition.

作者信息

Hayat Maqsood, Khan Asifullah

机构信息

Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences, Nilore, Islamabad, Pakistan.

出版信息

J Theor Biol. 2011 Feb 21;271(1):10-7. doi: 10.1016/j.jtbi.2010.11.017. Epub 2010 Nov 24.

DOI:10.1016/j.jtbi.2010.11.017
PMID:21110985
Abstract

Membrane proteins are vital type of proteins that serve as channels, receptors, and energy transducers in a cell. Prediction of membrane protein types is an important research area in bioinformatics. Knowledge of membrane protein types provides some valuable information for predicting novel example of the membrane protein types. However, classification of membrane protein types can be both time consuming and susceptible to errors due to the inherent similarity of membrane protein types. In this paper, neural networks based membrane protein type prediction system is proposed. Composite protein sequence representation (CPSR) is used to extract the features of a protein sequence, which includes seven feature sets; amino acid composition, sequence length, 2 gram exchange group frequency, hydrophobic group, electronic group, sum of hydrophobicity, and R-group. Principal component analysis is then employed to reduce the dimensionality of the feature vector. The probabilistic neural network (PNN), generalized regression neural network, and support vector machine (SVM) are used as classifiers. A high success rate of 86.01% is obtained using SVM for the jackknife test. In case of independent dataset test, PNN yields the highest accuracy of 95.73%. These classifiers exhibit improved performance using other performance measures such as sensitivity, specificity, Mathew's correlation coefficient, and F-measure. The experimental results show that the prediction performance of the proposed scheme for classifying membrane protein types is the best reported, so far. This performance improvement may largely be credited to the learning capabilities of neural networks and the composite feature extraction strategy, which exploits seven different properties of protein sequences. The proposed Mem-Predictor can be accessed at http://111.68.99.218/Mem-Predictor.

摘要

膜蛋白是一类重要的蛋白质,在细胞中充当通道、受体和能量转换器。膜蛋白类型的预测是生物信息学中的一个重要研究领域。膜蛋白类型的知识为预测膜蛋白类型的新实例提供了一些有价值的信息。然而,由于膜蛋白类型的内在相似性,膜蛋白类型的分类既耗时又容易出错。本文提出了基于神经网络的膜蛋白类型预测系统。复合蛋白质序列表示(CPSR)用于提取蛋白质序列的特征,它包括七个特征集:氨基酸组成、序列长度、二联体交换基团频率、疏水基团、电子基团、疏水性总和以及R基团。然后采用主成分分析来降低特征向量的维度。概率神经网络(PNN)、广义回归神经网络和支持向量机(SVM)用作分类器。使用SVM进行留一法检验获得了86.01%的高成功率。在独立数据集测试中,PNN的准确率最高,为95.73%。这些分类器在使用其他性能指标(如灵敏度、特异性、马修斯相关系数和F值)时表现出更好的性能。实验结果表明,到目前为止,所提出的膜蛋白类型分类方案的预测性能是已报道的最佳性能。这种性能的提高很大程度上归功于神经网络的学习能力和复合特征提取策略,该策略利用了蛋白质序列的七种不同特性。所提出的Mem-Predictor可通过http://111.68.99.218/Mem-Predictor访问。

相似文献

1
Predicting membrane protein types by fusing composite protein sequence features into pseudo amino acid composition.通过将复合蛋白质序列特征融合到伪氨基酸组成中来预测膜蛋白类型。
J Theor Biol. 2011 Feb 21;271(1):10-7. doi: 10.1016/j.jtbi.2010.11.017. Epub 2010 Nov 24.
2
Prediction of membrane proteins using split amino acid and ensemble classification.基于氨基酸分割和集成分类的膜蛋白预测。
Amino Acids. 2012 Jun;42(6):2447-60. doi: 10.1007/s00726-011-1053-5. Epub 2011 Aug 18.
3
Discriminating lysosomal membrane protein types using dynamic neural network.利用动态神经网络区分溶酶体膜蛋白类型。
J Biomol Struct Dyn. 2014;32(10):1575-82. doi: 10.1080/07391102.2013.827133. Epub 2013 Aug 22.
4
Predicting membrane protein types by incorporating protein topology, domains, signal peptides, and physicochemical properties into the general form of Chou's pseudo amino acid composition.通过将蛋白质拓扑结构、结构域、信号肽和物理化学性质纳入到周的伪氨基酸组成的通用形式中,来预测膜蛋白类型。
J Theor Biol. 2013 Feb 7;318:1-12. doi: 10.1016/j.jtbi.2012.10.033. Epub 2012 Nov 5.
5
Mem-PHybrid: hybrid features-based prediction system for classifying membrane protein types.Mem-PHybrid:一种基于混合特征的膜蛋白类型分类预测系统。
Anal Biochem. 2012 May 1;424(1):35-44. doi: 10.1016/j.ab.2012.02.007. Epub 2012 Feb 14.
6
A two-stage SVM method to predict membrane protein types by incorporating amino acid classifications and physicochemical properties into a general form of Chou's PseAAC.一种两阶段支持向量机方法,通过将氨基酸分类和物理化学性质纳入到 Chou 的 PseAAC 的一般形式中,来预测膜蛋白类型。
J Theor Biol. 2014 Mar 7;344:31-9. doi: 10.1016/j.jtbi.2013.11.017. Epub 2013 Dec 4.
7
Classification of membrane protein types using Voting Feature Interval in combination with Chou's Pseudo Amino Acid Composition.结合周氏伪氨基酸组成,利用投票特征区间对膜蛋白类型进行分类。
J Theor Biol. 2015 Nov 7;384:78-83. doi: 10.1016/j.jtbi.2015.07.034. Epub 2015 Aug 20.
8
Signal peptide discrimination and cleavage site identification using SVM and NN.使用 SVM 和 NN 进行信号肽识别和切割位点鉴定。
Comput Biol Med. 2014 Feb;45:98-110. doi: 10.1016/j.compbiomed.2013.11.017. Epub 2013 Dec 1.
9
GPCR-MPredictor: multi-level prediction of G protein-coupled receptors using genetic ensemble.GPCR-MPredictor:基于遗传集成的 G 蛋白偶联受体多层次预测
Amino Acids. 2012 May;42(5):1809-23. doi: 10.1007/s00726-011-0902-6. Epub 2011 Apr 20.
10
Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.基于周式伪氨基酸组成预测蛋白质结构类别:采用连续小波变换和主成分分析方法
Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23.

引用本文的文献

1
EnDM-CPP: A Multi-view Explainable Framework Based on Deep Learning and Machine Learning for Identifying Cell-Penetrating Peptides with Transformers and Analyzing Sequence Information.EnDM-CPP:一种基于深度学习和机器学习的多视图可解释框架,用于使用Transformer识别细胞穿透肽并分析序列信息。
Interdiscip Sci. 2024 Dec 23. doi: 10.1007/s12539-024-00673-4.
2
DPI_CDF: druggable protein identifier using cascade deep forest.DPI_CDF:基于级联深度森林的可成药性蛋白识别方法。
BMC Bioinformatics. 2024 Apr 5;25(1):145. doi: 10.1186/s12859-024-05744-3.
3
Artificial intelligence-based HDX (AI-HDX) prediction reveals fundamental characteristics to protein dynamics: Mechanisms on SARS-CoV-2 immune escape.
基于人工智能的氢氘交换(AI-HDX)预测揭示了蛋白质动力学的基本特征:SARS-CoV-2免疫逃逸机制。
iScience. 2023 Apr 21;26(4):106282. doi: 10.1016/j.isci.2023.106282. Epub 2023 Feb 27.
4
Omics Data and Data Representations for Deep Learning-Based Predictive Modeling.基于组学数据和数据表示的深度学习预测模型
Int J Mol Sci. 2022 Oct 14;23(20):12272. doi: 10.3390/ijms232012272.
5
An analytical study on the identification of N-linked glycosylation sites using machine learning model.基于机器学习模型的N-糖基化位点识别分析研究
PeerJ Comput Sci. 2022 Sep 21;8:e1069. doi: 10.7717/peerj-cs.1069. eCollection 2022.
6
To Assist Oncologists: An Efficient Machine Learning-Based Approach for Anti-Cancer Peptides Classification.辅助肿瘤学家:一种基于机器学习的高效抗癌肽分类方法。
Sensors (Basel). 2022 May 25;22(11):4005. doi: 10.3390/s22114005.
7
Optimizing acute stroke outcome prediction models: Comparison of generalized regression neural networks and logistic regressions.优化急性脑卒中预后预测模型:广义回归神经网络与逻辑回归的比较。
PLoS One. 2022 May 11;17(5):e0267747. doi: 10.1371/journal.pone.0267747. eCollection 2022.
8
iMPT-FDNPL: Identification of Membrane Protein Types with Functional Domains and a Natural Language Processing Approach.iMPT-FDNPL:基于功能域和自然语言处理方法识别膜蛋白类型。
Comput Math Methods Med. 2021 Oct 11;2021:7681497. doi: 10.1155/2021/7681497. eCollection 2021.
9
Accurate classification of membrane protein types based on sequence and evolutionary information using deep learning.基于序列和进化信息的膜蛋白类型的深度学习精确分类。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):700. doi: 10.1186/s12859-019-3275-6.
10
Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning.基于序列的深度学习同时提高稳定性、准确性和假阳性率的蛋白质功能注释。
Brief Bioinform. 2020 Jul 15;21(4):1437-1447. doi: 10.1093/bib/bbz081.