MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM.

Authors

Chou Kuo-Chen, Shen Hong-Bin

Affiliation

Gordon Life Science Institute, San Diego, CA 92130, USA.

Publication information

Biochem Biophys Res Commun. 2007 Aug 24;360(2):339-45. doi: 10.1016/j.bbrc.2007.06.027. Epub 2007 Jun 15.

DOI:10.1016/j.bbrc.2007.06.027

PMID:17586467

Abstract

Given an uncharacterized protein sequence, how can we identify whether it is a membrane protein? If it is, which membrane protein type does it belong to? These questions are important because they are closely related to the biological function of the query protein and to how it interacts with other molecules in a biological system. With the avalanche of protein sequences generated in the post-genomic age and the much slower progress in determining their functions by biochemical experiments, an automated method that helps address these questions is highly desirable. In this study, a two-layer predictor called MemType-2L was developed. The first-layer prediction engine identifies a query protein as membrane or non-membrane; if it is a membrane protein, the process continues automatically with the second-layer prediction engine, which identifies its type among the following eight categories: (1) type I, (2) type II, (3) type III, (4) type IV, (5) multipass, (6) lipid-chain-anchored, (7) GPI-anchored, and (8) peripheral. MemType-2L incorporates evolutionary information by representing protein samples as Pse-PSSM (Pseudo Position-Specific Score Matrix) vectors, and it uses an ensemble classifier formed by fusing many individual OET-KNN (Optimized Evidence-Theoretic K-Nearest Neighbor) classifiers. The success rates obtained by MemType-2L on a newly constructed stringent dataset, in both the jackknife test and the independent dataset test, are quite high, indicating that MemType-2L may become a very useful high-throughput tool. As a web server, MemType-2L is freely accessible to the public at http://chou.med.harvard.edu/bioinf/MemType.
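
The two ideas in the abstract, a fixed-length Pse-PSSM vector and a two-layer decision, can be illustrated with a minimal Python sketch. The abstract gives no formulas, so the feature definition here is an assumption based on the commonly cited form of Pse-PSSM: the 20 column means of the PSSM followed by, for each lag, the mean squared difference between scores that many positions apart. The row-wise normalization of the PSSM is omitted, and the names `pse_pssm`, `predict_two_layer`, `is_membrane`, and `membrane_type` are hypothetical. The two classifier arguments are stand-ins for the fused OET-KNN ensemble, which is not implemented here.

```python
from typing import Callable, List, Optional, Sequence

# The eight second-layer categories listed in the abstract.
MEMBRANE_TYPES = [
    "type I", "type II", "type III", "type IV",
    "multipass", "lipid-chain-anchored", "GPI-anchored", "peripheral",
]


def pse_pssm(pssm: Sequence[Sequence[float]], max_lag: int = 1) -> List[float]:
    """Collapse an L x 20 PSSM into a vector of 20 * (1 + max_lag) values.

    Assumed definition (not given in the abstract): column means, then
    lagged mean squared differences within each column.
    """
    length = len(pssm)
    if length <= max_lag:
        raise ValueError("sequence must be longer than max_lag")
    n_cols = len(pssm[0])
    # Column means: average score of each amino-acid column over all positions.
    features = [sum(row[j] for row in pssm) / length for j in range(n_cols)]
    # Lagged terms: mean squared difference between positions i and i + lag.
    for lag in range(1, max_lag + 1):
        for j in range(n_cols):
            total = sum(
                (pssm[i][j] - pssm[i + lag][j]) ** 2 for i in range(length - lag)
            )
            features.append(total / (length - lag))
    return features


def predict_two_layer(
    features: List[float],
    is_membrane: Callable[[List[float]], bool],
    membrane_type: Callable[[List[float]], str],
) -> Optional[str]:
    """Run layer 1, and run layer 2 only for predicted membrane proteins.

    Returns None for a non-membrane prediction, otherwise one of MEMBRANE_TYPES.
    """
    if not is_membrane(features):
        return None
    label = membrane_type(features)
    if label not in MEMBRANE_TYPES:
        raise ValueError(f"unexpected membrane type: {label!r}")
    return label
```

With `max_lag` set to 2, a sequence of any length greater than 2 maps to a 60-component vector, which is what lets proteins of different lengths be compared by a nearest-neighbor classifier. In practice the PSSM would come from a profile search such as PSI-BLAST, and each callable would wrap a trained model.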
