• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用伪氨基酸组成预测蛋白质中的跨膜区域:细胞自动机与莱姆-齐夫复杂度

Using pseudo amino acid composition to predict transmembrane regions in protein: cellular automata and Lempel-Ziv complexity.

作者信息

Diao Y, Ma D, Wen Z, Yin J, Xiang J, Li M

机构信息

College of Chemistry, Sichuan University, Chengdu, Sichuan, China.

出版信息

Amino Acids. 2008 Jan;34(1):111-7. doi: 10.1007/s00726-007-0550-z. Epub 2007 May 23.

DOI:10.1007/s00726-007-0550-z
PMID:17520325
Abstract

Transmembrane (TM) proteins represent about 20-30% of the protein sequences in higher eukaryotes, playing important roles across a range of cellular functions. Moreover, knowledge about topology of these proteins often provides crucial hints toward their function. Due to the difficulties in experimental structure determinations of TM protein, theoretical prediction methods are highly preferred in identifying the topology of newly found ones according to their primary sequences, useful in both basic research and drug discovery. In this paper, based on the concept of pseudo amino acid composition (PseAA) that can incorporate sequence-order information of a protein sequence so as to remarkably enhance the power of discrete models (Chou, K. C., Proteins: Structure, Function, and Genetics, 2001, 43: 246-255), cellular automata and Lempel-Ziv complexity are introduced to predict the TM regions of integral membrane proteins including both alpha-helical and beta-barrel membrane proteins, validated by jackknife test. The result thus obtained is quite promising, which indicates that the current approach might be a quite potential high throughput tool in the post-genomic era. The source code and dataset are available for academic users at liml@scu.edu.cn.

摘要

跨膜(TM)蛋白约占高等真核生物蛋白质序列的20%-30%,在一系列细胞功能中发挥着重要作用。此外,关于这些蛋白质拓扑结构的知识通常为其功能提供关键线索。由于跨膜蛋白实验结构测定存在困难,在根据新发现的跨膜蛋白的一级序列确定其拓扑结构时,理论预测方法备受青睐,这在基础研究和药物发现中都很有用。本文基于伪氨基酸组成(PseAA)的概念,该概念可以纳入蛋白质序列的序列顺序信息,从而显著增强离散模型的能力(Chou,K.C.,《蛋白质:结构、功能与遗传学》,2001年,43:246-255),引入细胞自动机和莱姆尔-齐夫复杂度来预测包括α-螺旋和β-桶状膜蛋白在内的整合膜蛋白的跨膜区域,并通过留一法检验进行验证。由此获得的结果很有前景,这表明当前方法可能是后基因组时代一种颇具潜力的高通量工具。学术用户可通过liml@scu.edu.cn获取源代码和数据集。

相似文献

1
Using pseudo amino acid composition to predict transmembrane regions in protein: cellular automata and Lempel-Ziv complexity.利用伪氨基酸组成预测蛋白质中的跨膜区域:细胞自动机与莱姆-齐夫复杂度
Amino Acids. 2008 Jan;34(1):111-7. doi: 10.1007/s00726-007-0550-z. Epub 2007 May 23.
2
Using optimized evidence-theoretic K-nearest neighbor classifier and pseudo-amino acid composition to predict membrane protein types.使用优化的证据理论K近邻分类器和伪氨基酸组成来预测膜蛋白类型。
Biochem Biophys Res Commun. 2005 Aug 19;334(1):288-92. doi: 10.1016/j.bbrc.2005.06.087.
3
Low-frequency Fourier spectrum for predicting membrane protein types.用于预测膜蛋白类型的低频傅里叶频谱。
Biochem Biophys Res Commun. 2005 Oct 28;336(3):737-9. doi: 10.1016/j.bbrc.2005.08.160.
4
Using pseudo amino acid composition to predict protein subnuclear location with improved hybrid approach.使用伪氨基酸组成通过改进的混合方法预测蛋白质亚核定位。
Amino Acids. 2008 Jan;34(1):119-25. doi: 10.1007/s00726-007-0545-9. Epub 2007 May 21.
5
Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes.利用伪氨基酸组成和二叉树支持向量机预测蛋白质结构类别。
Amino Acids. 2007 Nov;33(4):623-9. doi: 10.1007/s00726-007-0496-1. Epub 2007 Feb 19.
6
Using complexity measure factor to predict protein subcellular location.利用复杂性度量因子预测蛋白质亚细胞定位。
Amino Acids. 2005 Feb;28(1):57-61. doi: 10.1007/s00726-004-0148-7. Epub 2004 Dec 22.
7
Predicting membrane protein type by functional domain composition and pseudo-amino acid composition.通过功能域组成和伪氨基酸组成预测膜蛋白类型。
J Theor Biol. 2006 Jan 21;238(2):395-400. doi: 10.1016/j.jtbi.2005.05.035. Epub 2005 Jul 25.
8
Enhanced membrane protein topology prediction using a hierarchical classification method and a new scoring function.使用分层分类方法和新评分函数增强膜蛋白拓扑结构预测
J Proteome Res. 2008 Feb;7(2):487-96. doi: 10.1021/pr0702058.
9
Using pseudo amino acid composition to predict protein structural classes: approached with complexity measure factor.利用伪氨基酸组成预测蛋白质结构类别:基于复杂度度量因子的方法
J Comput Chem. 2006 Mar;27(4):478-82. doi: 10.1002/jcc.20354.
10
Predicting protein subnuclear location with optimized evidence-theoretic K-nearest classifier and pseudo amino acid composition.使用优化的证据理论K近邻分类器和伪氨基酸组成预测蛋白质亚核定位
Biochem Biophys Res Commun. 2005 Nov 25;337(3):752-6. doi: 10.1016/j.bbrc.2005.09.117. Epub 2005 Sep 28.

引用本文的文献

1
New distance measure for comparing protein using cellular automata image.使用细胞自动机图像比较蛋白质的新距离度量。
PLoS One. 2023 Oct 5;18(10):e0287880. doi: 10.1371/journal.pone.0287880. eCollection 2023.
2
Roles of Physicochemical and Structural Properties of RNA-Binding Proteins in Predicting the Activities of Trans-Acting Splicing Factors with Machine Learning.基于物理化学和结构特性的 RNA 结合蛋白在机器学习预测反式剪接因子活性中的作用。
Int J Mol Sci. 2022 Apr 17;23(8):4426. doi: 10.3390/ijms23084426.
3
Computational Modeling of Proteins based on Cellular Automata: A Method of HP Folding Approximation.
基于元胞自动机的蛋白质计算建模:HP 折叠逼近方法。
Protein J. 2018 Jun;37(3):248-260. doi: 10.1007/s10930-018-9771-0.
4
Some remarks on protein attribute prediction and pseudo amino acid composition.关于蛋白质属性预测和伪氨基酸组成的一些说明。
J Theor Biol. 2011 Mar 21;273(1):236-47. doi: 10.1016/j.jtbi.2010.12.024. Epub 2010 Dec 17.
5
Predicting drug-target interaction networks based on functional groups and biological features.基于功能基团和生物特征预测药物-靶标相互作用网络。
PLoS One. 2010 Mar 11;5(3):e9603. doi: 10.1371/journal.pone.0009603.
6
Bridging protein local structures and protein functions.连接蛋白质局部结构与蛋白质功能。
Amino Acids. 2008 Oct;35(3):627-50. doi: 10.1007/s00726-008-0088-8. Epub 2008 Apr 18.
7
Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences.利用支持向量机结合自协方差从蛋白质序列预测蛋白质-蛋白质相互作用。
Nucleic Acids Res. 2008 May;36(9):3025-30. doi: 10.1093/nar/gkn159. Epub 2008 Apr 4.