• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于哈夫曼树方法的蛋白质序列二维图形表示的应用。

Application of 2D graphic representation of protein sequence based on Huffman tree method.

机构信息

College of Information Science and Technology, Shijiazhuang Tiedao University, Shijiazhuang, Hebei, People's Republic of China.

出版信息

Comput Biol Med. 2012 May;42(5):556-63. doi: 10.1016/j.compbiomed.2012.01.011. Epub 2012 Feb 10.

DOI:10.1016/j.compbiomed.2012.01.011
PMID:22325072
Abstract

Based on Huffman tree method, we propose a new 2D graphic representation of protein sequence. This representation can completely avoid loss of information in the transfer of data from a protein sequence to its graphic representation. The method consists of two parts. One is about the 0-1 codes of 20 amino acids by Huffman tree with amino acid frequency. The amino acid frequency is defined as the statistical number of an amino acid in the analyzed protein sequences. The other is about the 2D graphic representation of protein sequence based on the 0-1 codes. Then the applications of the method on ten ND5 genes and seven Escherichia coli strains are presented in detail. The results show that the proposed model may provide us with some new sights to understand the evolution patterns determined from protein sequences and complete genomes.

摘要

基于哈夫曼树方法,我们提出了一种新的蛋白质序列二维图形表示方法。这种表示方法可以完全避免在将蛋白质序列转换为图形表示时信息的丢失。该方法包括两部分。一部分是根据氨基酸频率的哈夫曼树对 20 种氨基酸进行 0-1 编码。氨基酸频率定义为分析的蛋白质序列中某一氨基酸的统计数。另一部分是基于 0-1 编码的蛋白质序列的二维图形表示。然后详细介绍了该方法在 10 个 ND5 基因和 7 个大肠杆菌菌株上的应用。结果表明,所提出的模型可能为我们提供一些新的视角来理解由蛋白质序列和完整基因组决定的进化模式。

相似文献

1
Application of 2D graphic representation of protein sequence based on Huffman tree method.基于哈夫曼树方法的蛋白质序列二维图形表示的应用。
Comput Biol Med. 2012 May;42(5):556-63. doi: 10.1016/j.compbiomed.2012.01.011. Epub 2012 Feb 10.
2
2D-MH: A web-server for generating graphic representation of protein sequences based on the physicochemical properties of their constituent amino acids.2D-MH:一个基于组成蛋白质的氨基酸理化性质生成蛋白质序列图形表示的网络服务器。
J Theor Biol. 2010 Nov 7;267(1):29-34. doi: 10.1016/j.jtbi.2010.08.007. Epub 2010 Aug 7.
3
A novel graphical representation of protein sequences and its application.一种新颖的蛋白质序列图形表示及其应用。
J Comput Chem. 2011 Sep;32(12):2539-44. doi: 10.1002/jcc.21833. Epub 2011 Jun 2.
4
Using Huffman coding method to visualize and analyze DNA sequences.使用哈夫曼编码方法对 DNA 序列进行可视化和分析。
J Comput Chem. 2011 Nov 30;32(15):3233-40. doi: 10.1002/jcc.21906. Epub 2011 Aug 26.
5
Similarity/dissimilarity studies of protein sequences based on a new 2D graphical representation.基于新的 2D 图形表示的蛋白质序列相似性/差异性研究。
J Comput Chem. 2010 Apr 15;31(5):1045-52. doi: 10.1002/jcc.21391.
6
S curve, a graphic representation of protein secondary structure sequence and its applications.S曲线,一种蛋白质二级结构序列的图形表示及其应用。
Biopolymers. 2000 Jun;53(7):539-49. doi: 10.1002/(SICI)1097-0282(200006)53:7<539::AID-BIP2>3.0.CO;2-2.
7
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
8
2D representation of protein secondary structure sequences and its applications.蛋白质二级结构序列的二维表示及其应用。
J Comput Chem. 2006 Aug;27(11):1119-24. doi: 10.1002/jcc.20430.
9
Correlation and prediction of gene expression level from amino acid and dipeptide composition of its protein.基于蛋白质的氨基酸和二肽组成对基因表达水平进行相关性分析与预测。
BMC Bioinformatics. 2005 Mar 17;6:59. doi: 10.1186/1471-2105-6-59.
10
Potential implications of availability of short amino acid sequences in proteins: an old and new approach to protein decoding and design.蛋白质中短氨基酸序列可用性的潜在影响:蛋白质解码与设计的新旧方法
Biotechnol Annu Rev. 2008;14:109-41. doi: 10.1016/S1387-2656(08)00004-5.

引用本文的文献

1
ADLD: a novel graphical representation of protein sequences and its application.ADLD:一种蛋白质序列的新型图形表示及其应用
Comput Math Methods Med. 2014;2014:959753. doi: 10.1155/2014/959753. Epub 2014 Oct 30.
2
3D representations of amino acids-applications to protein sequence comparison and classification.氨基酸的 3D 表示——在蛋白质序列比较和分类中的应用。
Comput Struct Biotechnol J. 2014 Sep 6;11(18):47-58. doi: 10.1016/j.csbj.2014.09.001. eCollection 2014 Aug.
3
Similarity/Dissimilarity analysis of protein sequences based on a new spectrum-like graphical representation.
基于新的类光谱图形表示的蛋白质序列相似性/差异性分析。
Evol Bioinform Online. 2014 Jun 12;10:87-96. doi: 10.4137/EBO.S14713. eCollection 2014.