Suppr超能文献

弦的实际频率与预期频率的比较揭示了大分子的信息容量。

Comparison of Real Frequencies of Strings vs. the Expected Ones Reveals the Information Capacity of Macromoleculae.

作者信息

Sadovsky Michael G

机构信息

Division of Russian Academy of Sciences, Institute of Biophysics of Siberian, Akademgorodok, Krasnoyarsk, 660036.

出版信息

J Biol Phys. 2003 Mar;29(1):23-38. doi: 10.1023/A:1022554613105.

Abstract

The information capacity of nucleotide sequences is defined through the calculation of specific entropy of their frequency dictionary. The specificentropy of the frequency dictionary is calculated against the reconstructeddictionary; this latter bears the most probable continuations of the shorterstrings. This developed measure allows to distinguish the sequences both from the randons ones, and from those with high level of (rather simple) order. Some implications of the developed methodology in the fields of genetics,bioinformatics, and molecular biology are discussed.

摘要

核苷酸序列的信息容量是通过计算其频率字典的比熵来定义的。频率字典的比熵是相对于重建字典计算的;后者包含较短字符串最可能的延续。这种改进的度量方法能够区分随机序列和具有高度(相当简单)有序性的序列。本文还讨论了这种改进方法在遗传学、生物信息学和分子生物学领域的一些应用。

相似文献

2
Information capacity of nucleotide sequences and its applications.
Bull Math Biol. 2006 May;68(4):785-806. doi: 10.1007/s11538-005-9017-0. Epub 2006 Apr 7.
3
Genes, information and sense: complexity and knowledge retrieval.基因、信息与意义:复杂性与知识检索
Theory Biosci. 2008 Jun;127(2):69-78. doi: 10.1007/s12064-008-0032-1. Epub 2008 Apr 29.
4
The method to compare nucleotide sequences based on the minimum entropy principle.
Bull Math Biol. 2003 Mar;65(2):309-22. doi: 10.1016/S0092-8240(02)00107-6.
6
High-recall protein entity recognition using a dictionary.使用词典进行高召回率蛋白质实体识别。
Bioinformatics. 2005 Jun;21 Suppl 1(Suppl 1):i266-73. doi: 10.1093/bioinformatics/bti1006.
10

本文引用的文献

2
Correlation property of length sequences based on global structure of the complete genome.基于完整基因组全局结构的长度序列的相关性特性。
Phys Rev E Stat Nonlin Soft Matter Phys. 2001 Jan;63(1 Pt 1):011903. doi: 10.1103/PhysRevE.63.011903. Epub 2000 Dec 20.
3
Information content of protein sequences.蛋白质序列的信息内容。
J Theor Biol. 2000 Oct 7;206(3):379-86. doi: 10.1006/jtbi.2000.2138.
4
Evolution of biological information.生物信息的进化
Nucleic Acids Res. 2000 Jul 15;28(14):2794-9. doi: 10.1093/nar/28.14.2794.
6
Zones of low entropy in genomic sequences.基因组序列中的低熵区域。
Comput Chem. 1999 Jun 15;23(3-4):275-82. doi: 10.1016/s0097-8485(99)00009-1.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验