
Why nature chose A, C, G and U/T: an error-coding perspective of nucleotide alphabet composition.

Author information

Mac Dónaill Dónall A

Affiliation

Department of Chemistry, Trinity College, Dublin 2, Ireland.

Publication information

Orig Life Evol Biosph. 2003 Oct;33(4-5):433-55. doi: 10.1023/a:1025715209867.

Abstract

The question of whether the size and make-up of the natural nucleotide alphabet is a consequence of selection pressure, or simply a frozen accident, is one of the fundamental questions of biology. Nucleotide replication is essentially an information transmission phenomenon, and so it seems reasonable to explore the issue from the perspective of theoretical computer science, and of error-coding theory in particular. In this analysis it is shown that the essential recognition features of nucleotides may be naturally expressed as 4-digit binary numbers, capturing the hydrogen acceptor/donor patterns (3-bits) and the purine/pyrimidine feature (1-bit). Optimal alphabets consist of nucleotides in which the purine/pyrimidine feature is related to the acceptor/donor pattern as a parity bit. Numerically interpreted, such alphabets correspond to parity check codes, simple but effective error-resistant structures. The natural alphabet appears to be an adaptation of one of two optimal solutions, constrained to its present size and composition by a combination of chemical and coding-theory factors.
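The parity-bit idea in the abstract can be illustrated with a minimal sketch. It assumes only what the abstract states: three bits for the hydrogen acceptor/donor pattern and one bit for the purine/pyrimidine feature. Which bit value stands for donor or for purine, the choice of even parity, and the helper names `parity_bit`, `hamming` and `min_distance` are assumptions made for illustration; the abstract does not specify them, and no particular nucleotide is assigned to any codeword here.

```python
from itertools import combinations, product


def parity_bit(pattern):
    """Return the bit that makes the total count of 1s even (even parity is assumed)."""
    return sum(pattern) % 2


def hamming(a, b):
    """Count the positions at which two equal-length bit tuples differ."""
    return sum(x != y for x, y in zip(a, b))


def min_distance(words):
    """Smallest Hamming distance between any two distinct words in the set."""
    return min(hamming(a, b) for a, b in combinations(words, 2))


# All 3-bit acceptor/donor patterns (hypothetical convention for what 0 and 1 mean).
patterns = list(product((0, 1), repeat=3))

# Tie the fourth (purine/pyrimidine) bit to the pattern as a parity bit.
codewords = [p + (parity_bit(p),) for p in patterns]

# For comparison: every possible 4-bit word, with the fourth bit unconstrained.
all_words = list(product((0, 1), repeat=4))

print(min_distance(all_words))   # 1: a single bit flip can turn one word into another
print(min_distance(codewords))   # 2: a single bit flip never yields another codeword
```

In this sketch, constraining the fourth bit halves the number of usable 4-bit words from 16 to 8 but raises the minimum distance between them from 1 to 2, which is the sense in which a parity check code is error-resistant: any single-bit change produces a word outside the code.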

