

Strong Comma-Free Codes in Genetic Information.

Authors

Fimmel Elena, Michel Christian J, Strüngmann Lutz

Affiliations

Institute of Mathematical Biology, Faculty for Computer Sciences, Mannheim University of Applied Sciences, 68163, Mannheim, Germany.

Theoretical Bioinformatics, ICube, CNRS, University of Strasbourg, 300 Boulevard Sébastien Brant, 67400, Illkirch, France.

Publication information

Bull Math Biol. 2017 Aug;79(8):1796-1819. doi: 10.1007/s11538-017-0307-0. Epub 2017 Jun 22.

Abstract

Comma-free codes constitute a widely studied class of circular codes, investigated in particular by Golomb et al. (Biologiske Meddelelser, Kongelige Danske Videnskabernes Selskab 23:1-34, 1958a, Can J Math 10:202-209, 1958b), Michel et al. (Comput Math Appl 55:989-996, 2008a, Theor Comput Sci 401:17-26, 2008b, Inf Comput 212:55-63, 2012), Michel and Pirillo (Int J Comb 2011:659567, 2011), and Fimmel and Strüngmann (J Theor Biol 389:206-213, 2016). Based on a recent approach that uses graph theory to study circular codes (Fimmel et al., Philos Trans R Soc 374:20150058, 2016), a new class of circular codes, called strong comma-free codes, is identified. These codes detect a frameshift during the translation process immediately after a reading window of at most two nucleotides. We describe several combinatorial properties of strong comma-free codes: enumeration, maximality, self-complementarity and [Formula: see text]-property (comma-free property in all three possible frames). These combinatorial results also highlight some new properties of the genetic code and its evolution. Each amino acid in the standard genetic code is coded by at least one strong comma-free code of size 1. There are 9 amino acids [Formula: see text] among 20 such that for each amino acid from S, its synonymous trinucleotide set (excluding the necessary periodic trinucleotides [Formula: see text]) is a strong comma-free code. The primeval comma-free RNY code of Eigen and Schuster (Naturwissenschaften 65:341-369, 1978) is a self-complementary [Formula: see text]-code of size 16. Furthermore, it is the union of two strong comma-free codes of size 8 which are complementary to each other.
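The claims about the standard genetic code and the RNY code can be explored with a short script. The following is a minimal sketch, not the authors' method. The abstract does not state the formal definition, so the sketch assumes that a trinucleotide code is strong comma-free when no proper prefix of a codeword (1 or 2 letters) equals a proper suffix of a codeword, which is one reading of "a reading window of at most two nucleotides". The DNA alphabet {A, C, G, T}, the split of RNY into RRY and RYY, and all function and variable names are likewise assumptions made for illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order; '*' marks stop codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Periodic trinucleotides (AAA, CCC, GGG, TTT) are excluded from synonymous sets.
PERIODIC = {b * 3 for b in BASES}


def is_strong_comma_free(code):
    """ASSUMED definition: no proper prefix of any codeword equals
    a proper suffix of any codeword (lengths 1 and 2)."""
    prefixes = {w[:k] for w in code for k in (1, 2)}
    suffixes = {w[-k:] for w in code for k in (1, 2)}
    return prefixes.isdisjoint(suffixes)


def reverse_complement(w):
    comp = {"A": "T", "T": "A", "C": "G", "G": "C"}
    return "".join(comp[b] for b in reversed(w))


# Group codons by the amino acid they encode (stop codons skipped).
synonymous = {}
for codon, aa in CODON_TABLE.items():
    if aa != "*":
        synonymous.setdefault(aa, set()).add(codon)

# Amino acids whose synonymous set, minus periodic codons, passes the test.
strong_amino_acids = sorted(
    aa for aa, codons in synonymous.items()
    if is_strong_comma_free(codons - PERIODIC)
)

# Does every amino acid have at least one codon that passes on its own?
has_singleton = all(
    any(is_strong_comma_free({c}) for c in codons)
    for codons in synonymous.values()
)

# RNY code: purine (R), any base (N), pyrimidine (Y); split by middle base.
R, Y = "AG", "CT"
RNY = {r + n + y for r in R for n in BASES for y in Y}
RRY = {w for w in RNY if w[1] in R}
RYY = RNY - RRY

print(len(strong_amino_acids), strong_amino_acids)
print(has_singleton)
print(is_strong_comma_free(RRY), is_strong_comma_free(RYY))
print({reverse_complement(w) for w in RRY} == RYY)
```

Under this assumed definition, the sketch finds nine amino acids whose synonymous sets pass the test, which agrees with the count given in the abstract, and it confirms that RRY and RYY are two size-8 codes that map onto each other under reverse complementation. The disjointness test on prefix and suffix sets was chosen because it is the simplest check that needs only the codewords themselves; the paper's graph-theoretic formulation may be stated differently.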

