Michel Christian J
Theoretical Bioinformatics, ICube, CNRS, University of Strasbourg, 300 Boulevard Sébastien Brant, 67400 Illkirch, France.
Biosystems. 2020 Nov;197:104201. doi: 10.1016/j.biosystems.2020.104201. Epub 2020 Jul 2.
Circular codes in genes are maximal: each of the 3 frames has 20 preferential trinucleotides. This combinatorial property is statistically verified in the genes of both bacteria and eukaryotes by two approaches that compute the trinucleotide occurrence frequencies in the 3 frames, one at the gene population level (classical method) and one at the gene level (recent method). Several remarks explain why the codon usage parameter is unable to identify the circular codes. Some historical and theoretical considerations on comma-free and circular codes are presented. An evolutionary process by trinucleotide permutation is proposed to describe the transformation of a circular code (and its motifs) into another circular code.
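As an illustration only, the two counting approaches named in the abstract might be sketched as follows. This is a minimal sketch under stated assumptions, not the paper's actual procedure: the function names are hypothetical, raw counts stand in for occurrence frequencies, ties go to the lowest frame, and the "gene level" variant is assumed to be a simple per-gene vote.

```python
from collections import Counter


def frame_counts(gene):
    """Count the trinucleotides read in frames 0, 1 and 2 of one gene sequence."""
    counts = []
    for frame in range(3):
        c = Counter()
        # Step by 3 from the frame offset; incomplete trailing triplets are skipped.
        for i in range(frame, len(gene) - 2, 3):
            c[gene[i:i + 3]] += 1
        counts.append(c)
    return counts


def preferential_frames_population(genes):
    """Population level (assumed): pool counts over all genes, then pick,
    for each trinucleotide, the frame with the highest pooled count."""
    totals = [Counter(), Counter(), Counter()]
    for gene in genes:
        for frame, c in enumerate(frame_counts(gene)):
            totals[frame].update(c)
    trinucleotides = set().union(*totals)
    # max() returns the lowest frame on ties.
    return {t: max(range(3), key=lambda f: totals[f][t])
            for t in sorted(trinucleotides)}


def preferential_frames_gene_level(genes):
    """Gene level (assumed): each gene votes for the top frame of each of its
    trinucleotides; the frame with the most votes is taken as preferential."""
    votes = {}
    for gene in genes:
        counts = frame_counts(gene)
        for t in set().union(*counts):
            best = max(range(3), key=lambda f: counts[f][t])
            votes.setdefault(t, [0, 0, 0])[best] += 1
    return {t: max(range(3), key=lambda f: v[f])
            for t, v in sorted(votes.items())}
```

For example, `frame_counts("ATGGCCATGGCC")` reads ATG and GCC in frame 0, TGG and CCA in frame 1, and GGC and CAT in frame 2. On real data the counts would normally be normalised to frequencies, and the input would be a large gene population rather than a single toy sequence.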