Circular code motifs in transfer and 16S ribosomal RNAs: a possible translation code in genes.

Affiliation

Equipe de Bioinformatique Théorique, BFO, LSIIT (UMR 7005), Université de Strasbourg, Pôle API, Boulevard Sébastien Brant, 67400 Illkirch, France.

Publication information

Comput Biol Chem. 2012 Apr;37:24-37. doi: 10.1016/j.compbiolchem.2011.10.002. Epub 2011 Oct 11.

Abstract

In 1996, a common trinucleotide circular code, called X, was identified in genes of eukaryotes and prokaryotes (Arquès and Michel, 1996). This circular code X is a set of 20 trinucleotides that allows the reading frame in genes to be retrieved locally, i.e. anywhere in genes and in particular without start codons. This reading frame retrieval requires a window length l of 12 nucleotides (l ≥ 12). With a window length strictly less than 12 nucleotides (l < 12), some words of X, called ambiguous words, are found in the shifted frames (the reading frame shifted by one or two nucleotides), preventing the reading frame in genes from being retrieved. Since 1996, these ambiguous words of X have never been studied. In the first part of this paper, we identify all the ambiguous words of the common trinucleotide circular code X. For a length l varying from 1 to 11 nucleotides, the type and the occurrence number (multiplicity) of ambiguous words of X are given in each shifted frame. Maximal ambiguous words of X, i.e. words which are not factors of other ambiguous words, are also determined. Two probability definitions based on these results show that the common trinucleotide circular code X retrieves the reading frame in genes with a probability of about 90% with a window length of 6 nucleotides, and a probability of 99.9% with a window length of 9 nucleotides (100% with a window length of 12 nucleotides, by definition of a circular code). In the second part of this paper, we identify X circular code motifs (X motifs for short) in transfer RNA and 16S ribosomal RNA: a tRNA X motif of 26 nucleotides including the anticodon stem-loop, and seven 16S rRNA X motifs of length greater than or equal to 15 nucleotides. Window lengths of reading frame retrieval with each trinucleotide of these X motifs are also determined. Thanks to the crystal structure 3I8G (Jenner et al., 2010), a 3D visualization of X motifs in the ribosome shows several spatial configurations involving mRNA X motifs, A-tRNA and E-tRNA X motifs, and four 16S rRNA X motifs. Another identified 16S rRNA X motif is involved in the decoding center, which recognizes the codon-anticodon helix in A-tRNA. From a code theory point of view, these identified X circular code motifs and their mathematical properties may constitute a translation code involved in retrieval, maintenance and synchronization of reading frames in genes.
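
The two ideas in the abstract, frame retrieval within a window and the search for X motifs, can be illustrated with a minimal Python sketch. It is not the authors' method. The 20 trinucleotides in `X` are the set as commonly listed in the literature and should be checked against Arquès and Michel (1996). The function names `compatible_frames` and `find_x_motifs` are hypothetical. The definition of "compatible" used here (a suffix of an X word, then whole X words, then a prefix of an X word) and the definition of a motif (a maximal run of consecutive X trinucleotides in one frame) are simplifying assumptions that may differ from the paper's definitions.

```python
# Circular code X: 20 trinucleotides as commonly listed in the literature
# (assumption: verify against Arquès and Michel, 1996).
X = frozenset("""AAC AAT ACC ATC ATT CAG CTC CTG GAA GAC
GAG GAT GCC GGC GGT GTA GTC GTT TAC TTC""".split())


def compatible_frames(window):
    """Return the frames f in {0, 1, 2} in which `window` can be read as
    (length-f suffix of an X word) + (whole X words) + (prefix of an X word).
    This is an illustrative definition, not necessarily the paper's."""
    if len(window) < 3:
        raise ValueError("window must contain at least 3 nucleotides")
    frames = set()
    for f in range(3):
        head = window[:f]                          # partial word before frame f
        body_end = f + 3 * ((len(window) - f) // 3)
        codons = [window[i:i + 3] for i in range(f, body_end, 3)]
        tail = window[body_end:]                   # partial word at the end
        if (any(w.endswith(head) for w in X)
                and all(c in X for c in codons)
                and any(w.startswith(tail) for w in X)):
            frames.add(f)
    return frames


def find_x_motifs(seq, min_len=15):
    """Return sorted (start, end) spans of maximal runs of consecutive X
    trinucleotides, in any frame, that are at least `min_len` nucleotides long.
    RNA input is accepted: U is mapped to T first."""
    seq = seq.upper().replace("U", "T")
    motifs = []
    for f in range(3):
        start = None
        # Going one step past the last full codon forces a final flush,
        # because an incomplete (or empty) codon is never in X.
        for i in range(f, len(seq) + 1, 3):
            if seq[i:i + 3] in X:
                if start is None:
                    start = i
            else:
                if start is not None and i - start >= min_len:
                    motifs.append((start, i))
                start = None
    return sorted(motifs)


# A 12-nucleotide window built from X words is readable in one frame only,
# while a single trinucleotide can remain ambiguous.
print(compatible_frames("GAAGTTCTGACC"))
print(compatible_frames("ACC"))
# A 15-nucleotide run of X words embedded in an RNA-like sequence.
print(find_x_motifs("UUGAAGUUCUGACCGGUAAA"))
```

In this sketch, a short window such as `ACC` stays compatible with more than one frame, which is the kind of ambiguity the first part of the paper quantifies, while the 12-nucleotide example is compatible with frame 0 only.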
