Shannon information theoretic computation of synonymous codon usage biases in coding regions of human and mouse genomes.

Author information

Zeeberg Barry

Affiliation

Laboratory of Molecular Pharmacology, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Publication information

Genome Res. 2002 Jun;12(6):944-55. doi: 10.1101/gr.213402.

Abstract

Exonic GC of human mRNA reference sequences (RefSeqs), as well as A, C, G, and T in codon position 3 are linearly correlated with genomic GC. These observations utilize information from the completed human genome sequence and a large, high-quality set of human and mouse coding sequences, and are in accord with similar determinations published by others. A Shannon Information Theoretic measure of bias in synonymous codon usage was developed. When applied to either human or mouse RefSeqs, this measure is nonlinearly correlated with genomic, exonic, and third codon position A, C, G, and T. Information values between orthologous mouse and human RefSeqs are linearly correlated: mouse = 0.092 + 0.55 human. Mouse genes were consistently placed in genomic regions whose GC content was closer to 50% than was the GC content of the human ortholog. Since the (nonlinear) information versus percent GC curve has a minimum at 50% GC and monotonically increases with increasing distance from 50% GC, this phenomenon directly results in the low slope of 0.55. This appears to be a manifestation of an evolutionary strategy for placement of genes in regions of the genome with a GC content that relates synonymous codon bias and protein folding.
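The abstract does not give the formula for its information measure, so the following is only a minimal sketch of one common Shannon-style formulation, not the paper's method. It assumes that, for each amino acid, bias is the gap between the maximum possible entropy of its synonymous codons and the observed entropy, averaged over amino acids in proportion to their usage. The function name `codon_bias_information` and the weighting scheme are illustrative assumptions.

```python
import math
from collections import Counter, defaultdict

# Standard genetic code (NCBI translation table 1), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Number of synonymous codons available for each amino acid.
DEGENERACY = Counter(aa for aa in CODON_TABLE.values() if aa != "*")


def codon_bias_information(cds: str) -> float:
    """Usage-weighted information (bits) in synonymous codon choice.

    Assumed formulation (hypothetical, not taken from the paper):
    sum over amino acids of f(aa) * (log2(degeneracy) - H(codons | aa)).
    Returns 0.0 when every amino acid uses its codons uniformly.
    """
    cds = cds.upper().replace("U", "T")
    counts = defaultdict(Counter)
    # Walk the sequence in frame, ignoring a trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE.get(codon)
        if aa is None or aa == "*":
            continue  # skip ambiguous codons and stop codons
        counts[aa][codon] += 1

    total = sum(sum(c.values()) for c in counts.values())
    if total == 0:
        return 0.0

    info = 0.0
    for aa, codon_counts in counts.items():
        n = sum(codon_counts.values())
        entropy = -sum((c / n) * math.log2(c / n) for c in codon_counts.values())
        info += (n / total) * (math.log2(DEGENERACY[aa]) - entropy)
    return info
```

Under this assumed definition, a sequence that uses all four alanine codons equally scores 0 bits, while one that uses a single alanine codon exclusively scores 2 bits. Methionine and tryptophan, which each have only one codon, contribute nothing.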
