Differences in dinucleotide frequencies of human, yeast, and Escherichia coli genes.

Authors

Nakashima H, Nishikawa K, Ooi T

Affiliation

School of Health Sciences, Faculty of Medicine, Kanazawa University, Japan.

Publication information

DNA Res. 1997 Jun 30;4(3):185-92. doi: 10.1093/dnares/4.3.185.

DOI:10.1093/dnares/4.3.185
PMID:9330906
Abstract

Protein-coding nucleotide sequences from human, yeast, and Escherichia coli genes were analyzed in terms of dinucleotide occurrences. Each gene is plotted as a point in dinucleotide space, which is spanned by 16 axes corresponding to the 16 possible dinucleotides. The metric in this space is defined using the log-odds ratio of dinucleotide occurrences in a gene. The distribution of points showed that genes from the same organism cluster together. The human and E. coli clusters are completely separated, with the yeast cluster lying between them, implying that individual genes can be assigned to one of the three sources from their location in the space. Indeed, genes could be identified with an accuracy of 90% using the DNA data alone. Even genes encoding homologous proteins belonging to the same protein superfamily were discriminated by the DNA data and assigned to their correct sources with the same accuracy. DNA sequences of non-coding regions, including human introns, as well as human genes of GC-rich and GC-poor types, were analyzed in the same manner. The most significant finding is that human genomic DNA sequences, genes and introns alike, exhibit the largest deviation of dinucleotide occurrence from the random expectation. Possible origins of this phenomenon are discussed.
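The abstract does not give the exact formula behind the dinucleotide space, so the following is only a minimal sketch of one common way to build such a 16-component vector. It assumes the log-odds ratio is log(f_xy / (f_x · f_y)), where f_xy is the observed dinucleotide frequency and f_x, f_y are the mononucleotide frequencies. The function names, the pseudocount, and the Euclidean distance are illustrative assumptions, not the authors' published procedure.

```python
import math
from collections import Counter
from itertools import product

BASES = "ACGT"
# The 16 axes of the dinucleotide space, in a fixed order: AA, AC, ..., TT.
DINUCLEOTIDES = ["".join(pair) for pair in product(BASES, repeat=2)]


def dinucleotide_log_odds(seq, pseudocount=0.5):
    """Return a 16-component vector of log(f_xy / (f_x * f_y)) for one sequence.

    Assumption: overlapping dinucleotides are counted, and a small
    pseudocount (hypothetical choice) avoids log(0) for absent dinucleotides.
    """
    seq = seq.upper()
    mono = Counter(b for b in seq if b in BASES)
    di = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if seq[i] in BASES and seq[i + 1] in BASES
    )
    n_mono = sum(mono.values())
    n_di = sum(di.values())
    if n_mono == 0 or n_di == 0:
        raise ValueError("sequence contains no usable dinucleotides")

    vector = []
    for d in DINUCLEOTIDES:
        f_xy = (di[d] + pseudocount) / (n_di + 16 * pseudocount)
        f_x = (mono[d[0]] + pseudocount) / (n_mono + 4 * pseudocount)
        f_y = (mono[d[1]] + pseudocount) / (n_mono + 4 * pseudocount)
        vector.append(math.log(f_xy / (f_x * f_y)))
    return vector


def euclidean_distance(u, v):
    """Distance between two genes in the 16-dimensional space (assumed metric)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


if __name__ == "__main__":
    gene_a = "ACGT" * 50   # toy sequence, not real data
    gene_b = "AATT" * 50   # toy sequence, not real data
    vec_a = dinucleotide_log_odds(gene_a)
    vec_b = dinucleotide_log_odds(gene_b)
    print(len(vec_a), euclidean_distance(vec_a, vec_b) > 0)
```

Under these assumptions, a gene could be assigned to the organism whose cluster center lies closest to its vector; the 90% accuracy reported in the abstract refers to the authors' own method and data, not to this sketch.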
