• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

九个基因组中的基因在二核苷酸组成空间中按其所属生物体进行分离。

Genes from nine genomes are separated into their organisms in the dinucleotide composition space.

作者信息

Nakashima H, Ota M, Nishikawa K, Ooi T

机构信息

School of Health Sciences, Faculty of Medicine, Kanazawa University, Japan.

出版信息

DNA Res. 1998 Oct 30;5(5):251-9. doi: 10.1093/dnares/5.5.251.

DOI:10.1093/dnares/5.5.251
PMID:9872449
Abstract

A set of 16 kinds of dinucleotide compositions was used to analyze the protein-encoding nucleotide sequences in nine complete genomes: Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Mycoplasma genitalium, Mycoplasma pneumoniae, Synechocystis sp., Methanococcus jannaschii, Archaeoglobus fulgidus, and Saccharomyces cerevisiae. The dinucleotide composition was significantly different between the organisms. The distribution of genes from an organism was clustered around its center in the dinucleotide composition space. The genes from closely related organisms such as Gram-negative bacteria, mycoplasma species and eukaryotes showed some overlap in the space. The genes from nine complete genomes together with those from human were discriminated into respective clusters with 80% accuracy using the dinucleotide composition alone. The composition data estimated from a whole genome was close to that obtained from genes, indicating that the characteristic feature of dinucleotides holds not only for protein coding regions but also noncoding regions. When a dendrogram was constructed from the disposition of the clusters in the dinucleotide space, it resembled the real phylogenetic tree. Thus, the distinct feature observed in the dinucleotide composition may reflect the phylogenetic relationship of organisms.

摘要

使用一组16种二核苷酸组成来分析9个完整基因组中的蛋白质编码核苷酸序列,这些基因组包括:大肠杆菌、流感嗜血杆菌、幽门螺杆菌、生殖支原体、肺炎支原体、聚球藻属、詹氏甲烷球菌、嗜热栖热菌和酿酒酵母。不同生物体之间的二核苷酸组成存在显著差异。生物体的基因分布在二核苷酸组成空间中围绕其中心聚集。来自亲缘关系较近的生物体(如革兰氏阴性菌、支原体物种和真核生物)的基因在该空间中显示出一些重叠。仅使用二核苷酸组成,就能以80%的准确率将来自9个完整基因组以及人类的基因区分到各自的簇中。从整个基因组估计的组成数据与从基因获得的数据相近,这表明二核苷酸的特征不仅适用于蛋白质编码区域,也适用于非编码区域。当根据二核苷酸空间中簇的分布构建树形图时,它类似于真实的系统发育树。因此,在二核苷酸组成中观察到的独特特征可能反映了生物体的系统发育关系。

相似文献

1
Genes from nine genomes are separated into their organisms in the dinucleotide composition space.九个基因组中的基因在二核苷酸组成空间中按其所属生物体进行分离。
DNA Res. 1998 Oct 30;5(5):251-9. doi: 10.1093/dnares/5.5.251.
2
Comparison of archaeal and bacterial genomes: computer analysis of protein sequences predicts novel functions and suggests a chimeric origin for the archaea.古菌与细菌基因组的比较:蛋白质序列的计算机分析预测新功能并暗示古菌的嵌合起源。
Mol Microbiol. 1997 Aug;25(4):619-37. doi: 10.1046/j.1365-2958.1997.4821861.x.
3
Microbial genome analyses: global comparisons of transport capabilities based on phylogenies, bioenergetics and substrate specificities.微生物基因组分析:基于系统发育、生物能量学和底物特异性的转运能力全局比较。
J Mol Biol. 1998 Apr 3;277(3):573-92. doi: 10.1006/jmbi.1998.1609.
4
The frequency distribution of gene family sizes in complete genomes.完整基因组中基因家族大小的频率分布。
Mol Biol Evol. 1998 May;15(5):583-9. doi: 10.1093/oxfordjournals.molbev.a025959.
5
A genomic perspective on protein families.蛋白质家族的基因组视角。
Science. 1997 Oct 24;278(5338):631-7. doi: 10.1126/science.278.5338.631.
6
Compositional biases of bacterial genomes and evolutionary implications.细菌基因组的组成偏差及其进化意义。
J Bacteriol. 1997 Jun;179(12):3899-913. doi: 10.1128/jb.179.12.3899-3913.1997.
7
Differences in dinucleotide frequencies of human, yeast, and Escherichia coli genes.
DNA Res. 1997 Jun 30;4(3):185-92. doi: 10.1093/dnares/4.3.185.
8
Polypurine.polypyrimidine sequences in complete bacterial genomes: preference for polypurines in protein-coding regions.完整细菌基因组中的聚嘌呤-聚嘧啶序列:蛋白质编码区域对聚嘌呤的偏好性。
Gene. 2000 Jan 25;242(1-2):275-83. doi: 10.1016/s0378-1119(99)00505-3.
9
Abundant microsatellite polymorphism in Saccharomyces cerevisiae, and the different distributions of microsatellites in eight prokaryotes and S. cerevisiae, result from strong mutation pressures and a variety of selective forces.酿酒酵母中丰富的微卫星多态性,以及微卫星在八种原核生物和酿酒酵母中的不同分布,是由强大的突变压力和多种选择力导致的。
Proc Natl Acad Sci U S A. 1998 Feb 17;95(4):1647-52. doi: 10.1073/pnas.95.4.1647.
10
Differentiation of single-cell organisms according to elongation stages crucial for gene expression efficacy.
FEBS Lett. 2002 Apr 10;516(1-3):87-92. doi: 10.1016/s0014-5793(02)02507-3.

引用本文的文献

1
Statistical Analysis and Tokenization of Epitopes to Construct Artificial Neoepitope Libraries.对表位进行统计分析和标记化,构建人工新表位文库。
ACS Synth Biol. 2023 Oct 20;12(10):2812-2818. doi: 10.1021/acssynbio.3c00201. Epub 2023 Sep 13.
2
Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking.为真实细菌群落中的几乎所有组件构建宏基因组组装基因组,用于分箱基准测试。
BMC Genomics. 2022 Nov 10;23(1):746. doi: 10.1186/s12864-022-08967-x.
3
Comparative analysis of machine learning algorithms on the microbial strain-specific AMP prediction.
机器学习算法在微生物菌株特异性 AMP 预测上的比较分析。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac233.
4
A completeness-independent method for pre-selection of closely related genomes for species delineation in prokaryotes.一种用于原核生物物种划分的完整无关的近缘基因组预筛选方法。
BMC Genomics. 2020 Feb 26;21(1):183. doi: 10.1186/s12864-020-6597-x.
5
A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms.一种基于高分辨率基因组组成的方法,具有区分相似细菌的能力。
BMC Genomics. 2019 Oct 21;20(1):754. doi: 10.1186/s12864-019-6119-x.
6
MaxBin: an automated binning method to recover individual genomes from metagenomes using an expectation-maximization algorithm.MaxBin:一种基于期望最大化算法的自动分类方法,可从宏基因组中回收单个基因组。
Microbiome. 2014 Aug 1;2:26. doi: 10.1186/2049-2618-2-26. eCollection 2014.
7
Microbial lifestyle and genome signatures.微生物的生活方式和基因组特征。
Curr Genomics. 2012 Apr;13(2):153-62. doi: 10.2174/138920212799860698.
8
Coordinating environmental genomics and geochemistry reveals metabolic transitions in a hot spring ecosystem.协调环境基因组学和地球化学揭示了温泉生态系统中的代谢转变。
PLoS One. 2012;7(6):e38108. doi: 10.1371/journal.pone.0038108. Epub 2012 Jun 4.
9
Mining genomic patterns in Mycobacterium tuberculosis H37Rv using a web server Tuber-Gene.利用 Tuber-Gene 网络服务器挖掘结核分枝杆菌 H37Rv 的基因组模式。
Genomics Proteomics Bioinformatics. 2011 Oct;9(4-5):171-8. doi: 10.1016/S1672-0229(11)60020-X.
10
Differences in dinucleotide frequencies of thermophilic genes encoding water soluble and membrane proteins.嗜热基因编码水溶性和膜蛋白的二核苷酸频率差异。
J Zhejiang Univ Sci B. 2011 Jun;12(6):419-27. doi: 10.1631/jzus.B1000331.