• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用基因组特征研究原核生物的基因组同质性。

Examination of genome homogeneity in prokaryotes using genomic signatures.

机构信息

Department of Food Safety, Norwegian School of Veterinary Science, Oslo, Norway.

出版信息

PLoS One. 2009 Dec 2;4(12):e8113. doi: 10.1371/journal.pone.0008113.

DOI:10.1371/journal.pone.0008113
PMID:19956556
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2781299/
Abstract

BACKGROUND

DNA word frequencies, normalized for genomic AT content, are remarkably stable within prokaryotic genomes and are therefore said to reflect a "genomic signature." The genomic signatures can be used to phylogenetically classify organisms from arbitrary sampled DNA. Genomic signatures can also be used to search for horizontally transferred DNA or DNA regions subjected to special selection forces. Thus, the stability of the genomic signature can be used as a measure of genomic homogeneity. The factors associated with the stability of the genomic signatures are not known, and this motivated us to investigate further. We analyzed the intra-genomic variance of genomic signatures based on AT content normalization (0(th) order Markov model) as well as genomic signatures normalized by smaller DNA words (1(st) and 2(nd) order Markov models) for 636 sequenced prokaryotic genomes. Regression models were fitted, with intra-genomic signature variance as the response variable, to a set of factors representing genomic properties such as genomic AT content, genome size, habitat, phylum, oxygen requirement, optimal growth temperature and oligonucleotide usage variance (OUV, a measure of oligonucleotide usage bias), measured as the variance between genomic tetranucleotide frequencies and Markov chain approximated tetranucleotide frequencies, as predictors.

PRINCIPAL FINDINGS

Regression analysis revealed that OUV was the most important factor (p<0.001) determining intra-genomic homogeneity as measured using genomic signatures. This means that the less random the oligonucleotide usage is in the sense of higher OUV, the more homogeneous the genome is in terms of the genomic signature. The other factors influencing variance in the genomic signature (p<0.001) were genomic AT content, phylum and oxygen requirement.

CONCLUSIONS

Genomic homogeneity in prokaryotes is intimately linked to genomic GC content, oligonucleotide usage bias (OUV) and aerobiosis, while oligonucleotide usage bias (OUV) is associated with genomic GC content, aerobiosis and habitat.

摘要

背景

经基因组 AT 含量归一化后的 DNA 单词频率在原核基因组中非常稳定,因此被认为反映了一种“基因组特征”。基因组特征可用于对任意采样 DNA 进行系统发育分类的生物体。基因组特征也可用于搜索水平转移的 DNA 或受特殊选择压力影响的 DNA 区域。因此,基因组特征的稳定性可用作基因组同质性的衡量标准。与基因组特征稳定性相关的因素尚不清楚,这促使我们进行了进一步的研究。我们基于 AT 含量归一化(0 阶马尔可夫模型)以及通过较小的 DNA 单词归一化的基因组特征(1 阶和 2 阶马尔可夫模型),分析了 636 个已测序原核基因组的基因组特征内的方差。以基因组特征内方差作为响应变量,针对一系列代表基因组特性的因素(包括基因组 AT 含量、基因组大小、栖息地、门、氧气需求、最佳生长温度和寡核苷酸使用方差[OUV,衡量寡核苷酸使用偏向的指标,作为基因组四核苷酸频率和马尔可夫链逼近四核苷酸频率之间的方差])拟合回归模型。

主要发现

回归分析表明,寡核苷酸使用方差(OUV)是决定使用基因组特征衡量的基因组内同质性的最重要因素(p<0.001)。这意味着,寡核苷酸使用的随机性越低,即 OUV 越高,基因组在基因组特征方面的同质性就越高。影响基因组特征方差的其他因素(p<0.001)包括基因组 AT 含量、门和氧气需求。

结论

原核生物的基因组同质性与基因组 GC 含量、寡核苷酸使用偏向(OUV)和需氧性密切相关,而寡核苷酸使用偏向(OUV)与基因组 GC 含量、需氧性和栖息地相关。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/4a845ab185d6/pone.0008113.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/9adf5108f1ba/pone.0008113.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/eda7ecd060ef/pone.0008113.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/0e2e00672d11/pone.0008113.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/4061789d9605/pone.0008113.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/f52364ee3b86/pone.0008113.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/4a845ab185d6/pone.0008113.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/9adf5108f1ba/pone.0008113.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/eda7ecd060ef/pone.0008113.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/0e2e00672d11/pone.0008113.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/4061789d9605/pone.0008113.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/f52364ee3b86/pone.0008113.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/373b/2781299/4a845ab185d6/pone.0008113.g006.jpg

相似文献

1
Examination of genome homogeneity in prokaryotes using genomic signatures.利用基因组特征研究原核生物的基因组同质性。
PLoS One. 2009 Dec 2;4(12):e8113. doi: 10.1371/journal.pone.0008113.
2
Reliability and applications of statistical methods based on oligonucleotide frequencies in bacterial and archaeal genomes.基于寡核苷酸频率的统计方法在细菌和古细菌基因组中的可靠性及应用
BMC Genomics. 2008 Feb 28;9:104. doi: 10.1186/1471-2164-9-104.
3
Analysis of intra-genomic GC content homogeneity within prokaryotes.原核生物基因组内 GC 含量同质性分析。
BMC Genomics. 2010 Aug 6;11:464. doi: 10.1186/1471-2164-11-464.
4
Analysis of genomic signatures in prokaryotes using multinomial regression and hierarchical clustering.使用多项回归和层次聚类分析原核生物中的基因组特征。
BMC Genomics. 2009 Oct 21;10:487. doi: 10.1186/1471-2164-10-487.
5
Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes.自然选择在原核生物中保留了大量的框架外终止密码子,以防止移码肽的产生。
BMC Genomics. 2010 Sep 9;11:491. doi: 10.1186/1471-2164-11-491.
6
Investigations of oligonucleotide usage variance within and between prokaryotes.原核生物内部以及不同原核生物之间寡核苷酸使用差异的研究。
PLoS Comput Biol. 2008 Apr 18;4(4):e1000057. doi: 10.1371/journal.pcbi.1000057.
7
Amino acid usage is asymmetrically biased in AT- and GC-rich microbial genomes.在富含 AT 和 GC 的微生物基因组中,氨基酸的使用存在不对称的偏向性。
PLoS One. 2013 Jul 26;8(7):e69878. doi: 10.1371/journal.pone.0069878. Print 2013.
8
Global features of sequences of bacterial chromosomes, plasmids and phages revealed by analysis of oligonucleotide usage patterns.通过寡核苷酸使用模式分析揭示的细菌染色体、质粒和噬菌体序列的全局特征。
BMC Bioinformatics. 2004 Jul 7;5:90. doi: 10.1186/1471-2105-5-90.
9
Genomic signatures in microbes -- properties and applications.微生物中的基因组特征——特性与应用
ScientificWorldJournal. 2011 Mar 22;11:715-25. doi: 10.1100/tsw.2011.70.
10
Quantifying the species-specificity in genomic signatures, synonymous codon choice, amino acid usage and G+C content.量化基因组特征、同义密码子选择、氨基酸使用和G+C含量中的物种特异性。
Gene. 2003 Jun 5;311:35-42. doi: 10.1016/s0378-1119(03)00581-x.

引用本文的文献

1
Compression rates of microbial genomes are associated with genome size and base composition.微生物基因组的压缩率与基因组大小和碱基组成有关。
Genomics Inform. 2024 Oct 10;22(1):16. doi: 10.1186/s44342-024-00018-z.
2
Genomic Signature in Evolutionary Biology: A Review.进化生物学中的基因组特征:综述
Biology (Basel). 2023 Feb 16;12(2):322. doi: 10.3390/biology12020322.
3
A simple stochastic model describing the evolution of genomic GC content in asexually reproducing organisms.一个简单的随机模型,描述了无性繁殖生物中基因组 GC 含量的进化。

本文引用的文献

1
Detecting horizontally transferred and essential genes based on dinucleotide relative abundance.基于二核苷酸相对丰度检测水平转移基因和必需基因。
DNA Res. 2008 Oct;15(5):267-76. doi: 10.1093/dnares/dsn021. Epub 2008 Sep 16.
2
Investigations of oligonucleotide usage variance within and between prokaryotes.原核生物内部以及不同原核生物之间寡核苷酸使用差异的研究。
PLoS Comput Biol. 2008 Apr 18;4(4):e1000057. doi: 10.1371/journal.pcbi.1000057.
3
Ecophysiological significance of scale-dependent patterns in prokaryotic genomes unveiled by a combination of statistic and genometric analyses.
Sci Rep. 2022 Nov 3;12(1):18569. doi: 10.1038/s41598-022-21709-z.
4
Classification of COVID-19 and Other Pathogenic Sequences: A Dinucleotide Frequency and Machine Learning Approach.新型冠状病毒肺炎及其他致病序列的分类:一种二核苷酸频率与机器学习方法
IEEE Access. 2020 Oct 15;8:195263-195273. doi: 10.1109/ACCESS.2020.3031387. eCollection 2020.
5
Insights from the complete genome sequence of Clostridium tyrobutyricum provide a platform for biotechnological and industrial applications.酪丁酸梭菌全基因组序列的见解为生物技术和工业应用提供了一个平台。
J Ind Microbiol Biotechnol. 2017 Aug;44(8):1245-1260. doi: 10.1007/s10295-017-1956-6. Epub 2017 May 23.
6
The nucleotide composition of microbial genomes indicates differential patterns of selection on core and accessory genomes.微生物基因组的核苷酸组成表明,核心基因组和辅助基因组受到不同的选择模式影响。
BMC Genomics. 2017 Feb 10;18(1):151. doi: 10.1186/s12864-017-3543-7.
7
Phylogenomic Analysis of Oenococcus oeni Reveals Specific Domestication of Strains to Cider and Wines.酒类酒球菌的系统基因组分析揭示了菌株对苹果酒和葡萄酒的特异性驯化。
Genome Biol Evol. 2015 May 14;7(6):1506-18. doi: 10.1093/gbe/evv084.
8
Microbial genomic taxonomy.微生物基因组分类学。
BMC Genomics. 2013 Dec 23;14:913. doi: 10.1186/1471-2164-14-913.
9
Amino acid usage is asymmetrically biased in AT- and GC-rich microbial genomes.在富含 AT 和 GC 的微生物基因组中,氨基酸的使用存在不对称的偏向性。
PLoS One. 2013 Jul 26;8(7):e69878. doi: 10.1371/journal.pone.0069878. Print 2013.
10
Relative entropy differences in bacterial chromosomes, plasmids, phages and genomic islands.细菌染色体、质粒、噬菌体和基因组岛的相对熵差异。
BMC Genomics. 2012 Feb 10;13:66. doi: 10.1186/1471-2164-13-66.
通过统计分析和基因组分析相结合揭示的原核生物基因组中尺度依赖性模式的生态生理学意义。
Genomics. 2008 Jun;91(6):538-43. doi: 10.1016/j.ygeno.2008.03.001. Epub 2008 Apr 16.
4
Reliability and applications of statistical methods based on oligonucleotide frequencies in bacterial and archaeal genomes.基于寡核苷酸频率的统计方法在细菌和古细菌基因组中的可靠性及应用
BMC Genomics. 2008 Feb 28;9:104. doi: 10.1186/1471-2164-9-104.
5
Bacterial genome sequencing and its use in infectious diseases.细菌基因组测序及其在传染病中的应用。
Lancet Infect Dis. 2007 Nov;7(11):711-23. doi: 10.1016/S1473-3099(07)70260-8.
6
Bacterial pathogenomics.细菌病原组学
Nature. 2007 Oct 18;449(7164):835-42. doi: 10.1038/nature06248.
7
Reconstructing the ancestor of Mycobacterium leprae: the dynamics of gene loss and genome reduction.重建麻风分枝杆菌的祖先:基因丢失与基因组缩减的动态变化
Genome Res. 2007 Aug;17(8):1178-85. doi: 10.1101/gr.6360207. Epub 2007 Jul 10.
8
RNAmmer: consistent and rapid annotation of ribosomal RNA genes.RNAmmer:核糖体RNA基因的一致性快速注释
Nucleic Acids Res. 2007;35(9):3100-8. doi: 10.1093/nar/gkm160. Epub 2007 Apr 22.
9
GC content and genome length in Chargaff compliant genomes.符合查加夫法则的基因组中的鸟嘌呤-胞嘧啶含量及基因组长度
Biochem Biophys Res Commun. 2007 Feb 2;353(1):207-10. doi: 10.1016/j.bbrc.2006.12.008. Epub 2006 Dec 11.
10
An environmental signature for 323 microbial genomes based on codon adaptation indices.基于密码子适应指数的323个微生物基因组的环境特征
Genome Biol. 2006;7(12):R114. doi: 10.1186/gb-2006-7-12-r114.