Testing the infinitely many genes model for the evolution of the bacterial core genome and pangenome.

Affiliation

Origins Institute and Department of Physics and Astronomy, McMaster University, Hamilton, Ontario, Canada.

Publication information

Mol Biol Evol. 2012 Nov;29(11):3413-25. doi: 10.1093/molbev/mss163. Epub 2012 Jun 29.

DOI: 10.1093/molbev/mss163
PMID: 22752048
Abstract

When groups of related bacterial genomes are compared, the number of core genes found in all genomes is usually much less than the mean genome size, whereas the size of the pangenome (the set of genes found on at least one of the genomes) is much larger than the mean size of one genome. We analyze 172 complete genomes of Bacilli and compare the properties of the pangenomes and core genomes of monophyletic subsets taken from this group. We then assess the capabilities of several evolutionary models to predict these properties. The infinitely many genes (IMG) model is based on the assumption that each new gene can arise only once. The predictions of the model depend on the shape of the evolutionary tree that underlies the divergence of the genomes. We calculate results for coalescent trees, star trees, and arbitrary phylogenetic trees of predefined fixed branch length. On a star tree, the pangenome size increases linearly with the number of genomes, as has been suggested in some previous studies, whereas on a coalescent tree, it increases logarithmically. The coalescent tree gives a better fit to the data, for all the examples we consider. In some cases, a fixed phylogenetic tree proved better than the coalescent tree at reproducing structure in the gene frequency spectrum, but little improvement was gained in predictions of the core and pangenome sizes. Most of the data are well explained by a model with three classes of gene: an essential class that is found in all genomes, a slow class whose rate of origination and deletion is slow compared with the time of divergence of the genomes, and a fast class showing rapid origination and deletion. Although the majority of genes originating in a genome are in the fast class, these genes are not retained for long periods, and the majority of genes present in a genome are in the slow or essential classes. 
In general, we show that the IMG model is useful for comparison with experimental genome data both for species level and widely divergent taxonomic groups. Software implementing the described formulae is provided at http://github.com/rec3141/pangenome.
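
The contrast between linear growth on a star tree and logarithmic growth on a coalescent tree can be illustrated with a minimal Python sketch. It is not the authors' software, and every name and parameter in it is an assumption made for illustration: `u` is a gene origination rate per genome, `v` a deletion rate per gene, `t` the depth of the star tree, and `theta` and `rho` are coalescent-scaled gain and loss rates. The star-tree expression follows from treating each lineage as independent after divergence from a stationary ancestor with `u / v` genes. The coalescent expression is the commonly quoted form of the expected pangenome size under the IMG model in the coalescent-scaled parametrization, which may differ from the notation used in the paper. A single gene class is assumed, so the essential, slow, and fast classes described in the abstract are not modeled here.

```python
import math


def pangenome_star(n, u, v, t):
    """Expected pangenome size for n genomes on a star tree of depth t.

    Assumes one gene class, origination rate u per genome, deletion
    rate v per gene, and a stationary ancestor carrying u / v genes.
    """
    g0 = u / v                    # stationary mean genome size
    p = math.exp(-v * t)          # chance an ancestral gene survives one lineage
    ancestral = g0 * (1.0 - (1.0 - p) ** n)  # ancestral genes kept by >= 1 genome
    private = n * g0 * (1.0 - p)             # genes gained after divergence
    return ancestral + private


def core_star(n, u, v, t):
    """Expected number of ancestral genes retained by all n genomes."""
    return (u / v) * math.exp(-n * v * t)


def pangenome_coalescent(n, theta, rho):
    """Expected pangenome size for n genomes sampled from a coalescent tree.

    Assumes the coalescent-scaled parametrization in which the mean
    genome size is theta / rho.
    """
    return theta * sum(1.0 / (rho + i) for i in range(n))


if __name__ == "__main__":
    # Hypothetical parameter values chosen so both models share a mean
    # genome size of 2000 genes; they are not fitted to any data.
    u, v, t = 2000.0, 1.0, 0.5
    theta, rho = 2000.0, 1.0
    for n in (1, 2, 10, 100, 1000):
        star = pangenome_star(n, u, v, t)
        coal = pangenome_coalescent(n, theta, rho)
        print(f"n={n:5d}  star={star:12.1f}  coalescent={coal:12.1f}")
```

For large `n` the star-tree value grows by a constant `(u / v) * (1 - exp(-v * t))` genes per added genome, whereas the coalescent value grows by `theta / (rho + n)`, which shrinks as `n` increases and gives the logarithmic behavior.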
