• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于识别中度相关微生物基因组核心结构的多基因组比对

Multiple genome alignment for identifying the core structure among moderately related microbial genomes.

作者信息

Uchiyama Ikuo

机构信息

Department of Theoretical Biology, National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Aichi, Japan.

出版信息

BMC Genomics. 2008 Oct 31;9:515. doi: 10.1186/1471-2164-9-515.

DOI:10.1186/1471-2164-9-515
PMID:18976470
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2615449/
Abstract

BACKGROUND

Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders.

RESULTS

The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes.

CONCLUSION

The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.

摘要

背景

在相关基因组中识别内在保守基因集或基因组核心,对于理解水平基因转移常见的原核生物基因组至关重要。尽管在亲缘关系非常近的基因组中,核心基因组的识别似乎很明显,但在比较亲缘关系较远的基因组时,难度会增加。在这里,我们将核心结构视为一组足够长的片段,其中基因顺序是保守的,因此它们很可能主要通过垂直转移遗传而来,并开发了一种方法,通过找到预先识别的直系同源组(OGs)的顺序来识别核心结构,该顺序能最大程度地保留保守的基因顺序。

结果

该方法应用于两个特征明确的科(芽孢杆菌科和肠杆菌科)的基因组比较,分别识别出它们的核心结构包含1438个和2125个OGs。核心集包含了大部分必需基因及其相关基因,这些基因主要包含在两个核心集的交集中,约有700个OGs。基于基因顺序保守性的基因组核心定义被证明比仅基于基因保守性的更简单方法更稳健。我们还从G+C含量同质性和系统发育一致性方面研究了核心结构,发现核心基因比非核心基因更主要地表现出预期特征,即具有本土性且共享相同历史。

结论

结果表明,我们基于基因顺序保守性的基因组比对策略能够为识别中等亲缘关系的微生物基因组中的基因组核心提供一种有效方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/ee61c13ce308/1471-2164-9-515-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/69e47762f938/1471-2164-9-515-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/b58ef5cc093f/1471-2164-9-515-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/9549718584cd/1471-2164-9-515-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/e8d78235414a/1471-2164-9-515-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/3dfc9ab604bc/1471-2164-9-515-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/75aa869ec97f/1471-2164-9-515-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/9d9ba7da4023/1471-2164-9-515-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/1b9fb96f9d05/1471-2164-9-515-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/afbfdf0ab17c/1471-2164-9-515-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/ee61c13ce308/1471-2164-9-515-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/69e47762f938/1471-2164-9-515-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/b58ef5cc093f/1471-2164-9-515-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/9549718584cd/1471-2164-9-515-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/e8d78235414a/1471-2164-9-515-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/3dfc9ab604bc/1471-2164-9-515-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/75aa869ec97f/1471-2164-9-515-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/9d9ba7da4023/1471-2164-9-515-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/1b9fb96f9d05/1471-2164-9-515-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/afbfdf0ab17c/1471-2164-9-515-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dc7/2615449/ee61c13ce308/1471-2164-9-515-10.jpg

相似文献

1
Multiple genome alignment for identifying the core structure among moderately related microbial genomes.用于识别中度相关微生物基因组核心结构的多基因组比对
BMC Genomics. 2008 Oct 31;9:515. doi: 10.1186/1471-2164-9-515.
2
A phylogenetic analysis of the pSymB replicon from the Sinorhizobium meliloti genome reveals a complex evolutionary history.对苜蓿中华根瘤菌基因组中pSymB复制子的系统发育分析揭示了其复杂的进化历史。
Can J Microbiol. 2003 Apr;49(4):269-80. doi: 10.1139/w03-037.
3
Using homolog groups to create a whole-genomic tree of free-living organisms: an update.利用同源基因簇构建自由生活生物体的全基因组树:最新进展
J Mol Evol. 2002 Apr;54(4):539-47. doi: 10.1007/s00239-001-0054-5.
4
A database of phylogenetically atypical genes in archaeal and bacterial genomes, identified using the DarkHorse algorithm.一个使用黑马算法识别出的古菌和细菌基因组中系统发育非典型基因的数据库。
BMC Bioinformatics. 2008 Oct 7;9:419. doi: 10.1186/1471-2105-9-419.
5
A selective barrier to horizontal gene transfer in the T4-type bacteriophages that has preserved a core genome with the viral replication and structural genes.T4型噬菌体中存在一种对水平基因转移的选择性屏障,它保留了包含病毒复制和结构基因的核心基因组。
Mol Biol Evol. 2006 Sep;23(9):1688-96. doi: 10.1093/molbev/msl036. Epub 2006 Jun 16.
6
A computational approach for identifying pathogenicity islands in prokaryotic genomes.一种用于识别原核生物基因组中致病岛的计算方法。
BMC Bioinformatics. 2005 Jul 21;6:184. doi: 10.1186/1471-2105-6-184.
7
Reconstruction of ancient genome and gene order from complete microbial genome sequences.从完整的微生物基因组序列重建古代基因组和基因顺序。
J Theor Biol. 2006 Apr 21;239(4):494-8. doi: 10.1016/j.jtbi.2005.08.013. Epub 2005 Sep 26.
8
Defining the core of nontransferable prokaryotic genes: the euryarchaeal core.定义不可转移原核基因的核心:广古菌核心。
J Mol Evol. 2001 Oct-Nov;53(4-5):340-50. doi: 10.1007/s002390010224.
9
The tree of life viewed through the contents of genomes.通过基因组内容物审视生命之树。
Methods Mol Biol. 2009;532:141-61. doi: 10.1007/978-1-60327-853-9_8.
10
AMIGOS: a method for the inspection of genomic organisation or structure and its application to characterise conserved gene arrangements.AMIGOS:一种用于检查基因组组织或结构的方法及其在表征保守基因排列中的应用。
In Silico Biol. 2006;6(4):281-306.

引用本文的文献

1
Comprehensive genomic analysis of strain BP9, pan-genomic and genetic basis of biocontrol mechanism.菌株BP9的全基因组分析、泛基因组及生物防治机制的遗传基础
Comput Struct Biotechnol J. 2023 Oct 3;21:4647-4662. doi: 10.1016/j.csbj.2023.09.043. eCollection 2023.
2
The physiological potential of anammox bacteria as revealed by their core genome structure.厌氧氨氧化细菌的核心基因组结构揭示的生理潜力。
DNA Res. 2021 Jan 19;28(1). doi: 10.1093/dnares/dsaa028.
3
MBGD update 2018: microbial genome database based on hierarchical orthology relations covering closely related and distantly related comparisons.

本文引用的文献

1
KEGG for linking genomes to life and the environment.京都基因与基因组百科全书,用于将基因组与生命及环境相联系。
Nucleic Acids Res. 2008 Jan;36(Database issue):D480-4. doi: 10.1093/nar/gkm882. Epub 2007 Dec 12.
2
The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata.2007年的在线基因组数据库(GOLD):基因组和宏基因组项目及其相关元数据的状况。
Nucleic Acids Res. 2008 Jan;36(Database issue):D475-9. doi: 10.1093/nar/gkm884. Epub 2007 Nov 2.
3
Construction of consecutive deletions of the Escherichia coli chromosome.
MBGD 更新 2018 版:基于层次同源关系的微生物基因组数据库,涵盖了密切相关和远距离相关的比较。
Nucleic Acids Res. 2019 Jan 8;47(D1):D382-D389. doi: 10.1093/nar/gky1054.
4
Comparative genomics of cocci-shaped Sporosarcina strains with diverse spatial isolation.具有不同空间隔离的球菌形芽孢杆菌菌株的比较基因组学
BMC Genomics. 2018 May 2;19(1):310. doi: 10.1186/s12864-018-4635-8.
5
An acid-tolerant ammonia-oxidizing γ-proteobacterium from soil.一株从土壤中分离得到的耐酸性氨氧化γ-变形菌。
ISME J. 2017 May;11(5):1130-1141. doi: 10.1038/ismej.2016.191. Epub 2017 Jan 10.
6
Complete genome sequence and expression profile of the commercial lytic enzyme producer Lysobacter enzymogenes M497-1.商业溶菌酶产生菌 Lysobacter enzymogenes M497-1 的全基因组序列和表达谱。
DNA Res. 2017 Apr 1;24(2):169-177. doi: 10.1093/dnares/dsw055.
7
A Novel Approach to Helicobacter pylori Pan-Genome Analysis for Identification of Genomic Islands.一种用于鉴定基因组岛的幽门螺杆菌泛基因组分析新方法。
PLoS One. 2016 Aug 9;11(8):e0159419. doi: 10.1371/journal.pone.0159419. eCollection 2016.
8
Genomic Basis for Methicillin Resistance in Staphylococcus aureus.金黄色葡萄球菌耐甲氧西林的基因组基础。
Infect Chemother. 2013 Jun;45(2):117-36. doi: 10.3947/ic.2013.45.2.117. Epub 2013 Jun 26.
9
Chromosome painting in silico in a bacterial species reveals fine population structure.在一个细菌物种中进行虚拟染色体作图揭示了精细的种群结构。
Mol Biol Evol. 2013 Jun;30(6):1454-64. doi: 10.1093/molbev/mst055. Epub 2013 Mar 16.
10
Evaluation method for the potential functionome harbored in the genome and metagenome.基因组和宏基因组中潜在功能组的评估方法。
BMC Genomics. 2012 Dec 12;13:699. doi: 10.1186/1471-2164-13-699.
大肠杆菌染色体连续缺失的构建
Mol Syst Biol. 2007;3:132. doi: 10.1038/msb4100174. Epub 2007 Aug 14.
4
Evolution of the core and pan-genome of Streptococcus: positive selection, recombination, and genome composition.链球菌核心基因组和泛基因组的进化:正选择、重组与基因组组成
Genome Biol. 2007;8(5):R71. doi: 10.1186/gb-2007-8-5-r71.
5
MBGD: a platform for microbial comparative genomics based on the automated construction of orthologous groups.MBGD:一个基于直系同源群自动构建的微生物比较基因组学平台。
Nucleic Acids Res. 2007 Jan;35(Database issue):D343-6. doi: 10.1093/nar/gkl978. Epub 2006 Nov 29.
6
CGAT: a comparative genome analysis tool for visualizing alignments in the analysis of complex evolutionary changes between closely related genomes.CGAT:一种用于在分析密切相关基因组之间复杂进化变化时可视化比对结果的比较基因组分析工具。
BMC Bioinformatics. 2006 Oct 24;7:472. doi: 10.1186/1471-2105-7-472.
7
M-GCAT: interactively and efficiently constructing large-scale multiple genome comparison frameworks in closely related species.M-GCAT:在近缘物种中交互式高效构建大规模多基因组比较框架
BMC Bioinformatics. 2006 Oct 5;7:433. doi: 10.1186/1471-2105-7-433.
8
Reconstructing contiguous regions of an ancestral genome.重建祖先基因组的连续区域。
Genome Res. 2006 Dec;16(12):1557-65. doi: 10.1101/gr.5383506. Epub 2006 Sep 18.
9
Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection.大肠杆菌K-12框内单基因敲除突变体的构建:Keio文库。
Mol Syst Biol. 2006;2:2006.0008. doi: 10.1038/msb4100050. Epub 2006 Feb 21.
10
Hierarchical clustering algorithm for comprehensive orthologous-domain classification in multiple genomes.用于多基因组中综合直系同源域分类的层次聚类算法。
Nucleic Acids Res. 2006 Jan 25;34(2):647-58. doi: 10.1093/nar/gkj448. Print 2006.