Toward an efficient method of identifying core genes for evolutionary and functional microbial phylogenies.

Affiliation

Biostatistics Department, Harvard School of Public Health, Harvard University, Boston, Massachusetts, United States of America.

Publication information

PLoS One. 2011;6(9):e24704. doi: 10.1371/journal.pone.0024704. Epub 2011 Sep 12.

DOI:10.1371/journal.pone.0024704

PMID:21931822

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3171473/

Abstract

Microbial community metagenomes and individual microbial genomes are becoming increasingly accessible by means of high-throughput sequencing. Assessing organismal membership within a community is typically performed using one or a few taxonomic marker genes such as the 16S rDNA, and these same genes are also employed to reconstruct molecular phylogenies. There is thus a growing need to bioinformatically catalog strongly conserved core genes that can serve as effective taxonomic markers, to assess the agreement among phylogenies generated from different core genes, and to characterize the biological functions enriched within core genes and thus conserved throughout large microbial clades. We present a method to recursively identify core genes (i.e. genes ubiquitous within a microbial clade) in high-throughput from a large number of complete input genomes. We analyzed over 1,100 genomes to produce core gene sets spanning 2,861 bacterial and archaeal clades, ranging in size from one to >2,000 genes in inverse correlation with the α-diversity (total phylogenetic branch length) spanned by each clade. These cores are enriched as expected for housekeeping functions including translation, transcription, and replication, in addition to significant representations of regulatory, chaperone, and conserved uncharacterized proteins. In agreement with previous manually curated core gene sets, phylogenies constructed from one or more of these core genes agree with those built using 16S rDNA sequence similarity, suggesting that systematic core gene selection can be used to optimize both comparative genomics and determination of microbial community structure. Finally, we examine functional phylogenies constructed by clustering genomes by the presence or absence of orthologous gene families and show that they provide an informative complement to standard sequence-based molecular phylogenies.
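The two ideas in the abstract, recursive core gene identification and gene-content comparison of genomes, can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that a core is the strict intersection of gene families across all genomes in a clade (the published method may tolerate missing genes), that the clade tree is given as nested lists of genome names, and that Jaccard distance is a reasonable stand-in for the presence/absence comparison. The names `find_cores` and `jaccard_distance`, the genome labels, and the gene-to-genome assignments are all hypothetical.

```python
def find_cores(tree, families, cores=None):
    """Recursively compute the core gene set of every clade in `tree`.

    `tree` is a genome name (leaf) or a non-empty list of subtrees (clade).
    `families` maps each genome name to its set of gene families.
    Returns (sorted leaf names, core of this node, dict of all clade cores).
    Assumption: a core gene must be present in every genome of the clade.
    """
    if cores is None:
        cores = {}
    if isinstance(tree, str):
        # A single genome: its "core" is simply its own gene content.
        return (tree,), frozenset(families[tree]), cores
    leaves = []
    core = None
    for child in tree:
        child_leaves, child_core, _ = find_cores(child, families, cores)
        leaves.extend(child_leaves)
        # The clade core is the intersection of its children's cores.
        core = child_core if core is None else core & child_core
    key = tuple(sorted(leaves))
    cores[key] = core
    return key, core, cores


def jaccard_distance(a, b):
    """Gene-content distance between two genomes (0 = identical content)."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


# Illustrative toy data only; not taken from the paper.
families = {
    "genome_A": {"rpoB", "recA", "gyrB", "lacZ"},
    "genome_B": {"rpoB", "recA", "gyrB"},
    "genome_C": {"rpoB", "recA", "nifH"},
}
tree = [["genome_A", "genome_B"], "genome_C"]

_, root_core, cores = find_cores(tree, families)
for clade, core in cores.items():
    print(clade, sorted(core))

dist_ab = jaccard_distance(families["genome_A"], families["genome_B"])
dist_ac = jaccard_distance(families["genome_A"], families["genome_C"])
print(dist_ab, dist_ac)
```

In this toy example the inner clade keeps three shared families while the root keeps only two, which mirrors the reported trend that core size shrinks as a clade spans more diversity. A pairwise matrix of such distances could then be passed to any hierarchical clustering routine to obtain a gene-content tree.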

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673a/3171473/414413a362ff/pone.0024704.g001.jpg
