• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

41个古菌基因组的直系同源基因簇及其对古菌进化基因组学的意义。

Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea.

作者信息

Makarova Kira S, Sorokin Alexander V, Novichkov Pavel S, Wolf Yuri I, Koonin Eugene V

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.

出版信息

Biol Direct. 2007 Nov 27;2:33. doi: 10.1186/1745-6150-2-33.

DOI:10.1186/1745-6150-2-33
PMID:18042280
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2222616/
Abstract

BACKGROUND

An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes.

RESULTS

New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover approximately 88% of the genes in a genome compared to a approximately 76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; approximately 40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genome) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile that, in addition to the core archaeal functions, encoded more idiosyncratic systems, e.g., the CASS systems of antivirus defense and some toxin-antitoxin systems.

CONCLUSION

The arCOGs provide a convenient, flexible framework for functional annotation of archaeal genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archaeal hyperthermophiles. ArCOGs and related information are available at: ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.

摘要

背景

对已测序基因组中的基因进行进化分类,区分直系同源基因和旁系同源基因,对于基因组注释和进化重建至关重要。在细菌、古菌和单细胞真核生物的多个基因组序列可得后不久,蛋白质直系同源簇(COG)就进行了这样的分类尝试。基因组序列的快速积累为完善COG创造了机会,但由于错误放大也带来了挑战。一种实际策略是为基因组的系统发育紧密子集构建完善的COG。

结果

利用一种改进程序为41个古菌基因组(13个泉古菌门、27个广古菌门和1个纳古菌门)构建了新的古菌直系同源基因簇(arCOG),该程序采用较小的、组特异性簇之间的相似性树,半自动划分多结构域蛋白中的直系同源结构域,并使用轮廓搜索来鉴定远缘直系同源基因。arCOG的注释是基于COG、CDD数据库以及NR数据库中同源物注释的三种赋值的共识。平均而言,7538个arCOG覆盖了基因组中约88%的基因,而COG的覆盖度约为76%。arCOG中直系同源物鉴定的粒度更细,这从4538个arCOG对应于2362个COG这一事实可以明显看出;约40%的arCOG是新的。古菌基因核心(在所有41个基因组中都存在的蛋白质编码基因)由166个arCOG组成。arCOG被用于重建古菌进化过程中的基因丢失和基因获得事件以及祖先形式的基因集。保守估计,最后的古菌共同祖先(LACA)拥有996个基因,而泉古菌门和广古菌门的最后共同祖先分别有1245个和1335个基因。据推测,LACA是一种化学自养嗜热菌,除了核心古菌功能外,还编码了更多特殊系统,例如抗病毒防御的CASS系统和一些毒素 - 抗毒素系统。

结论

arCOG为古菌基因组的功能注释、比较基因组学和进化重建提供了一个方便、灵活的框架。基因组重建表明,古菌的最后共同祖先可能(几乎)与现代古菌嗜热菌一样先进。arCOG及相关信息可在以下网址获取:ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/ 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/b6d6e2e71c43/1745-6150-2-33-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/b519d898d7dd/1745-6150-2-33-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/afd709449e9e/1745-6150-2-33-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/25da7aadee58/1745-6150-2-33-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/ffe5f73b48d4/1745-6150-2-33-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/a173be3fa731/1745-6150-2-33-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/6c04ed0e8173/1745-6150-2-33-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/bb85ea8611ad/1745-6150-2-33-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/48e24b26a451/1745-6150-2-33-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/b6d6e2e71c43/1745-6150-2-33-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/b519d898d7dd/1745-6150-2-33-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/afd709449e9e/1745-6150-2-33-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/25da7aadee58/1745-6150-2-33-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/ffe5f73b48d4/1745-6150-2-33-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/a173be3fa731/1745-6150-2-33-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/6c04ed0e8173/1745-6150-2-33-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/bb85ea8611ad/1745-6150-2-33-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/48e24b26a451/1745-6150-2-33-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7b4/2222616/b6d6e2e71c43/1745-6150-2-33-9.jpg

相似文献

1
Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea.41个古菌基因组的直系同源基因簇及其对古菌进化基因组学的意义。
Biol Direct. 2007 Nov 27;2:33. doi: 10.1186/1745-6150-2-33.
2
Updated clusters of orthologous genes for Archaea: a complex ancestor of the Archaea and the byways of horizontal gene transfer.古菌的更新直系同源基因簇:古菌的复杂祖先和水平基因转移的旁路。
Biol Direct. 2012 Dec 14;7:46. doi: 10.1186/1745-6150-7-46.
3
Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs).通过使用蛋白质直系同源簇(COG)进行基因组注释来理解泉古菌的首个基因组序列。
Genome Biol. 2000;1(5):RESEARCH0009. doi: 10.1186/gb-2000-1-5-research0009. Epub 2000 Nov 6.
4
Archaeal Clusters of Orthologous Genes (arCOGs): An Update and Application for Analysis of Shared Features between Thermococcales, Methanococcales, and Methanobacteriales.古菌直系同源基因簇 (arCOGs):对热球菌目、甲烷球菌目和甲烷杆菌目之间共享特征进行分析的更新与应用。
Life (Basel). 2015 Mar 10;5(1):818-40. doi: 10.3390/life5010818.
5
Genome trees constructed using five different approaches suggest new major bacterial clades.使用五种不同方法构建的基因组树表明了新的主要细菌进化枝。
BMC Evol Biol. 2001 Oct 20;1:8. doi: 10.1186/1471-2148-1-8.
6
COG database update: focus on microbial diversity, model organisms, and widespread pathogens.COG 数据库更新:重点关注微生物多样性、模式生物和广泛存在的病原体。
Nucleic Acids Res. 2021 Jan 8;49(D1):D274-D281. doi: 10.1093/nar/gkaa1018.
7
The COG database: a tool for genome-scale analysis of protein functions and evolution.COG数据库:一种用于蛋白质功能和进化的基因组规模分析的工具。
Nucleic Acids Res. 2000 Jan 1;28(1):33-6. doi: 10.1093/nar/28.1.33.
8
Phyletic Distribution and Lineage-Specific Domain Architectures of Archaeal Two-Component Signal Transduction Systems.古菌双组分信号转导系统的系统发生分布和谱系特异性结构域结构。
J Bacteriol. 2018 Mar 12;200(7). doi: 10.1128/JB.00681-17. Print 2018 Apr 1.
9
Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes.用于计算基因组进化简约进化情景、最后共同祖先以及原核生物进化中水平基因转移主导地位的算法。
BMC Evol Biol. 2003 Jan 6;3:2. doi: 10.1186/1471-2148-3-2.
10
Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context.基因组比对、原核生物基因组组织的进化以及利用基因组背景预测基因功能。
Genome Res. 2001 Mar;11(3):356-72. doi: 10.1101/gr.gr-1619r.

引用本文的文献

1
Cyclization of archaeal membrane lipids impacts membrane protein activity and archaellum formation.古菌膜脂的环化影响膜蛋白活性和菌毛形成。
Proc Natl Acad Sci U S A. 2025 May 20;122(20):e2423648122. doi: 10.1073/pnas.2423648122. Epub 2025 May 12.
2
Studying sleep orthologs in Epsilonproteobacteria through an evolutionary lens: investigating sleep mysteries through phylogenomics.从进化角度研究ε-变形菌中的睡眠直系同源物:通过系统发育基因组学探究睡眠之谜。
World J Microbiol Biotechnol. 2025 Apr 28;41(5):154. doi: 10.1007/s11274-025-04361-3.
3
Unraveling the Mysteries of Sleep: Exploring Phylogenomic Sleep Signals in the Recently Characterized Archaeal Phylum Lokiarchaeota near Loki's Castle.

本文引用的文献

1
eggNOG: automated construction and annotation of orthologous groups of genes.蛋酒(EggNOG):直系同源基因簇的自动构建与注释
Nucleic Acids Res. 2008 Jan;36(Database issue):D250-4. doi: 10.1093/nar/gkm796. Epub 2007 Oct 16.
2
CRISPR provides acquired resistance against viruses in prokaryotes.CRISPR为原核生物提供了对病毒的适应性抗性。
Science. 2007 Mar 23;315(5819):1709-12. doi: 10.1126/science.1138140.
3
The genome of Hyperthermus butylicus: a sulfur-reducing, peptide fermenting, neutrophilic Crenarchaeote growing up to 108 degrees C.
揭开睡眠之谜:探索位于洛基城堡附近新发现的古菌门洛基古菌中的系统发育睡眠信号。
Int J Mol Sci. 2024 Dec 25;26(1):60. doi: 10.3390/ijms26010060.
4
Iron starvation results in up-regulation of a probable siderophore transporter.铁饥饿导致一种可能的铁载体转运蛋白上调。
Front Microbiol. 2024 Aug 14;15:1422844. doi: 10.3389/fmicb.2024.1422844. eCollection 2024.
5
The novel regulator HdrR controls the transcription of the heterodisulfide reductase operon in .新型调节子 HdrR 控制 在 中的异二硫键还原酶操纵子的转录。
Appl Environ Microbiol. 2024 Jun 18;90(6):e0069124. doi: 10.1128/aem.00691-24. Epub 2024 May 29.
6
Physiological potential and evolutionary trajectories of syntrophic sulfate-reducing bacterial partners of anaerobic methanotrophic archaea.厌氧甲烷氧化古菌共生硫酸盐还原菌的生理潜能和进化轨迹。
PLoS Biol. 2023 Sep 25;21(9):e3002292. doi: 10.1371/journal.pbio.3002292. eCollection 2023 Sep.
7
Quantitative Mass Spectrometry by SILAC in Haloferax volcanii.利用 SILAC 进行 Haloferax volcanii 的定量质谱分析。
Methods Mol Biol. 2022;2522:255-266. doi: 10.1007/978-1-0716-2445-6_16.
8
Discovery of archaeal fusexins homologous to eukaryotic HAP2/GCS1 gamete fusion proteins.发现古菌 fusexins 与真核生物 HAP2/GCS1 配子融合蛋白同源。
Nat Commun. 2022 Jul 6;13(1):3880. doi: 10.1038/s41467-022-31564-1.
9
Conserved and lineage-specific hypothetical proteins may have played a central role in the rise and diversification of major archaeal groups.保守和谱系特异性的假设蛋白可能在主要古菌类群的兴起和多样化中发挥了核心作用。
BMC Biol. 2022 Jul 5;20(1):154. doi: 10.1186/s12915-022-01348-6.
10
Gene tree and species tree reconciliation with endosymbiotic gene transfer.内共生基因转移的基因树和种系发生树的协调。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i120-i132. doi: 10.1093/bioinformatics/btab328.
嗜热丁酸栖热菌的基因组:一种能将硫还原、发酵肽类、嗜中性的泉古菌,最高可在108摄氏度下生长。
Archaea. 2007 May;2(2):127-35. doi: 10.1155/2007/745987.
4
Database resources of the National Center for Biotechnology Information.美国国立生物技术信息中心的数据库资源。
Nucleic Acids Res. 2007 Jan;35(Database issue):D5-12. doi: 10.1093/nar/gkl1031. Epub 2006 Dec 14.
5
Genomic analysis of the uncultivated marine crenarchaeote Cenarchaeum symbiosum.未培养海洋泉古菌共生嗜 Cenarchaeum symbiosum 的基因组分析。
Proc Natl Acad Sci U S A. 2006 Nov 28;103(48):18296-301. doi: 10.1073/pnas.0608549103. Epub 2006 Nov 17.
6
STRING 7--recent developments in the integration and prediction of protein interactions.STRING 7——蛋白质相互作用整合与预测的最新进展
Nucleic Acids Res. 2007 Jan;35(Database issue):D358-62. doi: 10.1093/nar/gkl825. Epub 2006 Nov 10.
7
The tree of one percent.百分之一之树。
Genome Biol. 2006;7(10):118. doi: 10.1186/gb-2006-7-10-118. Epub 2006 Nov 1.
8
Comparative genomics of the lactic acid bacteria.乳酸菌的比较基因组学
Proc Natl Acad Sci U S A. 2006 Oct 17;103(42):15611-6. doi: 10.1073/pnas.0607117103. Epub 2006 Oct 9.
9
The Methanosarcina barkeri genome: comparative analysis with Methanosarcina acetivorans and Methanosarcina mazei reveals extensive rearrangement within methanosarcinal genomes.巴氏甲烷八叠球菌基因组:与嗜乙酸甲烷八叠球菌和马氏甲烷八叠球菌的比较分析揭示了甲烷八叠球菌基因组内广泛的重排。
J Bacteriol. 2006 Nov;188(22):7922-31. doi: 10.1128/JB.00810-06. Epub 2006 Sep 15.
10
The cyanobacterial genome core and the origin of photosynthesis.蓝藻基因组核心与光合作用的起源。
Proc Natl Acad Sci U S A. 2006 Aug 29;103(35):13126-31. doi: 10.1073/pnas.0605709103. Epub 2006 Aug 21.