COG database update 2024.

Authors

Galperin Michael Y, Vera Alvarez Roberto, Karamycheva Svetlana, Makarova Kira S, Wolf Yuri I, Landsman David, Koonin Eugene V

Affiliation

Computational Biology Branch, Division of Intramural Research, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA.

Publication information

Nucleic Acids Res. 2025 Jan 6;53(D1):D356-D363. doi: 10.1093/nar/gkae983.

DOI: 10.1093/nar/gkae983
PMID: 39494517
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11701660/
Abstract

The Clusters of Orthologous Genes (COG) database, originally created in 1997, has been updated to reflect the constantly growing collection of completely sequenced prokaryotic genomes. This update increased the genome coverage from 1309 to 2296 species, including 2103 bacteria and 193 archaea, in most cases, with a single representative genome per genus. This set covers all genera of bacteria and archaea that included organisms with 'complete genomes' as per NCBI databases in November 2023. The number of COGs has been expanded from 4877 to 4981, primarily by including protein families involved in bacterial protein secretion. Accordingly, COG pathways and functional groups now include secretion systems of types II through X, as well as Flp/Tad and type IV pili. These groupings allow straightforward identification and examination of the prokaryotic lineages that encompass, or lack, a particular secretion system. Other developments include improved annotations for the rRNA and tRNA modification proteins, multi-domain signal transduction proteins, and some previously uncharacterized protein families. The new version of COGs is available at https://www.ncbi.nlm.nih.gov/research/COG, as well as on the NCBI FTP site https://ftp.ncbi.nlm.nih.gov/pub/COG/, which also provides archived data from previous COG releases.
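
The abstract notes that grouping COGs by secretion system makes it easy to see which lineages encompass or lack a given system. The following is a minimal sketch of that kind of presence/absence summary, not the authors' method. The genome names, lineage labels, COG identifiers, and the `lineage_presence` helper are all hypothetical placeholders, and the in-memory records stand in for whatever membership table a reader derives from the COG downloads.

```python
from collections import defaultdict

# Toy membership records: (genome, lineage, COG id). All values are invented
# placeholders for illustration, not real COG assignments.
RECORDS = [
    ("genome_A", "Lineage1", "COG_X1"),
    ("genome_A", "Lineage1", "COG_X2"),
    ("genome_B", "Lineage1", "COG_X1"),
    ("genome_C", "Lineage2", "COG_X1"),
    ("genome_C", "Lineage2", "COG_X2"),
    ("genome_D", "Lineage2", "COG_Y1"),
]

# Hypothetical grouping: the COGs that together define one secretion system.
SYSTEM_COGS = {"COG_X1", "COG_X2"}


def lineage_presence(records, system_cogs, min_fraction=1.0):
    """Map each lineage to the genomes that carry or lack the system.

    A genome counts as carrying the system when it contains at least
    `min_fraction` of the system's COGs (an assumed, adjustable criterion).
    """
    cogs_by_genome = defaultdict(set)
    lineage_of = {}
    for genome, lineage, cog in records:
        cogs_by_genome[genome].add(cog)
        lineage_of[genome] = lineage

    summary = defaultdict(lambda: {"present": [], "absent": []})
    for genome in sorted(cogs_by_genome):
        found = len(cogs_by_genome[genome] & system_cogs)
        key = "present" if found / len(system_cogs) >= min_fraction else "absent"
        summary[lineage_of[genome]][key].append(genome)
    return dict(summary)


result = lineage_presence(RECORDS, SYSTEM_COGS)
for lineage, groups in sorted(result.items()):
    print(lineage, groups)
```

Lowering `min_fraction` relaxes the criterion so that genomes with a partial set of components are also counted as carrying the system. Which threshold is biologically meaningful depends on the system and is left to the reader.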

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0b/11701660/a82e349afb55/gkae983figgra1.jpg
