Galperin Michael Y, Vera Alvarez Roberto, Karamycheva Svetlana, Makarova Kira S, Wolf Yuri I, Landsman David, Koonin Eugene V
Computational Biology Branch, Division of Intramural Research, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA.
Nucleic Acids Res. 2025 Jan 6;53(D1):D356-D363. doi: 10.1093/nar/gkae983.
The Clusters of Orthologous Genes (COG) database, originally created in 1997, has been updated to reflect the constantly growing collection of completely sequenced prokaryotic genomes. This update increased the genome coverage from 1309 to 2296 species, including 2103 bacteria and 193 archaea, in most cases, with a single representative genome per genus. This set covers all genera of bacteria and archaea that included organisms with 'complete genomes' as per NCBI databases in November 2023. The number of COGs has been expanded from 4877 to 4981, primarily by including protein families involved in bacterial protein secretion. Accordingly, COG pathways and functional groups now include secretion systems of types II through X, as well as Flp/Tad and type IV pili. These groupings allow straightforward identification and examination of the prokaryotic lineages that encompass-or lack-a particular secretion system. Other developments include improved annotations for the rRNA and tRNA modification proteins, multi-domain signal transduction proteins, and some previously uncharacterized protein families. The new version of COGs is available at https://www.ncbi.nlm.nih.gov/research/COG, as well as on the NCBI FTP site https://ftp.ncbi.nlm.nih.gov/pub/COG/, which also provides archived data from previous COG releases.
直系同源基因簇(COG)数据库最初创建于1997年,现已更新,以反映不断增加的完全测序原核生物基因组集合。此次更新将基因组覆盖范围从1309种增加到2296种,包括2103种细菌和193种古菌,在大多数情况下,每个属有一个代表性基因组。这一集合涵盖了截至2023年11月NCBI数据库中所有包含“完整基因组”生物的细菌和古菌属。COG的数量已从4877个扩展到4981个,主要是通过纳入参与细菌蛋白质分泌的蛋白质家族。相应地,COG途径和功能组现在包括II型至X型分泌系统,以及Flp/Tad和IV型菌毛。这些分组使得能够直接识别和检查包含或缺乏特定分泌系统的原核生物谱系。其他进展包括对rRNA和tRNA修饰蛋白、多结构域信号转导蛋白以及一些以前未表征的蛋白质家族的注释有所改进。新版本的COG可在https://www.ncbi.nlm.nih.gov/research/COG获取,也可在NCBI FTP站点https://ftp.ncbi.nlm.nih.gov/pub/COG/获取,该站点还提供以前COG版本的存档数据。