• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过使用蛋白质直系同源簇(COG)进行基因组注释来理解泉古菌的首个基因组序列。

Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs).

作者信息

Natale D A, Shankavaram U T, Galperin M Y, Wolf Y I, Aravind L, Koonin E V

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Rockville Pike, Bethesda, MD 20894, USA.

出版信息

Genome Biol. 2000;1(5):RESEARCH0009. doi: 10.1186/gb-2000-1-5-research0009. Epub 2000 Nov 6.

DOI:10.1186/gb-2000-1-5-research0009
PMID:11178258
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC15027/
Abstract

BACKGROUND

Standard archival sequence databases have not been designed as tools for genome annotation and are far from being optimal for this purpose. We used the database of Clusters of Orthologous Groups of proteins (COGs) to reannotate the genomes of two archaea, Aeropyrum pernix, the first member of the Crenarchaea to be sequenced, and Pyrococcus abyssi.

RESULTS

A. pernix and P. abyssi proteins were assigned to COGs using the COGNITOR program; the results were verified on a case-by-case basis and augmented by additional database searches using the PSI-BLAST and TBLASTN programs. Functions were predicted for over 300 proteins from A. pernix, which could not be assigned a function using conventional methods with a conservative sequence similarity threshold, an approximately 50% increase compared to the original annotation. A. pernix shares most of the conserved core of proteins that were previously identified in the Euryarchaeota. Cluster analysis or distance matrix tree construction based on the co-occurrence of genomes in COGs showed that A. pernix forms a distinct group within the archaea, although grouping with the two species of Pyrococci, indicative of similar repertoires of conserved genes, was observed. No indication of a specific relationship between Crenarchaeota and eukaryotes was obtained in these analyses. Several proteins that are conserved in Euryarchaeota and most bacteria are unexpectedly missing in A. pernix, including the entire set of de novo purine biosynthesis enzymes, the GTPase FtsZ (a key component of the bacterial and euryarchaeal cell-division machinery), and the tRNA-specific pseudouridine synthase, previously considered universal. A. pernix is represented in 48 COGs that do not contain any euryarchaeal members. Many of these proteins are TCA cycle and electron transport chain enzymes, reflecting the aerobic lifestyle of A. pernix.

CONCLUSIONS

Special-purpose databases organized on the basis of phylogenetic analysis and carefully curated with respect to known and predicted protein functions provide for a significant improvement in genome annotation. A differential genome display approach helps in a systematic investigation of common and distinct features of gene repertoires and in some cases reveals unexpected connections that may be indicative of functional similarities between phylogenetically distant organisms and of lateral gene exchange.

摘要

背景

标准的存档序列数据库并非设计用于基因组注释,远非实现此目的的最佳工具。我们使用蛋白质直系同源簇数据库(COGs)对两种古菌的基因组进行重新注释,这两种古菌分别是嗜热栖热菌(Aeropyrum pernix),它是泉古菌门中首个被测序的成员,以及深渊嗜热栖热菌(Pyrococcus abyssi)。

结果

使用COGNITOR程序将嗜热栖热菌和深渊嗜热栖热菌的蛋白质归入COGs;逐一验证结果,并通过使用PSI-BLAST和TBLASTN程序进行额外的数据库搜索来补充。对超过300种来自嗜热栖热菌的蛋白质预测了功能,使用具有保守序列相似性阈值的传统方法这些蛋白质无法被赋予功能,相比原始注释增加了约50%。嗜热栖热菌共享了先前在广古菌门中鉴定出的大部分保守核心蛋白质。基于COGs中基因组的共现进行聚类分析或构建距离矩阵树表明,嗜热栖热菌在古菌中形成一个独特的类群,尽管观察到它与两种嗜热栖热菌属物种归为一组,这表明保守基因库具有相似性。在这些分析中未发现泉古菌门与真核生物之间存在特定关系的迹象。在嗜热栖热菌中意外缺失了一些在广古菌门和大多数细菌中保守的蛋白质,包括整套从头嘌呤生物合成酶、GTP酶FtsZ(细菌和广古菌细胞分裂机制的关键组成部分)以及先前被认为是通用的tRNA特异性假尿苷合酶。嗜热栖热菌在48个不包含任何广古菌门成员的COGs中有所体现。这些蛋白质中的许多是三羧酸循环和电子传递链酶,反映了嗜热栖热菌的需氧生活方式。

结论

基于系统发育分析构建并针对已知和预测的蛋白质功能进行精心策划的专用数据库,可显著改善基因组注释。差异基因组展示方法有助于系统研究基因库的共同和独特特征,在某些情况下揭示出意想不到的联系,这些联系可能表明系统发育上距离遥远的生物体之间的功能相似性以及横向基因转移。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe5b/15027/341326605495/gb-2000-1-5-research0009-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe5b/15027/18a3c8337452/gb-2000-1-5-research0009-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe5b/15027/319274b59e8b/gb-2000-1-5-research0009-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe5b/15027/ed6a958a5e04/gb-2000-1-5-research0009-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe5b/15027/341326605495/gb-2000-1-5-research0009-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe5b/15027/18a3c8337452/gb-2000-1-5-research0009-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe5b/15027/319274b59e8b/gb-2000-1-5-research0009-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe5b/15027/ed6a958a5e04/gb-2000-1-5-research0009-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe5b/15027/341326605495/gb-2000-1-5-research0009-4.jpg

相似文献

1
Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs).通过使用蛋白质直系同源簇(COG)进行基因组注释来理解泉古菌的首个基因组序列。
Genome Biol. 2000;1(5):RESEARCH0009. doi: 10.1186/gb-2000-1-5-research0009. Epub 2000 Nov 6.
2
Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea.41个古菌基因组的直系同源基因簇及其对古菌进化基因组学的意义。
Biol Direct. 2007 Nov 27;2:33. doi: 10.1186/1745-6150-2-33.
3
Phylogenomic analysis of proteins that are distinctive of Archaea and its main subgroups and the origin of methanogenesis.古菌及其主要亚群特有的蛋白质的系统基因组分析与甲烷生成的起源
BMC Genomics. 2007 Mar 29;8:86. doi: 10.1186/1471-2164-8-86.
4
Updated clusters of orthologous genes for Archaea: a complex ancestor of the Archaea and the byways of horizontal gene transfer.古菌的更新直系同源基因簇:古菌的复杂祖先和水平基因转移的旁路。
Biol Direct. 2012 Dec 14;7:46. doi: 10.1186/1745-6150-7-46.
5
Archaeal genomics.古菌基因组学。
Curr Opin Microbiol. 1999 Oct;2(5):542-7. doi: 10.1016/s1369-5274(99)00014-4.
6
Archaeal phylogeny based on proteins of the transcription and translation machineries: tackling the Methanopyrus kandleri paradox.基于转录和翻译机制蛋白质的古菌系统发育:解决坎氏甲烷嗜热菌悖论
Genome Biol. 2004;5(3):R17. doi: 10.1186/gb-2004-5-3-r17. Epub 2004 Feb 26.
7
Hindsight in the relative abundance, metabolic potential and genome dynamics of uncultivated marine archaea from comparative metagenomic analyses of bathypelagic plankton of different oceanic regions.通过对不同海洋区域深海浮游生物的比较宏基因组分析,洞察未培养海洋古菌的相对丰度、代谢潜力和基因组动态。
ISME J. 2008 Aug;2(8):865-86. doi: 10.1038/ismej.2008.40. Epub 2008 May 8.
8
Distinct gene set in two different lineages of ammonia-oxidizing archaea supports the phylum Thaumarchaeota.两个不同分支的氨氧化古菌中独特的基因集支持奇古菌门。
Trends Microbiol. 2010 Aug;18(8):331-40. doi: 10.1016/j.tim.2010.06.003. Epub 2010 Jul 2.
9
An integrated analysis of the genome of the hyperthermophilic archaeon Pyrococcus abyssi.嗜热古菌深渊热球菌基因组的综合分析。
Mol Microbiol. 2003 Mar;47(6):1495-512. doi: 10.1046/j.1365-2958.2003.03381.x.
10
The COG database: a tool for genome-scale analysis of protein functions and evolution.COG数据库:一种用于蛋白质功能和进化的基因组规模分析的工具。
Nucleic Acids Res. 2000 Jan 1;28(1):33-6. doi: 10.1093/nar/28.1.33.

引用本文的文献

1
Salinicoccus bachuensis sp. nov., a novel halophilic bacterium, isolated from rhizosphere soil surrounding Tamarix chinensis Lour. plants in Bachu County, Xinjiang, China.巴赫盐球菌新种,一种新型嗜盐细菌,从中国新疆巴楚县柽柳植物根际土壤中分离得到。
Arch Microbiol. 2025 Apr 3;207(5):112. doi: 10.1007/s00203-025-04315-1.
2
Insights into the genomic architecture of a newly discovered endophytic species belonging to the complex from India.对一种新发现的来自印度、属于该复合体的内生菌物种基因组结构的见解。
Front Microbiol. 2023 Nov 27;14:1266620. doi: 10.3389/fmicb.2023.1266620. eCollection 2023.
3
Diversity, structure, and distribution of bacterioplankton and diazotroph communities in the Bay of Bengal during the winter monsoon.

本文引用的文献

1
An evolutionary classification of the metallo-beta-lactamase fold proteins.金属β-内酰胺酶折叠蛋白的进化分类
In Silico Biol. 1999;1(2):69-91.
2
Sources of systematic error in functional annotation of genomes: domain rearrangement, non-orthologous gene displacement and operon disruption.基因组功能注释中系统误差的来源:结构域重排、非直系同源基因替代和操纵子破坏。
In Silico Biol. 1998;1(1):55-67.
3
Using the COG database to improve gene recognition in complete genomes.利用COG数据库提高完整基因组中的基因识别能力。
冬季风期间孟加拉湾浮游细菌和固氮微生物群落的多样性、结构及分布
Front Microbiol. 2022 Nov 30;13:987462. doi: 10.3389/fmicb.2022.987462. eCollection 2022.
4
An integrated metabolome and transcriptome approach reveals the fruit flavor and regulatory network during jujube fruit development.整合代谢组学和转录组学方法揭示枣果实发育过程中的果实风味及调控网络。
Front Plant Sci. 2022 Sep 23;13:952698. doi: 10.3389/fpls.2022.952698. eCollection 2022.
5
Genome Features and AntiSMASH Analysis of an Endophytic Strain sp. R1.一种内生菌株sp. R1的基因组特征与抗SMASH分析
Metabolites. 2022 Jun 4;12(6):521. doi: 10.3390/metabo12060521.
6
Comparative mitogenomics and phylogenetics of the family Carangidae with special emphasis on the mitogenome of the Indian Scad Decapterus russelli.比较 Carangidae 科的线粒体基因组学和系统发育学,特别强调印度洋鲹 Decapterus russelli 的线粒体基因组。
Sci Rep. 2022 Apr 4;12(1):5642. doi: 10.1038/s41598-022-09636-5.
7
Comparative Genome Analysis Reveals Phylogenetic Identity of Bacillus velezensis HNA3 and Genomic Insights into Its Plant Growth Promotion and Biocontrol Effects.比较基因组分析揭示了贝莱斯芽孢杆菌 HNA3 的系统发育同一性及其促进植物生长和生物防治作用的基因组见解。
Microbiol Spectr. 2022 Feb 23;10(1):e0216921. doi: 10.1128/spectrum.02169-21. Epub 2022 Feb 2.
8
Orf165 is associated with cytoplasmic male sterility in pepper.Orf165与辣椒细胞质雄性不育相关。
Genet Mol Biol. 2021 Sep 22;44(3):e20210030. doi: 10.1590/1678-4685-GMB-2021-0030. eCollection 2021.
9
A Clinical Extensively-Drug Resistant (XDR) and Role of Its β-Lactamase Genes.临床广泛耐药(XDR)及其β-内酰胺酶基因的作用
Front Microbiol. 2020 Dec 10;11:590357. doi: 10.3389/fmicb.2020.590357. eCollection 2020.
10
Molecular detection and phylogenetic analysis of genotypes in Hillah, Iraq.伊拉克希拉市基因型的分子检测与系统发育分析
New Microbes New Infect. 2020 Aug 7;37:100739. doi: 10.1016/j.nmni.2020.100739. eCollection 2020 Sep.
Genetica. 2000;108(1):9-17. doi: 10.1023/a:1004031323748.
4
Who's your neighbor? New computational approaches for functional genomics.谁是你的邻居?功能基因组学的新计算方法。
Nat Biotechnol. 2000 Jun;18(6):609-13. doi: 10.1038/76443.
5
Protein fold recognition using sequence profiles and its application in structural genomics.利用序列谱进行蛋白质折叠识别及其在结构基因组学中的应用。
Adv Protein Chem. 2000;54:245-75. doi: 10.1016/s0065-3233(00)54008-x.
6
Organelle division: Self-assembling GTPase caught in the middle.细胞器分裂:处于中间位置的自组装GTP酶。
Curr Biol. 2000 May 4;10(9):R328-30. doi: 10.1016/s0960-9822(00)00458-9.
7
Uracil-DNA glycosylase in the extreme thermophile Archaeoglobus fulgidus.嗜热古菌富铁嗜热栖热菌中的尿嘧啶-DNA糖基化酶
J Biol Chem. 2000 Jun 23;275(25):19146-9. doi: 10.1074/jbc.M001995200.
8
An archaeal genomic signature.一种古菌基因组特征。
Proc Natl Acad Sci U S A. 2000 Mar 28;97(7):3304-8. doi: 10.1073/pnas.97.7.3304.
9
Cysteine biosynthesis pathway in the archaeon Methanosarcina barkeri encoded by acquired bacterial genes?由获得的细菌基因编码的嗜甲烷菌巴氏甲烷八叠球菌中的半胱氨酸生物合成途径?
J Bacteriol. 2000 Jan;182(1):143-5. doi: 10.1128/JB.182.1.143-145.2000.
10
SMART: a web-based tool for the study of genetically mobile domains.SMART:一个用于研究基因移动结构域的基于网络的工具。
Nucleic Acids Res. 2000 Jan 1;28(1):231-4. doi: 10.1093/nar/28.1.231.