
Extension of the COG and arCOG databases by amino acid and nucleotide sequences.

Authors

Meereis Florian, Kaufmann Michael

Affiliation

The Protein Chemistry Group, Witten/Herdecke University, Stockumer Str. 10, 58448 Witten, Germany.

Publication

BMC Bioinformatics. 2008 Nov 13;9:479. doi: 10.1186/1471-2105-9-479.

DOI:10.1186/1471-2105-9-479
PMID:19014535
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2588464/
Abstract

BACKGROUND

The current versions of the COG and arCOG databases, both excellent frameworks for studies in comparative and functional genomics, do not contain the nucleotide sequences corresponding to their protein or protein domain entries.

RESULTS

Using sequence information obtained from GenBank flat files covering the completely sequenced genomes of the COG and arCOG databases, we constructed NUCOCOG (nucleotide sequences containing COG databases) as an extended version including all nucleotide sequences and in addition the amino acid sequences originally utilized to construct the current COG and arCOG databases. We make available three comprehensive single XML files containing the complete databases including all sequence information. In addition, we provide a web interface as a utility suitable to browse the NUCOCOG database for sequence retrieval. The database is accessible at http://www.uni-wh.de/nucocog.

CONCLUSION

NUCOCOG makes it possible to analyze any sequence-related property in the context of the COG and arCOG framework simply by applying a scripting language such as Perl to a large but single XML document.
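
The kind of single-document analysis described in the conclusion can be sketched as follows. The abstract names Perl; this sketch uses Python instead. It streams through an XML document and computes the GC content of the nucleotide sequences in each cluster. The abstract does not describe the NUCOCOG XML schema, so the element and attribute names used here (`nucocog`, `cog`, `entry`, `protein`, `nucleotide`, `id`, `organism`) and the sample sequences are hypothetical and would need to be adapted to the real files.

```python
import io
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical miniature document. The real NUCOCOG schema is not given
# in the abstract, so every tag and attribute name below is an assumption.
SAMPLE_XML = """<nucocog>
  <cog id="COG0001">
    <entry organism="Eco">
      <protein>MKV</protein>
      <nucleotide>ATGAAAGTT</nucleotide>
    </entry>
    <entry organism="Bsu">
      <protein>MGC</protein>
      <nucleotide>ATGGGCTGC</nucleotide>
    </entry>
  </cog>
  <cog id="COG0002">
    <entry organism="Eco">
      <protein>MA</protein>
      <nucleotide>ATGGCG</nucleotide>
    </entry>
  </cog>
</nucocog>"""


def gc_content_per_cog(source):
    """Stream through the XML and return {cog_id: GC fraction}."""
    gc_counts = defaultdict(int)
    base_counts = defaultdict(int)
    # iterparse yields each element when its closing tag is read, so the
    # whole document never has to be held in memory as a full tree.
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag != "cog":
            continue
        cog_id = elem.get("id")
        for nuc in elem.iter("nucleotide"):
            seq = (nuc.text or "").strip().upper()
            gc_counts[cog_id] += seq.count("G") + seq.count("C")
            base_counts[cog_id] += len(seq)
        # Drop the children of the finished cluster to keep memory use low.
        elem.clear()
    return {
        cog_id: gc_counts[cog_id] / base_counts[cog_id]
        for cog_id in base_counts
        if base_counts[cog_id] > 0
    }


result = gc_content_per_cog(io.BytesIO(SAMPLE_XML.encode("utf-8")))
for cog_id, fraction in sorted(result.items()):
    print(f"{cog_id}\t{fraction:.3f}")
```

For a real download, the `io.BytesIO` object would be replaced by the path to the XML file. Streaming with `iterparse` rather than loading the whole tree is a design choice suited to the "large but single" document the authors describe.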


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5ba/2588464/84e93fc01f5e/1471-2105-9-479-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5ba/2588464/465627c25a5b/1471-2105-9-479-1.jpg
