• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ANCA:COGs 的氨基酸、核苷酸和密码子分析——一种用于微生物同源物序列偏差分析的工具。

ANCAC: amino acid, nucleotide, and codon analysis of COGs--a tool for sequence bias analysis in microbial orthologs.

机构信息

The Protein Chemistry Group, Institute for Medical Biochemistry, Centre for Biomedical Education and Research, School of Medicine, Faculty of Health, Witten/Herdecke University, Stockumer Str. 10, Witten 58448, Germany.

出版信息

BMC Bioinformatics. 2012 Sep 8;13:223. doi: 10.1186/1471-2105-13-223.

DOI:10.1186/1471-2105-13-223
PMID:22958836
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3468366/
Abstract

BACKGROUND

The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins.

RESULTS

Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far.

CONCLUSIONS

Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.

摘要

背景

COG 数据库是最受欢迎的来自许多不同完全测序微生物基因组的同源蛋白集合。根据定义,该数据库中的一个同源簇(COG)仅包含最有可能实现相同细胞功能的蛋白质。最近,COG 数据库通过为每个蛋白质分配相应的氨基酸和编码核苷酸序列扩展为 NUCOCOG 数据库。该 COG 数据库的扩展版本是一个将序列特征与相应蛋白质功能连接起来的有价值的资源。

结果

在这里,我们介绍了 ANCAC,这是一种基于可自由定义的系统发育模式分析 COG 中氨基酸、核苷酸和密码子频率的网络工具和 MySQL 数据库。我们通过在特定于物种或功能的上下文中分析氨基酸频率、密码子使用和 GC 含量来证明 ANCAC 的有用性。关于氨基酸,我们至少部分地通过使用 ANCAC 的 NUCOCOG 数据集证实了同源偏爱假说,因为这是迄今为止为此目的提供的最大数据集。

结论

使用 NUCOCOG 数据集,ANCAC 通过 COG 将分类学、氨基酸和核苷酸序列信息与功能分类联系起来,并提供了一个用于灵活挖掘序列偏差的 GUI。因此,据我们所知,它是唯一一种无需大量编程技能即可根据生理角色和系统发育背景分析序列组成的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a2/3468366/8bb1292e1cc1/1471-2105-13-223-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a2/3468366/3bc6f690b4e1/1471-2105-13-223-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a2/3468366/dc36eaab2440/1471-2105-13-223-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a2/3468366/8bb1292e1cc1/1471-2105-13-223-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a2/3468366/3bc6f690b4e1/1471-2105-13-223-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a2/3468366/dc36eaab2440/1471-2105-13-223-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a2/3468366/8bb1292e1cc1/1471-2105-13-223-4.jpg

相似文献

1
ANCAC: amino acid, nucleotide, and codon analysis of COGs--a tool for sequence bias analysis in microbial orthologs.ANCA:COGs 的氨基酸、核苷酸和密码子分析——一种用于微生物同源物序列偏差分析的工具。
BMC Bioinformatics. 2012 Sep 8;13:223. doi: 10.1186/1471-2105-13-223.
2
Bioinformatics analysis of disordered proteins in prokaryotes.原核生物中无序蛋白质的生物信息学分析。
BMC Bioinformatics. 2011 Mar 2;12:66. doi: 10.1186/1471-2105-12-66.
3
EPPS: mining the COG database by an extended phylogenetic patterns search.EPPS:通过扩展系统发育模式搜索挖掘COG数据库
Bioinformatics. 2003 Apr 12;19(6):784-5. doi: 10.1093/bioinformatics/btg089.
4
Expanded microbial genome coverage and improved protein family annotation in the COG database.COG数据库中微生物基因组覆盖范围的扩大及蛋白质家族注释的改进。
Nucleic Acids Res. 2015 Jan;43(Database issue):D261-9. doi: 10.1093/nar/gku1223. Epub 2014 Nov 26.
5
Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea.41个古菌基因组的直系同源基因簇及其对古菌进化基因组学的意义。
Biol Direct. 2007 Nov 27;2:33. doi: 10.1186/1745-6150-2-33.
6
Updated clusters of orthologous genes for Archaea: a complex ancestor of the Archaea and the byways of horizontal gene transfer.古菌的更新直系同源基因簇:古菌的复杂祖先和水平基因转移的旁路。
Biol Direct. 2012 Dec 14;7:46. doi: 10.1186/1745-6150-7-46.
7
PCOGR: phylogenetic COG ranking as an online tool to judge the specificity of COGs with respect to freely definable groups of organisms.PCOGR:系统发育COG排名,作为一种在线工具,用于判断COG相对于可自由定义的生物群体的特异性。
BMC Bioinformatics. 2004 Oct 15;5:150. doi: 10.1186/1471-2105-5-150.
8
A pursuit of lineage-specific and niche-specific proteome features in the world of archaea.追寻古菌世界中谱系特异性和生态位特异性的蛋白质组特征。
BMC Genomics. 2012 Jun 12;13:236. doi: 10.1186/1471-2164-13-236.
9
COG database update: focus on microbial diversity, model organisms, and widespread pathogens.COG 数据库更新:重点关注微生物多样性、模式生物和广泛存在的病原体。
Nucleic Acids Res. 2021 Jan 8;49(D1):D274-D281. doi: 10.1093/nar/gkaa1018.
10
Thermophilic prokaryotes have characteristic patterns of codon usage, amino acid composition and nucleotide content.嗜热原核生物具有密码子使用、氨基酸组成和核苷酸含量的特征模式。
Gene. 2003 Oct 23;317(1-2):39-47. doi: 10.1016/s0378-1119(03)00660-7.

引用本文的文献

1
CompoDynamics: a comprehensive database for characterizing sequence composition dynamics.CompoDynamics:用于描述序列组成动态的综合数据库。
Nucleic Acids Res. 2022 Jan 7;50(D1):D962-D969. doi: 10.1093/nar/gkab979.
2
Aerobic prokaryotes do not have higher GC contents than anaerobic prokaryotes, but obligate aerobic prokaryotes have.好氧原核生物的 GC 含量不比厌氧原核生物高,但专性需氧原核生物的 GC 含量比厌氧原核生物高。
BMC Evol Biol. 2019 Jan 28;19(1):35. doi: 10.1186/s12862-019-1365-8.
3
Application of Subspace Clustering in DNA Sequence Analysis.

本文引用的文献

1
Overview and comparison of ortholog databases.直系同源数据库概述与比较
Drug Discov Today Technol. 2006 Summer;3(2):137-43. doi: 10.1016/j.ddtec.2006.06.002.
2
The NCBI Taxonomy database.NCBI 分类数据库。
Nucleic Acids Res. 2012 Jan;40(Database issue):D136-43. doi: 10.1093/nar/gkr1178. Epub 2011 Dec 1.
3
Amino acid metabolic origin as an evolutionary influence on protein sequence in yeast.氨基酸代谢起源作为对酵母蛋白质序列的一种进化影响。
子空间聚类在DNA序列分析中的应用。
J Comput Biol. 2015 Oct;22(10):940-52. doi: 10.1089/cmb.2015.0084. Epub 2015 Jul 10.
4
Evolution of a genome-encoded bias in amino acid biosynthetic pathways is a potential indicator of amino acid dynamics in the environment.氨基酸生物合成途径中基因组编码偏好性的演变是环境中氨基酸动态变化的一个潜在指标。
Mol Biol Evol. 2014 Nov;31(11):2865-78. doi: 10.1093/molbev/msu225. Epub 2014 Aug 12.
J Mol Evol. 2009 May;68(5):490-7. doi: 10.1007/s00239-009-9218-5. Epub 2009 Apr 9.
4
Extension of the COG and arCOG databases by amino acid and nucleotide sequences.通过氨基酸和核苷酸序列扩展COG和arCOG数据库。
BMC Bioinformatics. 2008 Nov 13;9:479. doi: 10.1186/1471-2105-9-479.
5
Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea.41个古菌基因组的直系同源基因簇及其对古菌进化基因组学的意义。
Biol Direct. 2007 Nov 27;2:33. doi: 10.1186/1745-6150-2-33.
6
Evolutionarily conserved optimization of amino acid biosynthesis.氨基酸生物合成的进化保守优化
J Mol Evol. 2007 Aug;65(2):186-96. doi: 10.1007/s00239-007-0013-x. Epub 2007 Aug 7.
7
Correlation between codon usage and thermostability.密码子使用与热稳定性之间的相关性。
Extremophiles. 2006 Oct;10(5):479-81. doi: 10.1007/s00792-006-0533-0. Epub 2006 Jul 8.
8
Evidence of selection for low cognate amino acid bias in amino acid biosynthetic enzymes.氨基酸生物合成酶中低同源氨基酸偏好性选择的证据。
Mol Microbiol. 2005 May;56(4):1017-34. doi: 10.1111/j.1365-2958.2005.04566.x.
9
The roles of ribosomal proteins in the structure assembly, and evolution of the large ribosomal subunit.核糖体蛋白在大核糖体亚基的结构组装及进化中的作用。
J Mol Biol. 2004 Jun 25;340(1):141-77. doi: 10.1016/j.jmb.2004.03.076.
10
Preferred amino acids and thermostability.优选氨基酸与热稳定性。
Genet Mol Res. 2003 Dec 30;2(4):383-93.