• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PCOGR:系统发育COG排名,作为一种在线工具,用于判断COG相对于可自由定义的生物群体的特异性。

PCOGR: phylogenetic COG ranking as an online tool to judge the specificity of COGs with respect to freely definable groups of organisms.

作者信息

Meereis Florian, Kaufmann Michael

机构信息

The Protein Chemistry Group, Institute of Neurobiochemistry, Witten/Herdecke University, Stockumer Str. 10, 58448 Witten, Germany.

出版信息

BMC Bioinformatics. 2004 Oct 15;5:150. doi: 10.1186/1471-2105-5-150.

DOI:10.1186/1471-2105-5-150
PMID:15488147
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC526202/
Abstract

BACKGROUND

The rapidly increasing number of completely sequenced genomes led to the establishment of the COG-database which, based on sequence homologies, assigns similar proteins from different organisms to clusters of orthologous groups (COGs). There are several bioinformatic studies that made use of this database to determine (hyper)thermophile-specific proteins by searching for COGs containing (almost) exclusively proteins from (hyper)thermophilic genomes. However, public software to perform individually definable group-specific searches is not available.

RESULTS

The tool described here exactly fills this gap. The software is accessible at http://www.uni-wh.de/pcogr and is linked to the COG-database. The user can freely define two groups of organisms by selecting for each of the (current) 66 organisms to belong either to groupA, to the reference groupB or to be ignored by the algorithm. Then, for all COGs a specificity index is calculated with respect to the specificity to groupA, i. e. high scoring COGs contain proteins from the most of groupA organisms while proteins from the most organisms assigned to groupB are absent. In addition to ranking all COGs according to the user defined specificity criteria, a graphical visualization shows the distribution of all COGs by displaying their abundance as a function of their specificity indexes.

CONCLUSIONS

This software allows detecting COGs specific to a predefined group of organisms. All COGs are ranked in the order of their specificity and a graphical visualization allows recognizing (i) the presence and abundance of such COGs and (ii) the phylogenetic relationship between groupA- and groupB-organisms. The software also allows detecting putative protein-protein interactions, novel enzymes involved in only partially known biochemical pathways, and alternate enzymes originated by convergent evolution.

摘要

背景

全基因组测序数量的迅速增加促成了COG数据库的建立,该数据库基于序列同源性,将来自不同生物体的相似蛋白质归入直系同源群(COG)。有多项生物信息学研究利用该数据库,通过搜索那些(几乎)只包含来自嗜热(超嗜热)基因组蛋白质的COG,来确定嗜热(超嗜热)菌特异性蛋白质。然而,目前尚无能够进行用户自定义的特定群组搜索的公共软件。

结果

本文所述工具恰好填补了这一空白。该软件可从http://www.uni-wh.de/pcogr获取,并与COG数据库相链接。用户可通过为当前66种生物体中的每一种选择归入A组、归入参考B组或被算法忽略,从而自由定义两组生物体。然后,针对所有COG计算相对于A组的特异性指数,即得分高的COG包含来自大多数A组生物体的蛋白质,而来自大多数归入B组生物体的蛋白质则不存在。除了根据用户定义的特异性标准对所有COG进行排序外,图形化可视化还通过将所有COG的丰度显示为其特异性指数的函数,展示了它们的分布情况。

结论

该软件能够检测特定预定义生物体组特有的COG。所有COG均按其特异性排序,图形化可视化有助于识别(i)此类COG的存在及丰度,以及(ii)A组和B组生物体之间的系统发育关系。该软件还能够检测假定的蛋白质 - 蛋白质相互作用、参与部分已知生化途径的新型酶,以及由趋同进化产生的替代酶。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da58/526202/2d0aa681ca8a/1471-2105-5-150-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da58/526202/26baa337f868/1471-2105-5-150-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da58/526202/6f8c8d0791e5/1471-2105-5-150-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da58/526202/2d0aa681ca8a/1471-2105-5-150-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da58/526202/26baa337f868/1471-2105-5-150-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da58/526202/6f8c8d0791e5/1471-2105-5-150-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da58/526202/2d0aa681ca8a/1471-2105-5-150-3.jpg

相似文献

1
PCOGR: phylogenetic COG ranking as an online tool to judge the specificity of COGs with respect to freely definable groups of organisms.PCOGR:系统发育COG排名,作为一种在线工具,用于判断COG相对于可自由定义的生物群体的特异性。
BMC Bioinformatics. 2004 Oct 15;5:150. doi: 10.1186/1471-2105-5-150.
2
A global gene evolution analysis on Vibrionaceae family using phylogenetic profile.利用系统发育图谱对弧菌科进行全基因组进化分析。
BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S23. doi: 10.1186/1471-2105-8-S1-S23.
3
Detecting non-orthology in the COGs database and other approaches grouping orthologs using genome-specific best hits.在直系同源基因簇(COGs)数据库中检测非直系同源关系,以及使用特定基因组最佳匹配来对直系同源基因进行分组的其他方法。
Nucleic Acids Res. 2006 Jul 11;34(11):3309-16. doi: 10.1093/nar/gkl433. Print 2006.
4
PhyloGena--a user-friendly system for automated phylogenetic annotation of unknown sequences.PhyloGena——一个用于对未知序列进行自动系统发育注释的用户友好型系统。
Bioinformatics. 2007 Apr 1;23(7):793-801. doi: 10.1093/bioinformatics/btm016. Epub 2007 Mar 1.
5
The SYSTERS Protein Family Database in 2005.2005年的SYSTERS蛋白质家族数据库。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D226-9. doi: 10.1093/nar/gki030.
6
The COG database: new developments in phylogenetic classification of proteins from complete genomes.COG数据库:来自完整基因组的蛋白质系统发育分类的新进展。
Nucleic Acids Res. 2001 Jan 1;29(1):22-8. doi: 10.1093/nar/29.1.22.
7
Prosecutor: parameter-free inference of gene function for prokaryotes using DNA microarray data, genomic context and multiple gene annotation sources.起诉人:利用DNA微阵列数据、基因组背景和多种基因注释来源对原核生物基因功能进行无参数推断。
BMC Genomics. 2008 Oct 21;9:495. doi: 10.1186/1471-2164-9-495.
8
EPPS: mining the COG database by an extended phylogenetic patterns search.EPPS:通过扩展系统发育模式搜索挖掘COG数据库
Bioinformatics. 2003 Apr 12;19(6):784-5. doi: 10.1093/bioinformatics/btg089.
9
Comparative genomic analysis by microbial COGs self-attraction rate.基于微生物直系同源基因簇自吸引率的比较基因组分析
J Theor Biol. 2009 Jun 21;258(4):513-20. doi: 10.1016/j.jtbi.2009.01.035. Epub 2009 Feb 13.
10
Taxonomic colouring of phylogenetic trees of protein sequences.蛋白质序列系统发育树的分类着色。
BMC Bioinformatics. 2006 Feb 17;7:79. doi: 10.1186/1471-2105-7-79.

引用本文的文献

1
On the cytotoxicity of HCR-NTPase in the neuroblastoma cell line SH-SY5Y.HCR-NTPase对神经母细胞瘤细胞系SH-SY5Y的细胞毒性研究
BMC Res Notes. 2009 Jun 11;2:102. doi: 10.1186/1756-0500-2-102.
2
Extension of the COG and arCOG databases by amino acid and nucleotide sequences.通过氨基酸和核苷酸序列扩展COG和arCOG数据库。
BMC Bioinformatics. 2008 Nov 13;9:479. doi: 10.1186/1471-2105-9-479.
3
PHOG: a database of supergenomes built from proteome complements.PHOG:一个基于蛋白质组互补构建的超基因组数据库。

本文引用的文献

1
Thermophile-specific proteins: the gene product of aq_1292 from Aquifex aeolicus is an NTPase.嗜热菌特异性蛋白:来自嗜热栖热菌的aq_1292基因产物是一种NTP酶。
BMC Biochem. 2003 Sep 23;4:12. doi: 10.1186/1471-2091-4-12.
2
The COG database: an updated version includes eukaryotes.COG数据库:更新版本涵盖真核生物。
BMC Bioinformatics. 2003 Sep 11;4:41. doi: 10.1186/1471-2105-4-41.
3
EPPS: mining the COG database by an extended phylogenetic patterns search.EPPS:通过扩展系统发育模式搜索挖掘COG数据库
BMC Evol Biol. 2006 Jun 22;6:52. doi: 10.1186/1471-2148-6-52.
4
Crystal structure of THEP1 from the hyperthermophile Aquifex aeolicus: a variation of the RecA fold.嗜热栖热菌中THEP1的晶体结构:RecA折叠的一种变体
BMC Struct Biol. 2005 Mar 20;5:7. doi: 10.1186/1472-6807-5-7.
Bioinformatics. 2003 Apr 12;19(6):784-5. doi: 10.1093/bioinformatics/btg089.
4
Potential genomic determinants of hyperthermophily.嗜热菌的潜在基因组决定因素。
Trends Genet. 2003 Apr;19(4):172-6. doi: 10.1016/S0168-9525(03)00047-7.
5
A hot story from comparative genomics: reverse gyrase is the only hyperthermophile-specific protein.比较基因组学中的一个热门故事:反向回旋酶是唯一一种嗜热菌特有的蛋白质。
Trends Genet. 2002 May;18(5):236-7. doi: 10.1016/s0168-9525(02)02650-1.
6
The COG database: new developments in phylogenetic classification of proteins from complete genomes.COG数据库:来自完整基因组的蛋白质系统发育分类的新进展。
Nucleic Acids Res. 2001 Jan 1;29(1):22-8. doi: 10.1093/nar/29.1.22.
7
Who's your neighbor? New computational approaches for functional genomics.谁是你的邻居?功能基因组学的新计算方法。
Nat Biotechnol. 2000 Jun;18(6):609-13. doi: 10.1038/76443.
8
X-ray scattering studies of Methylophilus methylotrophus (sp. W3A1) electron-transferring flavoprotein. Evidence for multiple conformational states and an induced fit mechanism for assembly with trimethylamine dehydrogenase.嗜甲基甲基ophilus(菌株W3A1)电子传递黄素蛋白的X射线散射研究。多种构象状态的证据以及与三甲胺脱氢酶组装的诱导契合机制。
J Biol Chem. 2000 Jul 14;275(28):21349-54. doi: 10.1074/jbc.M001564200.
9
Assigning protein functions by comparative genome analysis: protein phylogenetic profiles.通过比较基因组分析确定蛋白质功能:蛋白质系统发育谱
Proc Natl Acad Sci U S A. 1999 Apr 13;96(8):4285-8. doi: 10.1073/pnas.96.8.4285.
10
A genomic perspective on protein families.蛋白质家族的基因组视角。
Science. 1997 Oct 24;278(5338):631-7. doi: 10.1126/science.278.5338.631.