• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PRODOC:一个用于比较拴系蛋白结构域架构并内置远亲结构域家族信息的资源库。

PRODOC: a resource for the comparison of tethered protein domain architectures with in-built information on remotely related domain families.

作者信息

Krishnadev O, Rekha N, Pandit S B, Abhiman S, Mohanty S, Swapna L S, Gore S, Srinivasan N

机构信息

Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India.

出版信息

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W126-9. doi: 10.1093/nar/gki474.

DOI:10.1093/nar/gki474
PMID:15980440
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1160235/
Abstract

PROtein Domain Organization and Comparison (PRODOC) comprises several programs that enable convenient comparison of proteins as a sequence of domains. The in-built dataset currently consists of approximately 698 000 proteins from 192 organisms with complete genomic data, and all the SWISSPROT proteins obtained from the Pfam database. All the entries in PRODOC are represented as a sequence of functional domains, assigned using hidden Markov models, instead of as a sequence of amino acids. On average 69% of the proteins in the proteomes and 49% of the residues are covered by functional domain assignments. Software tools allow the user to query the dataset with a sequence of domains and identify proteins with the same or a jumbled or circularly permuted arrangement of domains. As it is proposed that proteins with jumbled or the same domain sequences have similar functions, this search tool is useful in assigning the overall function of a multi-domain protein. Unique features of PRODOC include the generation of alignments between multi-domain proteins on the basis of the sequence of domains and in-built information on distantly related domain families forming superfamilies. It is also possible using PRODOC to identify domain sharing and gene fusion events across organisms. An exhaustive genome-genome comparison tool in PRODOC also enables the detection of successive domain sharing and domain fusion events across two organisms. The tool permits the identification of gene clusters involved in similar biological processes in two closely related organisms. The URL for PRODOC is http://hodgkin.mbu.iisc.ernet.in/~prodoc.

摘要

蛋白质结构域组织与比较(PRODOC)包含多个程序,可方便地将蛋白质作为结构域序列进行比较。内置数据集目前包含来自192个具有完整基因组数据的生物体的约698000种蛋白质,以及从Pfam数据库获得的所有SWISSPROT蛋白质。PRODOC中的所有条目均表示为使用隐马尔可夫模型分配的功能结构域序列,而非氨基酸序列。蛋白质组中平均69%的蛋白质和49%的残基被功能结构域分配所覆盖。软件工具允许用户使用结构域序列查询数据集,并识别具有相同、混乱或环形排列结构域的蛋白质。由于有人提出具有混乱或相同结构域序列的蛋白质具有相似功能,因此该搜索工具在确定多结构域蛋白质的整体功能方面很有用。PRODOC的独特功能包括基于结构域序列生成多结构域蛋白质之间的比对,以及关于形成超家族的远缘相关结构域家族的内置信息。使用PRODOC还可以识别不同生物体之间的结构域共享和基因融合事件。PRODOC中的一个详尽的基因组-基因组比较工具还能够检测两个生物体之间连续的结构域共享和结构域融合事件。该工具允许识别两个密切相关生物体中参与相似生物学过程的基因簇。PRODOC的网址是http://hodgkin.mbu.iisc.ernet.in/~prodoc 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c567/1160235/1076df45f718/gki474f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c567/1160235/1076df45f718/gki474f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c567/1160235/1076df45f718/gki474f1.jpg

相似文献

1
PRODOC: a resource for the comparison of tethered protein domain architectures with in-built information on remotely related domain families.PRODOC:一个用于比较拴系蛋白结构域架构并内置远亲结构域家族信息的资源库。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W126-9. doi: 10.1093/nar/gki474.
2
Cascade PSI-BLAST web server: a remote homology search tool for relating protein domains.级联PSI-BLAST网络服务器:一种用于关联蛋白质结构域的远程同源性搜索工具。
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W143-6. doi: 10.1093/nar/gkl157.
3
SUPFAM--a database of potential protein superfamily relationships derived by comparing sequence-based and structure-based families: implications for structural genomics and function annotation in genomes.SUPFAM——一个通过比较基于序列和基于结构的家族而得出的潜在蛋白质超家族关系数据库:对结构基因组学和基因组功能注释的意义。
Nucleic Acids Res. 2002 Jan 1;30(1):289-93. doi: 10.1093/nar/30.1.289.
4
KinG: a database of protein kinases in genomes.KinG:基因组中蛋白激酶的数据库。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D153-5. doi: 10.1093/nar/gkh019.
5
The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis.CATH结构域数据库以及相关资源Gene3D和DHS为基因组分析提供了全面的结构域家族信息。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D247-51. doi: 10.1093/nar/gki024.
6
SUPFAM: a database of sequence superfamilies of protein domains.SUPFAM:一个蛋白质结构域序列超家族数据库。
BMC Bioinformatics. 2004 Mar 15;5:28. doi: 10.1186/1471-2105-5-28.
7
PASS2: an automated database of protein alignments organised as structural superfamilies.PASS2:一个以结构超家族形式组织的蛋白质比对自动化数据库。
BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.
8
GenDiS: Genomic Distribution of protein structural domain Superfamilies.GenDiS:蛋白质结构域超家族的基因组分布
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D252-5. doi: 10.1093/nar/gki087.
9
MulPSSM: a database of multiple position-specific scoring matrices of protein domain families.MulPSSM:蛋白质结构域家族的多位置特异性评分矩阵数据库。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D243-6. doi: 10.1093/nar/gkj043.
10
Accurate domain identification with structure-anchored hidden Markov models, saHMMs.基于结构锚定隐马尔可夫模型(saHMMs)的精确领域识别。
Proteins. 2009 Aug 1;76(2):343-52. doi: 10.1002/prot.22349.

引用本文的文献

1
Evolution of domain promiscuity in eukaryotic genomes--a perspective from the inferred ancestral domain architectures.真核生物基因组中结构域混杂现象的演变——基于推断的祖先结构域架构的视角
Mol Biosyst. 2011 Mar;7(3):784-92. doi: 10.1039/c0mb00182a. Epub 2010 Dec 3.
2
DAhunter: a web-based server that identifies homologous proteins by comparing domain architecture.DAhunter:一个通过比较结构域架构来识别同源蛋白质的基于网络的服务器。
Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W60-4. doi: 10.1093/nar/gkn172. Epub 2008 Apr 14.

本文引用的文献

1
Interaction interfaces of protein domains are not topologically equivalent across families within superfamilies: Implications for metabolic and signaling pathways.蛋白质结构域的相互作用界面在超家族内的不同家族间拓扑结构并不等同:对代谢和信号通路的影响。
Proteins. 2005 Feb 1;58(2):339-53. doi: 10.1002/prot.20319.
2
SUPFAM: a database of sequence superfamilies of protein domains.SUPFAM:一个蛋白质结构域序列超家族数据库。
BMC Bioinformatics. 2004 Mar 15;5:28. doi: 10.1186/1471-2105-5-28.
3
Structure, function and evolution of multidomain proteins.
多结构域蛋白的结构、功能与进化
Curr Opin Struct Biol. 2004 Apr;14(2):208-16. doi: 10.1016/j.sbi.2004.03.011.
4
Comparative analysis of protein domain organization.蛋白质结构域组织的比较分析。
Genome Res. 2004 Mar;14(3):343-53. doi: 10.1101/gr.1610504.
5
Ensembl 2004.Ensembl 2004。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D468-70. doi: 10.1093/nar/gkh038.
6
The SUPERFAMILY database in 2004: additions and improvements.2004年的SUPERFAMILY数据库:新增内容与改进
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D235-9. doi: 10.1093/nar/gkh117.
7
SMART 4.0: towards genomic data integration.SMART 4.0:迈向基因组数据整合
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D142-4. doi: 10.1093/nar/gkh088.
8
Multi-domain protein families and domain pairs: comparison with known structures and a random model of domain recombination.多结构域蛋白家族和结构域对:与已知结构的比较及结构域重组的随机模型
J Struct Funct Genomics. 2003;4(2-3):67-78. doi: 10.1023/a:1026113408773.
9
Eukaryotic domain evolution inferred from genome comparisons.从基因组比较推断真核生物域的进化。
Curr Opin Genet Dev. 2003 Dec;13(6):623-8. doi: 10.1016/j.gde.2003.10.004.
10
The COG database: an updated version includes eukaryotes.COG数据库:更新版本涵盖真核生物。
BMC Bioinformatics. 2003 Sep 11;4:41. doi: 10.1186/1471-2105-4-41.