Comparative analysis of protein domain organization.

Authors

Ye Yuzhen, Godzik Adam

Affiliation

Program in Bioinformatics and Systems Biology, The Burnham Institute, La Jolla, California 92037, USA.

Publication

Genome Res. 2004 Mar;14(3):343-53. doi: 10.1101/gr.1610504.

Abstract

We have developed a set of graph theory-based tools, which we call Comparative Analysis of Protein Domain Organization (CADO), to survey and compare protein domain organizations of different organisms. In the language of CADO, the organization of protein domains in a given organism is shown as a domain graph in which protein domains are represented as vertices, and domain combinations, defined as instances of two domains found in one protein, are represented as edges. CADO provides a new way to analyze and compare whole proteomes, including identifying the consensus and difference of domain organization between organisms. CADO was used to analyze and compare >50 bacterial, archaeal, and eukaryotic genomes. Examples and overviews presented here include the analysis of the modularity of domain graphs and the functional study of domains based on the graph topology. We also report on the results of comparing domain graphs of two organisms, Pyrococcus horikoshii (an extremophile) and Haemophilus influenzae (a parasite with reduced genome) with other organisms. Our comparison provides new insights into the genome organization of these organisms. Finally, we report on the specific domain combinations characterizing the three kingdoms of life, and the kingdom "signature" domain organizations derived from those specific domain combinations.
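The domain-graph representation described in the abstract can be illustrated with a minimal Python sketch. This is not the CADO implementation. It assumes that a "domain combination" means any two distinct domains annotated on the same protein, and that repeats of one domain within a protein are ignored; the paper's exact definition may differ. The function names (`domain_graph`, `compare_graphs`) and the toy proteomes with placeholder domain labels `D1` to `D4` are hypothetical and carry no biological meaning.

```python
from itertools import combinations


def domain_graph(proteome):
    """Build a domain graph from a mapping of protein id -> list of domains.

    Vertices are domains; an edge joins two distinct domains that occur
    together in at least one protein (assumed definition, see lead-in).
    """
    vertices = set()
    edges = set()
    for domains in proteome.values():
        unique = sorted(set(domains))  # ignore repeats within one protein
        vertices.update(unique)
        # frozenset makes each edge undirected: {D1, D2} == {D2, D1}
        edges.update(frozenset(pair) for pair in combinations(unique, 2))
    return vertices, edges


def compare_graphs(graph_a, graph_b):
    """Split domain combinations into shared and organism-specific sets."""
    _, edges_a = graph_a
    _, edges_b = graph_b
    return {
        "shared": edges_a & edges_b,   # consensus of the two organisms
        "only_a": edges_a - edges_b,   # combinations specific to organism A
        "only_b": edges_b - edges_a,   # combinations specific to organism B
    }


# Toy, made-up proteomes (hypothetical protein ids and domain labels).
organism_a = {"a1": ["D1", "D2", "D3"], "a2": ["D1"]}
organism_b = {"b1": ["D2", "D3"], "b2": ["D1", "D4"]}

graph_a = domain_graph(organism_a)
graph_b = domain_graph(organism_b)
result = compare_graphs(graph_a, graph_b)

for label, edge_set in result.items():
    print(label, sorted(tuple(sorted(edge)) for edge in edge_set))
```

Storing edges as sets of unordered pairs turns the "consensus and difference" comparison into plain set intersection and difference. A graph library would be a reasonable substitute if topological measures such as modularity were also needed.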
