Mulder Nicola J
Faculty of Health Sciences, National Bioinformatics Network Node, Institute for Infectious Diseases and Molecular Medicine, University of Cape Town, Cape Town, South Africa.
Methods Mol Biol. 2010;609:83-95. doi: 10.1007/978-1-60327-241-4_5.
Proteins are composed of functional units, or domains, that can be found alone or in combination with other domains. Analysis of protein domain architectures and of the movement of protein domains within and across genomes provides clues about the evolution of protein function. Proteins are classified into families and domains by publicly available tools and databases that use known protein domains to predict further members among new protein sequences. Currently, at least 80% of the sequences in the main protein sequence databases can be classified using these tools, providing a large data set for analyzing protein domain architectures. Each of the protein domain databases provides an intuitive web interface for viewing and analyzing its domain classifications and makes its data freely available for download. Some of the main protein family and domain databases are described here, along with their web-based tools for analyzing domain architectures.