Suppr超能文献

一种涉及对二级结构进行聚类分析以识别蛋白质结构域的自动化方法。

An automatic method involving cluster analysis of secondary structures for the identification of domains in proteins.

作者信息

Sowdhamini R, Blundell T L

机构信息

Department of Crystallography, Birkbeck College, London, United Kingdom.

出版信息

Protein Sci. 1995 Mar;4(3):506-20. doi: 10.1002/pro.5560040317.

Abstract

With a growing number of structures available in the Brookhaven Protein Data Bank, automatic methods for domain identification are required for the construction of databases. Domains are considered to be clusters of secondary structure elements. Thus, helices and strands are first clustered using intersecondary structural distances between C alpha positions, and dendrograms based on this distance measure are used to identify domains. Individual domains are recognized by a disjoint factor, which enables the automatic identification and classification into disjoint, interacting, and conjoint domains. Application to a database of 83 protein families and 18 unique structures shows that the approach provides an effective delineation of boundaries and identifies those proteins that can be considered as a single domain. A quantitative estimate of the interaction between domains has been proposed. The database of protein domains is a useful tool for understanding protein folding, for recognizing protein folds, and for understanding structure-activity relationships.

摘要

随着布鲁克海文蛋白质数据库中可用结构数量的不断增加,构建数据库需要用于结构域识别的自动方法。结构域被认为是二级结构元件的簇。因此,首先使用Cα位置之间的二级结构间距离对螺旋和链进行聚类,并基于此距离度量的树状图来识别结构域。单个结构域通过一个不相交因子来识别,该因子能够自动识别并分类为不相交、相互作用和联合结构域。应用于一个包含83个蛋白质家族和18个独特结构的数据库表明,该方法能够有效地划分边界,并识别出那些可被视为单个结构域的蛋白质。已经提出了对结构域之间相互作用的定量估计。蛋白质结构域数据库是理解蛋白质折叠、识别蛋白质折叠以及理解结构-活性关系的有用工具。

相似文献

引用本文的文献

2
An ambiguity principle for assigning protein structural domains.一种用于分配蛋白质结构域的不明确性原理。
Sci Adv. 2017 Jan 13;3(1):e1600552. doi: 10.1126/sciadv.1600552. eCollection 2017 Jan.
6
Generation of a consensus protein domain dictionary.生成共识蛋白结构域词典。
Bioinformatics. 2011 Jan 1;27(1):46-54. doi: 10.1093/bioinformatics/btq625. Epub 2010 Nov 9.
9
Classification of protein folds.蛋白质折叠的分类
Mol Biotechnol. 2007 Jul;36(3):238-47. doi: 10.1007/s12033-007-0032-2.

本文引用的文献

3
4
Binary discontinuous compact protein domains.二元不连续紧密蛋白结构域
Protein Eng. 1994 Mar;7(3):335-40. doi: 10.1093/protein/7.3.335.
6
Parser for protein folding units.蛋白质折叠单元解析器。
Proteins. 1994 Jul;19(3):256-68. doi: 10.1002/prot.340190309.
8
Location of structural domains in protein.蛋白质中结构域的位置。
Biochemistry. 1981 Nov 10;20(23):6544-52. doi: 10.1021/bi00526a005.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验