Universal partitioning of the hierarchical fold network of 50-residue segments in proteins.

Author information

Ito Jun-ichi, Sonobe Yuki, Ikeda Kazuyoshi, Tomii Kentaro, Higo Junichi

Affiliations

The Center for Advanced Medical Engineering and Informatics, Osaka University, Open Laboratories for Advanced Bioscience and Biotechnology, Suita, Osaka, Japan.

Publication information

BMC Struct Biol. 2009 May 20;9:34. doi: 10.1186/1472-6807-9-34.

Abstract

BACKGROUND

Several studies have demonstrated that protein fold space is structured hierarchically and that the relation between the number of protein families and the number of protein folds (or superfamilies) follows power-law statistics. We examined the internal structure and statistics of the fold space of 50-amino-acid-residue segments taken from various protein folds. We used inter-residue contact patterns to measure the tertiary structural similarity among segments. Using this similarity measure, the segments were classified into a number (Kc) of clusters, and we examined various values of Kc. The resolution with which segment tertiary structures are differentiated increases with increasing Kc. Furthermore, we constructed networks by linking structurally similar clusters.
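As an illustration of what a contact-pattern similarity between two segments can look like, here is a minimal Python sketch. The abstract does not specify the measure, so the details are assumptions made for the example: the 8 Å distance cutoff, the minimum sequence separation of 3, the Jaccard overlap score, and the toy coordinates.

```python
import math

def contact_set(coords, cutoff=8.0, min_separation=3):
    """Return the set of residue index pairs (i, j) closer than `cutoff`.

    `coords` is a list of (x, y, z) tuples, one per residue (e.g. C-alpha
    atoms). The cutoff and min_separation values are illustrative
    assumptions, not parameters taken from the paper.
    """
    contacts = set()
    n = len(coords)
    for i in range(n):
        for j in range(i + min_separation, n):
            if math.dist(coords[i], coords[j]) < cutoff:
                contacts.add((i, j))
    return contacts

def contact_similarity(a, b):
    """Jaccard overlap of two contact sets (assumed measure; 1.0 = identical)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)

# Toy 50-residue segments: a fully extended chain and a two-stranded hairpin.
straight = [(3.8 * i, 0.0, 0.0) for i in range(50)]
hairpin = ([(3.8 * i, 0.0, 0.0) for i in range(25)]
           + [(3.8 * (24 - i), 5.0, 0.0) for i in range(25)])

extended_contacts = contact_set(straight)   # no non-local contacts
hairpin_contacts = contact_set(hairpin)     # cross-strand contacts only
score = contact_similarity(extended_contacts, hairpin_contacts)
```

Segments with similar scores could then be grouped by any standard clustering method into Kc clusters; the choice of clustering algorithm is likewise not stated in the abstract.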

RESULTS

The network was partitioned persistently into four regions for Kc ≥ 1000. This main partitioning is consistent with earlier studies, which reported similar partitioning in classifications of protein domain structures. Furthermore, the network was partitioned naturally into several dozen sub-networks (i.e., communities): clusters within a sub-network were connected to one another by numerous links, whereas clusters in different sub-networks were connected by only a few. For Kc ≥ 1000, there were about 40 major sub-networks, and their contents were conserved. This sub-partitioning is a novel finding, suggesting that the network is structured hierarchically: segments construct a cluster, clusters form a sub-network, and sub-networks constitute a region. Additionally, the network was characterized by non-power-law statistics, which is also a novel finding.
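The community property described above (many links inside a sub-network, few between sub-networks) can be quantified for a given partition. The sketch below counts intra- and inter-community links and computes the standard Newman modularity Q for an undirected, unweighted graph. The toy graph and the community labels are hypothetical, and the abstract does not say which community-detection method the authors used.

```python
from collections import defaultdict

def link_statistics(edges, community):
    """Count links within communities and links between communities."""
    intra = sum(1 for u, v in edges if community[u] == community[v])
    return intra, len(edges) - intra

def modularity(edges, community):
    """Newman modularity Q = sum_c [ L_c / m - (d_c / (2m))^2 ].

    L_c is the number of links inside community c, d_c the total degree
    of its nodes, and m the total number of links.
    """
    m = len(edges)
    if m == 0:
        return 0.0
    inside = defaultdict(int)
    degree = defaultdict(int)
    for u, v in edges:
        degree[community[u]] += 1
        degree[community[v]] += 1
        if community[u] == community[v]:
            inside[community[u]] += 1
    return sum(inside[c] / m - (degree[c] / (2 * m)) ** 2 for c in degree)

# Toy network: nodes stand for clusters; two triangles joined by one link.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
community = {0: "A", 1: "A", 2: "A", 3: "B", 4: "B", 5: "B"}

intra_links, inter_links = link_statistics(edges, community)
q = modularity(edges, community)
```

A partition with Q well above zero has more intra-community links than expected for a random graph with the same node degrees, which is the sense in which a network divides "naturally" into communities.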

CONCLUSION

The main findings are: (1) The universe of 50-residue segments found here was characterized by non-power-law statistics and therefore differs from the universes previously reported for protein domains. (2) The 50-residue segments were partitioned persistently and universally into some dozens (ca. 40) of major sub-networks, irrespective of the number of clusters. (3) These major sub-networks encompassed 90% of all segments. Consequently, protein tertiary structure is constructed from a few dozen elements (sub-networks).


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3638/2693521/3ff6a338a735/1472-6807-9-34-1.jpg
