Analysis of the "thermodynamic information content" of a Homo sapiens structural database reveals hierarchical thermodynamic organization.

Authors

Larson Scott A, Hilser Vincent J

Affiliation

Department of Human Biological Chemistry and Genetics, 5.162 Medical Research Bldg., University of Texas Medical Branch, Galveston, TX 77555-1068, USA.

Publication information

Protein Sci. 2004 Jul;13(7):1787-801. doi: 10.1110/ps.04706204.

Abstract

Classification of the amounts and types of lower order structural elements in proteins is a prerequisite to effective comparisons between protein folds. In an effort to provide an additional vehicle for fold comparison, we present an alternative classification scheme whereby protein folds are represented in statistical thermodynamic terms in such a way as to illuminate the energetic building blocks within protein structures. The thermodynamic relationship is examined between amino acid sequences and the conformational ensembles for a database of 159 Homo sapiens protein structures ranging from 50 to 250 amino acids. Using hierarchical clustering, it is shown through fold-recognition experiments that (1) eight thermodynamic environmental descriptors sufficiently account for the energetic variation within the native state ensembles of the H. sapiens structural database, (2) an amino acid library of only six residue types is sufficient to encode >90% of the thermodynamic information required for fold specificity in the entire database, and (3) structural resolution of the statistically derived environments reveals sequential cooperative segments throughout the protein, which are independent of secondary structure. As the first level of thermodynamic organization in proteins, these segments represent the thermodynamic counterpart to secondary structure.
