Suppr超能文献

蛋白质三级结构潜在的疏水序列周期性。

Underlying hydrophobic sequence periodicity of protein tertiary structure.

作者信息

Silverman B David

机构信息

IBM Thomas J. Watson Research Center, P. O. Box 218, Yorktown Heights, New York 10598, USA.

出版信息

J Biomol Struct Dyn. 2005 Feb;22(4):411-23. doi: 10.1080/07391102.2005.10507013.

Abstract

Hydropathy plots or window averages over local stretches of the sequence of residue hydrophobicity have revealed patterns related to various protein tertiary structural features. This has enabled identification of regions of the sequence that are at the surface or within the interior of globular soluble proteins, regions located within the lipid bilayer of transmembrane proteins, portions of the sequence that characterize repeating motifs, as well as motifs that usefully characterize different protein structural families. This, therefore, provides one example of the generally expressed maxim that "sequence determines structure". On the other hand, a number of previous investigations have shown the rapidly varying values of residue hydrophobicity along the sequence to be distributed approximately randomly. So one might question just how much of the sequence actually determines structure. It is, therefore, of interest to extract that part of this rapidly varying distribution of residue hydrophobicity that is responsible for the longer wavelength variations that correlate with protein tertiary structural features and to determine their prevalence within the entire distribution. This is accomplished by a finite Fourier analysis of the sequence of residue hydrophobicity and of a new measure of residue distance from the protein interior. Calculations are performed on a number of globins, immunoglobulins, cuprodoxins, and papain-like structures. The spectral power of the Fourier amplitudes of the frequencies extracted, whose inverse transforms underlie the windowed values of residue hydrophobicity is shown to be a small fraction of the total power of the hydrophobicity distribution and thereby consistent with a distribution that might appear to be predominantly random. The wide range of sequence identity between proteins having the same fold, all exhibiting similar small fractions of power amplitude that correlate with the longer wavelength inside-to-outside excursions of the amino acid residues, supports the general contention that close sequence identity is an expression of a close evolutionary relationship rather than an expression of structural similarity. Practical implications of the present analysis for protein structure prediction and engineering are also described.

摘要

亲水性图谱或对残基疏水性序列的局部片段进行窗口平均,揭示了与各种蛋白质三级结构特征相关的模式。这使得能够识别球状可溶性蛋白质表面或内部的序列区域、跨膜蛋白质脂质双层内的区域、表征重复基序的序列部分,以及有效表征不同蛋白质结构家族的基序。因此,这提供了“序列决定结构”这一普遍表述的一个例子。另一方面,先前的一些研究表明,沿着序列残基疏水性的快速变化值大致呈随机分布。所以人们可能会质疑,实际上有多少序列决定了结构。因此,提取残基疏水性这种快速变化分布中与蛋白质三级结构特征相关的较长波长变化的部分,并确定它们在整个分布中的普遍性,是很有意义的。这是通过对残基疏水性序列和从蛋白质内部到残基距离的一种新度量进行有限傅里叶分析来完成的。对多种球蛋白、免疫球蛋白、铜蓝蛋白和类木瓜蛋白酶结构进行了计算。所提取频率的傅里叶振幅的谱功率,其逆变换构成了残基疏水性的窗口值,结果表明它只是疏水性分布总功率的一小部分,从而与可能看似主要是随机的分布一致。具有相同折叠的蛋白质之间序列同一性的广泛范围,都表现出与氨基酸残基从内到外较长波长偏移相关的功率振幅的相似小部分,这支持了这样一种普遍观点,即紧密的序列同一性是密切进化关系的一种表达,而不是结构相似性的表达。本文还描述了当前分析对蛋白质结构预测和工程的实际意义。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验