Suppr超能文献

在从同源蛋白质的比对序列进行结构预测中使用氨基酸环境依赖性替换表和构象倾向。II. 二级结构。

Use of amino acid environment-dependent substitution tables and conformational propensities in structure prediction from aligned sequences of homologous proteins. II. Secondary structures.

作者信息

Wako H, Blundell T L

机构信息

Department of Crystallography, Birkbeck College, University of London, U.K.

出版信息

J Mol Biol. 1994 May 20;238(5):693-708. doi: 10.1006/jmbi.1994.1330.

Abstract

A three-step method is presented to predict secondary structures of proteins, by utilizing aligned sequences of homologous proteins. Mean propensities and amino acid substitution patterns at a given site in the aligned sequences are first evaluated for four conformational states (i.e. alpha-helix, beta-strand, buried coil and exposed coil). Capping rules are applied in order to define boundaries of the secondary-structure segments more precisely. In the second step beta-strand is predicted by searching regions predicted as coil for the two patterns characteristic of alternating and fully buried beta-strands. The complete sequences of the solvent-accessibility classes predicted by substitution tables and propensities are also searched using Fourier transform methods for alpha-helical periodicity. After applying capping rules, the alpha-helices and beta-strands predicted in the second step replace, where appropriate, the conformational states predicted in the first step. Finally, in the third step, if one of the four conformational states is assigned to the residues at an equivalent site of aligned sequences in more than a given fraction of the proteins, such a state is reassigned to all the residues at that site. The method is applied to 13 protein families, which contain four folding types, alpha, beta, alpha/beta and alpha + beta. The accuracy of the prediction ranges from 60 to 79% (mean percentage over the 13 families is 69%). For comparison the Garnier-Osguthorpe-Robson (GOR) method is also applied to them. Although the mean prediction accuracy for the GOR method, 58%, can be improved to 63% by applying the second and third steps in this method, there remain four families with less than 55% accuracy. The mean accuracy is relatively higher and poor predictions are reduced in this method.

摘要

本文提出了一种三步法,通过利用同源蛋白质的比对序列来预测蛋白质的二级结构。首先针对四种构象状态(即α螺旋、β链、埋藏卷曲和暴露卷曲)评估比对序列中给定位置的平均倾向和氨基酸取代模式。应用封端规则以便更精确地定义二级结构片段的边界。在第二步中,通过在预测为卷曲的区域中搜索交替和完全埋藏的β链的两种特征模式来预测β链。还使用傅里叶变换方法搜索由取代表和倾向预测的溶剂可及性类别的完整序列,以寻找α螺旋周期性。应用封端规则后,第二步中预测的α螺旋和β链在适当情况下取代第一步中预测的构象状态。最后,在第三步中,如果在超过给定比例的蛋白质的比对序列的等效位点处,四种构象状态之一被分配给残基,则将该状态重新分配给该位点的所有残基。该方法应用于13个蛋白质家族,这些家族包含四种折叠类型,即α、β、α/β和α + β。预测的准确率范围为60%至79%(13个家族的平均百分比为69%)。为了进行比较,Garnier-Osguthorpe-Robson(GOR)方法也应用于这些家族。尽管GOR方法的平均预测准确率为58%,通过应用此方法的第二步和第三步可提高到63%,但仍有四个家族的准确率低于55%。该方法的平均准确率相对较高,并且减少了较差的预测。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验