Suppr超能文献

一种预测蛋白质折叠类型的新方法。

A new approach to predicting protein folding types.

作者信息

Chou K C, Zhang C T

机构信息

Upjohn Research Laboratories, Kalamazoo, Michigan 49001.

出版信息

J Protein Chem. 1993 Apr;12(2):169-78. doi: 10.1007/BF01026038.

Abstract

A new method is proposed for predicting the folding type of a protein according to its amino acid composition based on the following physical picture: (1) a protein is characterized as a vector of 20-dimensional space, in which its 20 components are defined by the compositions of its 20 amino acids; and (2) the similarity of two proteins is proportional to the mutual projection of their characterized vectors, and hence inversely proportional to the size of their correlation angle. Thus, the prediction is performed by calculating the correlation angles of the vector for the predicted protein with a set of standard vectors representing the norms of four protein folding types (i.e., all alpha, all beta, alpha + beta, and alpha/beta). In comparison with the existing methods, the new method has the merits of yielding a higher rate of correct prediction, displaying a more intuitive physical picture, and being convenient in application. For instance, in predicting the 64 proteins in the development set based on which the standard vectors are derived, the average accuracy rate is 83.6%, which is higher than that obtained for the same set of proteins by any of the existing methods. The average accuracy predicted for an independent set of 35 proteins of known X-ray structure is 91.4%, which is significantly higher than any of the reported accuracies so far, implying that the new method is of great value in practical application. All of these have demonstrated that the new method as proposed in this paper is characterized by an improved feature in both self-consistency and extrapolating-effectiveness.

摘要

基于以下物理图像,提出了一种根据氨基酸组成预测蛋白质折叠类型的新方法:(1)将蛋白质表征为20维空间中的向量,其中其20个分量由其20种氨基酸的组成定义;(2)两种蛋白质的相似度与其表征向量的相互投影成正比,因此与它们相关角的大小成反比。因此,通过计算预测蛋白质的向量与一组代表四种蛋白质折叠类型(即全α、全β、α + β和α/β)规范的标准向量的相关角来进行预测。与现有方法相比,新方法具有预测正确率更高、物理图像更直观、应用方便等优点。例如,在预测基于其导出标准向量的开发集中的64种蛋白质时,平均准确率为83.6%,高于现有任何方法对同一组蛋白质获得的准确率。对一组已知X射线结构的35种蛋白质的独立集预测的平均准确率为91.4%,明显高于迄今为止报道的任何准确率,这意味着新方法在实际应用中具有很大价值。所有这些都表明本文提出的新方法在自洽性和外推有效性方面都具有改进的特征。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验