Chou K C, Zhang C T, Kézdy F J
Upjohn Laboratories, Kalamazoo, Michigan 49001-4940.
Proteins. 1993 Jun;16(2):195-204. doi: 10.1002/prot.340160206.
A vector projection method is proposed to predict the cleavability of oligopeptides by extended-specificity site proteases. For an enzyme with eight specificity subsites the substrate octapeptide can be uniquely expressed as a vector in an 8-dimensional space, whose eight bases correspond to the amino acids at the eight subsites, P4, P3, P2, P1, P1', P2', P3', and P4', respectively. The component of such a characteristic vector on each of the eight bases is defined as the frequency of an amino acid occurring at a given site. These frequencies were derived from a set of octapeptides known to be cleaved by HIV protease. The cleavability of an octapeptide can then be estimated from the projection of its characteristic vector on an idealized, optimally cleavable vector. The high ratio of correct prediction vs. total prediction for the data in both the training and the testing sets indicates that the new method is self-consistent and efficient. It provides a rapid and accurate algorithm for analyzing the specificity of any multi-subsite enzyme for which there is no coupling between subsites. In particular, it is useful for predicting the cleavability of an oligopeptide by either HIV-1 or HIV-2 protease, and hence offers a supplementary means for finding effective inhibitors of HIV protease as potential drugs against AIDS.
提出了一种向量投影方法来预测寡肽被扩展特异性位点蛋白酶切割的可能性。对于具有八个特异性亚位点的酶,底物八肽可以在八维空间中唯一地表示为一个向量,其八个基分别对应于八个亚位点P4、P3、P2、P1、P1'、P2'、P3'和P4'处的氨基酸。这种特征向量在八个基上的分量被定义为给定位点上出现的氨基酸的频率。这些频率来自一组已知可被HIV蛋白酶切割的八肽。然后,可以根据其特征向量在理想化的、最佳可切割向量上的投影来估计八肽的可切割性。训练集和测试集中数据的正确预测与总预测的高比率表明新方法是自洽且有效的。它为分析任何亚位点之间不存在耦合的多位点酶的特异性提供了一种快速准确的算法。特别是,它对于预测寡肽被HIV-1或HIV-2蛋白酶切割的可能性很有用,因此为寻找作为抗艾滋病潜在药物的HIV蛋白酶有效抑制剂提供了一种补充手段。