Ishida Takashi, Nakamura Shugo, Shimizu Kentaro
Department of Biotechnology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan.
Proteins. 2006 Sep 1;64(4):940-7. doi: 10.1002/prot.21047.
We developed a novel knowledge-based residue environment potential for assessing the quality of protein structures in protein structure prediction. The potential uses the contact number of residues in a protein structure and the absolute contact number of residues predicted from its amino acid sequence using a new prediction method based on a support vector regression (SVR). The contact number of an amino acid residue in a protein structure is defined by the number of residues around a given residue. First, the contact number of each residue is predicted using SVR from an amino acid sequence of a target protein. Then, the potential of the protein structure is calculated from the probability distribution of the native contact numbers corresponding to the predicted ones. The performance of this potential is compared with other score functions using decoy structures to identify both native structure from other structures and near-native structures from nonnative structures. This potential improves not only the ability to identify native structures from other structures but also the ability to discriminate near-native structures from nonnative structures.
我们开发了一种基于知识的新型残基环境势,用于评估蛋白质结构预测中蛋白质结构的质量。该势利用蛋白质结构中残基的接触数以及使用基于支持向量回归(SVR)的新预测方法从其氨基酸序列预测的残基绝对接触数。蛋白质结构中氨基酸残基的接触数由给定残基周围的残基数量定义。首先,使用SVR从目标蛋白质的氨基酸序列预测每个残基的接触数。然后,根据与预测接触数相对应的天然接触数的概率分布计算蛋白质结构的势。使用诱饵结构将该势的性能与其他评分函数进行比较,以从其他结构中识别天然结构,并从不正确结构中识别接近天然的结构。这种势不仅提高了从其他结构中识别天然结构的能力,还提高了从不正确结构中区分接近天然结构的能力。