Sadeghi Mehdi, Parto Sahar, Arab Shahriar, Ranjbar Bijan
Department of Biophysics, National Institute of Genetic Engineering and Biotechnology, P.O.Box 14155-6343, Tehran, Iran.
FEBS Lett. 2005 Jun 20;579(16):3397-400. doi: 10.1016/j.febslet.2005.04.082.
We have used a statistical approach for protein secondary structure prediction based on information theory and simultaneously taking into consideration pairwise residue types and conformational states. Since the prediction of residue secondary structure by one residue window sliding make ambiguity in state prediction, we used a dynamic programming algorithm to find the path with maximum score. A score system for residue pairs in particular conformations is derived for adjacent neighbors up to ten residue apart in sequence. The three state overall per-residue accuracy, Q3, of this method in a jackknife test with dataset created from PDBSELECT is more than 70%.
我们基于信息论,采用了一种统计方法来预测蛋白质二级结构,同时考虑了成对的残基类型和构象状态。由于通过一个残基窗口滑动来预测残基二级结构会在状态预测中产生歧义,我们使用动态规划算法来找到得分最高的路径。对于序列中相隔至多十个残基的相邻邻居,推导了特定构象中残基对的评分系统。在使用从PDBSELECT创建的数据集进行的留一法测试中,该方法的三态全残基准确率Q3超过70%。