Wang Long-Hui, Liu Juan, Li Yan-Fu, Zhou Huai-Bei
School of Computer, Wuhan University, Wuhan 430079, China.
Genome Inform. 2004;15(2):181-90.
Protein structure prediction is one of the most important problems in modern computational biology. Protein secondary structure prediction is a key step in prediction of protein tertiary structure. There have emerged many methods based on machine learning techniques, such as neural networks (NN) and support vector machine (SVM) etc., to focus on the prediction of the secondary structures. In this paper, a new method was proposed based on SVM. Different from the existing methods, this method takes into account of the physical-chemical properties and structure properties of amino acids. When tested on the most popular dataset CB513, it achieved a Q(3) accuracy of 0.7844, which illustrates that it is one of the top range methods for protein of secondary structure prediction.
蛋白质结构预测是现代计算生物学中最重要的问题之一。蛋白质二级结构预测是预测蛋白质三级结构的关键步骤。已经出现了许多基于机器学习技术的方法,如神经网络(NN)和支持向量机(SVM)等,来专注于二级结构的预测。本文提出了一种基于支持向量机的新方法。与现有方法不同,该方法考虑了氨基酸的物理化学性质和结构性质。在最流行的数据集CB513上进行测试时,它实现了0.7844的Q(3)准确率,这表明它是蛋白质二级结构预测的顶级方法之一。