Liu Na, Wang Tianming
Department of Applied Mathematics, Dalian University of Technology, Dalian 116024, China.
J Mol Graph Model. 2007 Mar;25(6):852-5. doi: 10.1016/j.jmgm.2006.08.006. Epub 2006 Aug 30.
Since the concept of structural classes of proteins was proposed, the problem of protein classification has been tackled by many groups. Most of their classification criteria are based only on the helix/strand contents of proteins. In this paper, we proposed a method for protein structural classification based on their secondary structure sequences. It is a classification scheme that can confirm existing classifications. Here a mathematical model is constructed to describe protein secondary structure sequences, in which each protein secondary structure sequence corresponds to a transition probability matrix that characterizes and differentiates protein structure numerically. Its application to a set of real data has indicated that our method can classify protein structures correctly. The final classification result is shown schematically. So it is visual to observe the structural classifications, which is different from traditional methods.
自从蛋白质结构类别的概念被提出以来,许多研究团队都致力于解决蛋白质分类问题。他们大多的分类标准仅基于蛋白质的螺旋/链状结构含量。在本文中,我们提出了一种基于蛋白质二级结构序列的蛋白质结构分类方法。这是一种能够验证现有分类的分类方案。在此构建了一个数学模型来描述蛋白质二级结构序列,其中每个蛋白质二级结构序列都对应一个转移概率矩阵,该矩阵从数值上表征并区分蛋白质结构。将其应用于一组实际数据表明,我们的方法能够正确地对蛋白质结构进行分类。最终的分类结果以示意图的形式展示。因此,观察结构分类变得直观,这与传统方法不同。