Ruggiero C, Sacile R, Rauch G
Department of Communication, Computer and System Sciences, Genoa University, Italy.
IEEE Trans Biomed Eng. 1993 Nov;40(11):1114-21. doi: 10.1109/10.245628.
Artificial neural networks have been recently applied with success for protein secondary structure prediction. So far, one of the two main aspects on which neural net performance depends, the topology of the net, has been considered. The present work addresses the other main aspect, the building up of the learning set. We present a criterion to build up suitable learning sets based on the alpha-helix percentage. Starting from a set of several well known proteins, we formed 7 groups of proteins with similar helix percentages and we used them for the learning of the same neural net. We found that the best secondary structure prediction for each of the tested proteins (not belonging to the initial set) was the one obtained using the learning set whose helix percentage was closest to that of the tested protein. The accuracy of correct prediction of our method on three types of secondary structure (alpha-helix, beta-sheet and coil), has been compared with the accuracy of other secondary structure prediction methods.
人工神经网络最近已成功应用于蛋白质二级结构预测。到目前为止,神经网络性能所依赖的两个主要方面之一,即网络的拓扑结构,已得到考虑。目前的工作涉及另一个主要方面,即学习集的构建。我们提出了一个基于α-螺旋百分比来构建合适学习集的标准。从一组几个知名蛋白质开始,我们形成了7组具有相似螺旋百分比的蛋白质,并将它们用于同一个神经网络的学习。我们发现,对于每个测试蛋白质(不属于初始集),最佳的二级结构预测是使用螺旋百分比最接近测试蛋白质的学习集获得的。我们的方法对三种二级结构(α-螺旋、β-折叠和卷曲)的正确预测准确率,已与其他二级结构预测方法的准确率进行了比较。