Salamov A A, Solovyev V V
Department of Cell Biology, Baylor College of Medicine, Houston, TX 77030.
J Mol Biol. 1995 Mar 17;247(1):11-5. doi: 10.1006/jmbi.1994.0116.
Recently Yi & Lander used a neural network and nearest-neighbor method with a scoring system that combined a sequence-similarity matrix with the local structural environment scoring scheme described by Bowie and co-workers for predicting protein secondary structure. We have improved their scoring system by taking into consideration N and C-terminal positions of alpha-helices and beta-strands and also beta-turns as distinctive types of secondary structure. Another improvement, which also decreases the time of computation, is performed by restricting a data base with a smaller subset of proteins that are similar with a query sequence. Using multiple sequence alignments rather than single sequences and a simple jury decision procedure our method reaches a sustained overall three-state accuracy of 72.2%, which is better than that observed for the most accurate multilayered neural-network approach, tested on the same data set of 126 non-homologous protein chains.
最近,Yi和Lander使用了一种神经网络和最近邻方法,并结合了一种评分系统,该系统将序列相似性矩阵与Bowie及其同事描述的用于预测蛋白质二级结构的局部结构环境评分方案相结合。我们通过考虑α螺旋、β链的N端和C端位置以及作为独特二级结构类型的β转角,改进了他们的评分系统。另一个改进是通过用与查询序列相似的较小蛋白质子集限制数据库来实现的,这也减少了计算时间。使用多序列比对而非单序列,并采用简单的评判决策程序,我们的方法实现了72.2%的持续总体三态准确率,这比在126条非同源蛋白质链的相同数据集上测试的最精确的多层神经网络方法所观察到的准确率要好。