Saha Sudipto, Raghava Gajendra P S
Institute of Microbial Technology, Chandigarh, India.
In Silico Biol. 2007;7(4-5):369-87.
We have developed a method NTXpred for predicting neurotoxins and classifying them based on their function and origin. The dataset used in this study consists of 582 non-redundant, experimentally annotated neurotoxins obtained from Swiss-Prot. A number of modules have been developed for predicting neurotoxins using residue composition based on feed-forwarded neural network (FNN), recurrent neural network (RNN), support vector machine (SVM) and achieved maximum accuracy of 84.19%, 92.75%, 97.72% respectively. In addition, SVM modules have been developed for classifying neurotoxins based on their source (e.g., eubacteria, cnidarians, molluscs, arthropods have been and chordate) using amino acid composition and dipeptide composition and achieved maximum overall accuracy of 78.94% and 88.07% respectively. The overall accuracy increased to 92.10%, when the evolutionary information obtained from PSI-BLAST was combined with SVM module of source classification. We have also developed SVM modules for classifying neurotoxins based on functions using amino acid, dipeptide composition and achieved overall accuracy of 83.11%, 91.10% respectively. The overall accuracy of function classification improved to 95.11%, when PSI-BLAST output was combined with SVM module. All the modules developed in this study were evaluated using five-fold cross-validation technique. The NTXpred is available at www.imtech.res.in/raghava/ntxpred/ and mirror site at http://bioinformatics.uams.edu/mirror/ntxpred.
我们开发了一种名为NTXpred的方法,用于预测神经毒素并根据其功能和来源对它们进行分类。本研究中使用的数据集由从Swiss-Prot获得的582种非冗余、经过实验注释的神经毒素组成。已经开发了许多模块,使用基于前馈神经网络(FNN)、递归神经网络(RNN)、支持向量机(SVM)的残基组成来预测神经毒素,其最大准确率分别达到84.19%、92.75%、97.72%。此外,还开发了SVM模块,使用氨基酸组成和二肽组成根据神经毒素的来源(如真细菌、刺胞动物、软体动物、节肢动物和脊索动物)对其进行分类,其最大总体准确率分别为78.94%和88.07%。当将从PSI-BLAST获得的进化信息与来源分类的SVM模块相结合时,总体准确率提高到了92.10%。我们还开发了SVM模块,使用氨基酸、二肽组成根据功能对神经毒素进行分类,总体准确率分别为83.11%、91.10%。当将PSI-BLAST输出与SVM模块相结合时,功能分类的总体准确率提高到了95.11%。本研究中开发的所有模块均使用五重交叉验证技术进行评估。NTXpred可在www.imtech.res.in/raghava/ntxpred/获取,镜像站点为http://bioinformatics.uams.edu/mirror/ntxpred 。