Zhao Yalin, Li Hua, Hou Yanyan, Cha Lei, Cao Yuan, Wang Ligui, Ying Xiaomin, Li Wuju
Center of Computational Biology, Beijing Institute of Basic Medical Sciences, Taiping Road 27#, Haidian District, Beijing 100850, China.
Biochem Biophys Res Commun. 2008 Jul 25;372(2):346-50. doi: 10.1016/j.bbrc.2008.05.046. Epub 2008 May 21.
Accurate prediction of sRNA targets plays a key role in determining sRNA functions. Here we introduced two mathematical models, sRNATargetNB and sRNATargetSVM, for prediction of sRNA targets using Nai ve Bayes method and support vector machines (SVM), respectively. The training dataset was composed of 46 positive samples (real sRNA-targets interaction) and 86 negative samples (no interaction between sRNA and targets). The leave-one-out cross-validation (LOOCV) classification accuracy was 91.67% for sRNATargetNB, and 100.00% for sRNATargetSVM. To evaluate the performance of the models, an independent test dataset was used, which contained 22 positive samples and 1700 randomly generated negative samples. The results showed that the classification accuracy, sensitivity, and specificity were 93.03%, 40.90%, and 93.71% for sRNATargetNB and 80.55%, 72.73%, and 80.65% for sRNATargetSVM, respectively. Therefore, the presented models provide support for experimental identification of sRNA targets. The related software and supplementary materials can be downloaded from webpage http://www.biosun.org.cn/srnatarget/.
准确预测小RNA(sRNA)的靶标在确定sRNA功能方面起着关键作用。在此,我们分别介绍了两种数学模型,即sRNATargetNB和sRNATargetSVM,用于使用朴素贝叶斯方法和支持向量机(SVM)预测sRNA靶标。训练数据集由46个阳性样本(真实的sRNA-靶标相互作用)和86个阴性样本(sRNA与靶标之间无相互作用)组成。sRNATargetNB的留一法交叉验证(LOOCV)分类准确率为91.67%,sRNATargetSVM的为100.00%。为了评估模型的性能,使用了一个独立测试数据集,其中包含22个阳性样本和1700个随机生成的阴性样本。结果表明,sRNATargetNB的分类准确率、灵敏度和特异性分别为93.03%、40.90%和93.71%,sRNATargetSVM的分别为80.55%、72.73%和80.65%。因此,所提出的模型为sRNA靶标的实验鉴定提供了支持。相关软件和补充材料可从网页http://www.biosun.org.cn/srnatarget/下载。