Han Ye, Liu Yuanning, Zhang Hao, He Fei, Shu Chonghe, Dong Liyan
Department of Computer Science and Technology, Jilin University, Changchun, Jilin, China; Key Laboratory of Symbolic Computation and Knowledge Engineering, Ministry of Education, Jilin University, Changchun, China.
Department of Computer Science and Information Technology, Northeast Normal University, Changchun, Jilin, China; Department of Environment, Northeast Normal University, Changchun, Jilin, China; Institute of Computational Biology, Northeast Normal University, Changchun, China.
Comput Math Methods Med. 2017;2017:5043984. doi: 10.1155/2017/5043984. Epub 2017 Jan 24.
Small interfering RNAs (siRNAs) induce posttranscriptional gene silencing in various organisms. siRNAs targeted to different positions of the same gene show different effectiveness; hence, predicting siRNA activity is a crucial step. In this paper, we developed and evaluated a powerful tool named "siRNApred" with a new mixed feature set to predict siRNA activity. To improve the prediction accuracy, we proposed 2-3NTs as our new features. A Random Forest siRNA activity prediction model was constructed using the feature set selected by our proposed Binary Search Feature Selection (BSFS) algorithm. Experimental data demonstrated that the binding site of the Argonaute protein correlates with siRNA activity. "siRNApred" is effective for selecting active siRNAs, and the prediction results demonstrate that our method can outperform other current siRNA activity prediction methods in terms of prediction accuracy.
小干扰RNA(siRNAs)在多种生物体中诱导转录后基因沉默。靶向同一基因不同位置的siRNAs表现出不同的有效性;因此,预测siRNA活性是关键步骤。在本文中,我们开发并评估了一个名为“siRNApred”的强大工具,它具有新的混合特征集来预测siRNA活性。为提高预测准确性,我们提出将2-3个核苷酸作为新特征。使用我们提出的二分搜索特征选择(BSFS)算法选择的特征集构建了随机森林siRNA活性预测模型。实验数据表明,AGO蛋白的结合位点与siRNA活性相关。“siRNApred”在选择活性siRNAs方面是有效的,预测结果表明我们的方法在预测准确性方面优于其他当前的siRNA活性预测方法。