Meng Jun, Shi Lin, Luan Yushi
School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning, China.
School of Life Science and Biotechnology, Dalian University of Technology, Dalian, Liaoning, China.
PLoS One. 2014 Jul 22;9(7):e103181. doi: 10.1371/journal.pone.0103181. eCollection 2014.
Confident identification of microRNA-target interactions is significant for studying the function of microRNA (miRNA). Although some computational miRNA target prediction methods have been proposed for plants, results of various methods tend to be inconsistent and usually lead to more false positive. To address these issues, we developed an integrated model for identifying plant miRNA-target interactions.
Three online miRNA target prediction toolkits and machine learning algorithms were integrated to identify and analyze Arabidopsis thaliana miRNA-target interactions. Principle component analysis (PCA) feature extraction and self-training technology were introduced to improve the performance. Results showed that the proposed model outperformed the previously existing methods. The results were validated by using degradome sequencing supported Arabidopsis thaliana miRNA-target interactions. The proposed model constructed on Arabidopsis thaliana was run over Oryza sativa and Vitis vinifera to demonstrate that our model is effective for other plant species.
The integrated model of online predictors and local PCA-SVM classifier gained credible and high quality miRNA-target interactions. The supervised learning algorithm of PCA-SVM classifier was employed in plant miRNA target identification for the first time. Its performance can be substantially improved if more experimentally proved training samples are provided.
准确识别微小RNA(miRNA)与靶标的相互作用对于研究miRNA的功能具有重要意义。尽管已经提出了一些针对植物的计算miRNA靶标预测方法,但各种方法的结果往往不一致,并且通常会导致更多的假阳性。为了解决这些问题,我们开发了一种用于识别植物miRNA与靶标相互作用的综合模型。
整合了三个在线miRNA靶标预测工具包和机器学习算法,以识别和分析拟南芥miRNA与靶标的相互作用。引入主成分分析(PCA)特征提取和自训练技术以提高性能。结果表明,所提出的模型优于先前现有的方法。通过使用支持拟南芥miRNA与靶标相互作用的降解组测序对结果进行了验证。在拟南芥上构建的所提出的模型在水稻和葡萄上运行,以证明我们的模型对其他植物物种有效。
在线预测器和局部PCA-SVM分类器的综合模型获得了可靠且高质量的miRNA与靶标的相互作用。PCA-SVM分类器的监督学习算法首次用于植物miRNA靶标识别。如果提供更多经过实验验证的训练样本,其性能可以得到显著提高。