Ding Jun, Li Xiaoman, Hu Haiyan
Department of Electrical Engineering and Computer Science.
Burnett School of Biomedical Science, College of Medicine, University of Central Florida, Orlando, FL 32816, USA.
Bioinformatics. 2016 Sep 15;32(18):2768-75. doi: 10.1093/bioinformatics/btw318. Epub 2016 May 20.
The identification of microRNA (miRNA) target sites is fundamentally important for studying gene regulation. There are dozens of computational methods available for miRNA target site prediction. Despite their existence, we still cannot reliably identify miRNA target sites, partially due to our limited understanding of the characteristics of miRNA target sites. The recently published CLASH (crosslinking ligation and sequencing of hybrids) data provide an unprecedented opportunity to study the characteristics of miRNA target sites and improve miRNA target site prediction methods.
Applying four different machine learning approaches to the CLASH data, we identified seven new features of miRNA target sites. Combining these new features with those commonly used by existing miRNA target prediction algorithms, we developed an approach called TarPmiR for miRNA target site prediction. Testing on two human and one mouse non-CLASH datasets, we showed that TarPmiR predicted more than 74.2% of true miRNA target sites in each dataset. Compared with three existing approaches, we demonstrated that TarPmiR is superior to these existing approaches in terms of better recall and better precision.
The TarPmiR software is freely available at http://hulab.ucf.edu/research/projects/miRNA/TarPmiR/ CONTACTS: haihu@cs.ucf.edu or xiaoman@mail.ucf.edu
Supplementary data are available at Bioinformatics online.
识别微小RNA(miRNA)靶位点对于研究基因调控至关重要。目前有数十种计算方法可用于miRNA靶位点预测。尽管有这些方法,但我们仍无法可靠地识别miRNA靶位点,部分原因是我们对miRNA靶位点特征的了解有限。最近发表的CLASH(杂交交联连接与测序)数据为研究miRNA靶位点特征及改进miRNA靶位点预测方法提供了前所未有的机会。
将四种不同的机器学习方法应用于CLASH数据,我们识别出了miRNA靶位点的七个新特征。将这些新特征与现有miRNA靶标预测算法常用的特征相结合,我们开发了一种名为TarPmiR的miRNA靶位点预测方法。在两个人类和一个小鼠非CLASH数据集上进行测试,我们发现TarPmiR在每个数据集中预测出了超过74.2%的真实miRNA靶位点。与三种现有方法相比,我们证明TarPmiR在召回率和精确率方面均优于这些现有方法。
TarPmiR软件可在http://hulab.ucf.edu/research/projects/miRNA/TarPmiR/免费获取。联系方式:haihu@cs.ucf.edu或xiaoman@mail.ucf.edu
补充数据可在《生物信息学》在线获取。