Wang Dong, Lu Ming, Miao Jing, Li Tingting, Wang Edwin, Cui Qinghua
Department of Medical Informatics, Peking University Health Science Center, Beijing, China.
PLoS One. 2009;4(2):e4421. doi: 10.1371/journal.pone.0004421. Epub 2009 Feb 10.
Identifying the tissues in which a microRNA is expressed could enhance the understanding of the functions, the biological processes, and the diseases associated with that microRNA. However, the mechanisms of microRNA biogenesis and expression remain largely unclear and the identification of the tissues in which a microRNA is expressed is limited. Here, we present a machine learning based approach to predict whether an intronic microRNA show high co-expression with its host gene, by doing so, we could infer the tissues in which a microRNA is high expressed through the expression profile of its host gene. Our approach is able to achieve an accuracy of 79% in the leave-one-out cross validation and 95% on an independent testing dataset. We further estimated our method through comparing the predicted tissue specific microRNAs and the tissue specific microRNAs identified by biological experiments. This study presented a valuable tool to predict the co-expression patterns between human intronic microRNAs and their host genes, which would also help to understand the microRNA expression and regulation mechanisms. Finally, this framework can be easily extended to other species.
确定微小RNA(microRNA)表达的组织有助于加深对其功能、相关生物学过程以及所涉及疾病的理解。然而,微小RNA的生物合成和表达机制在很大程度上仍不清楚,且微小RNA表达组织的鉴定也受到限制。在此,我们提出一种基于机器学习的方法,用于预测内含子微小RNA是否与其宿主基因高度共表达,通过这种方式,我们可以根据其宿主基因的表达谱推断微小RNA高表达的组织。我们的方法在留一法交叉验证中准确率达到79%,在独立测试数据集上准确率为95%。我们通过比较预测的组织特异性微小RNA和通过生物学实验鉴定的组织特异性微小RNA进一步评估了我们的方法。本研究提出了一个预测人类内含子微小RNA与其宿主基因共表达模式的有价值工具,这也将有助于理解微小RNA的表达和调控机制。最后,该框架可以很容易地扩展到其他物种。