School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China, School of Computer and Information Engineering, Harbin University of Commerce, Harbin 150028, China, School of Information Science and Technology, Heilongjiang University, Harbin 150080, China and College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
Bioinformatics. 2015 Jun 1;31(11):1805-15. doi: 10.1093/bioinformatics/btv039. Epub 2015 Jan 23.
Identifying microRNAs associated with diseases (disease miRNAs) is helpful for exploring the pathogenesis of diseases. Because miRNAs fulfill function via the regulation of their target genes and because the current number of experimentally validated targets is insufficient, some existing methods have inferred potential disease miRNAs based on the predicted targets. It is difficult for these methods to achieve excellent performance due to the high false-positive and false-negative rates for the target prediction results. Alternatively, several methods have constructed a network composed of miRNAs based on their associated diseases and have exploited the information within the network to predict the disease miRNAs. However, these methods have failed to take into account the prior information regarding the network nodes and the respective local topological structures of the different categories of nodes. Therefore, it is essential to develop a method that exploits the more useful information to predict reliable disease miRNA candidates.
miRNAs with similar functions are normally associated with similar diseases and vice versa. Therefore, the functional similarity between a pair of miRNAs is calculated based on their associated diseases to construct a miRNA network. We present a new prediction method based on random walk on the network. For the diseases with some known related miRNAs, the network nodes are divided into labeled nodes and unlabeled nodes, and the transition matrices are established for the two categories of nodes. Furthermore, different categories of nodes have different transition weights. In this way, the prior information of nodes can be completely exploited. Simultaneously, the various ranges of topologies around the different categories of nodes are integrated. In addition, how far the walker can go away from the labeled nodes is controlled by restarting the walking. This is helpful for relieving the negative effect of noisy data. For the diseases without any known related miRNAs, we extend the walking on a miRNA-disease bilayer network. During the prediction process, the similarity between diseases, the similarity between miRNAs, the known miRNA-disease associations and the topology information of the bilayer network are exploited. Moreover, the importance of information from different layers of network is considered. Our method achieves superior performance for 18 human diseases with AUC values ranging from 0.786 to 0.945. Moreover, case studies on breast neoplasms, lung neoplasms, prostatic neoplasms and 32 diseases further confirm the ability of our method to discover potential disease miRNAs.
A web service for the prediction and analysis of disease miRNAs is available at http://bioinfolab.stx.hk/midp/.
识别与疾病相关的 microRNAs(疾病 microRNAs)有助于探索疾病的发病机制。由于 microRNA 通过调节其靶基因发挥功能,并且目前经过实验验证的靶标数量不足,因此一些现有方法基于预测靶标推断潜在的疾病 microRNA。由于靶标预测结果的假阳性和假阴性率较高,这些方法很难达到优异的性能。或者,有几种方法基于其相关疾病构建了由 microRNA 组成的网络,并利用网络内的信息来预测疾病 microRNA。然而,这些方法未能考虑到网络节点的先验信息以及不同类别节点的各自局部拓扑结构。因此,开发一种利用更有用信息来预测可靠疾病 microRNA 候选物的方法是至关重要的。
具有相似功能的 microRNA 通常与相似的疾病相关,反之亦然。因此,基于它们相关的疾病计算一对 microRNA 之间的功能相似性,以构建一个 microRNA 网络。我们提出了一种基于网络上随机游走的新预测方法。对于一些已知相关 microRNA 的疾病,网络节点分为标记节点和未标记节点,并为这两类节点建立转移矩阵。此外,不同类别的节点具有不同的转移权重。这样,可以充分利用节点的先验信息。同时,整合了不同类别节点周围的各种拓扑范围。此外,通过重新启动行走来控制行者可以远离标记节点的距离。这有助于缓解嘈杂数据的负面影响。对于没有任何已知相关 microRNA 的疾病,我们扩展了在 microRNA-疾病双层网络上的行走。在预测过程中,利用疾病之间的相似性、microRNA 之间的相似性、已知的 microRNA-疾病关联以及双层网络的拓扑信息。此外,还考虑了网络不同层信息的重要性。我们的方法在 18 个人类疾病上取得了卓越的性能,AUC 值范围为 0.786 到 0.945。此外,对乳腺肿瘤、肺肿瘤、前列腺肿瘤和 32 种疾病的案例研究进一步证实了我们的方法发现潜在疾病 microRNA 的能力。
用于预测和分析疾病 microRNA 的网络服务可在 http://bioinfolab.stx.hk/midp/ 获得。