IEEE/ACM Trans Comput Biol Bioinform. 2019 Mar-Apr;16(2):396-406. doi: 10.1109/TCBB.2017.2701379. Epub 2017 May 4.
Accumulating experimental evidence has indicated that long non-coding RNAs (lncRNAs) are critical for the regulation of cellular biological processes implicated in many human diseases. However, only relatively few experimentally supported lncRNA-disease associations have been reported. Developing effective computational methods to infer lncRNA-disease associations is becoming increasingly important. Current network-based algorithms typically use a network representation to identify novel associations between lncRNAs and diseases. But these methods are concentrated on specific entities of interest (lncRNAs and diseases) and they do not allow to consider networks with more than two types of entities. Considering the limitations in previous computational methods, we develop a new global network-based framework, LncRDNetFlow, to prioritize disease-related lncRNAs. LncRDNetFlow utilizes a flow propagation algorithm to integrate multiple networks based on a variety of biological information including lncRNA similarity, protein-protein interactions, disease similarity, and the associations between them to infer lncRNA-disease associations. We show that LncRDNetFlow performs significantly better than the existing state-of-the-art approaches in cross-validation. To further validate the reproducibility of the performance, we use the proposed method to identify the related lncRNAs for ovarian cancer, glioma, and cervical cancer. The results are encouraging. Many predicted lncRNAs in the top list have been verified by the biological studies.
越来越多的实验证据表明,长非编码 RNA(lncRNA)在许多人类疾病相关的细胞生物学过程的调控中起着关键作用。然而,目前仅有相对较少的经过实验验证的 lncRNA-疾病关联被报道。因此,开发有效的计算方法来推断 lncRNA-疾病关联变得越来越重要。目前基于网络的算法通常使用网络表示来识别 lncRNA 和疾病之间的新关联。但是,这些方法集中于特定的感兴趣实体(lncRNA 和疾病),并且不允许考虑具有超过两种类型实体的网络。考虑到之前计算方法的局限性,我们开发了一种新的基于全局网络的框架 LncRDNetFlow,用于对疾病相关的 lncRNA 进行优先级排序。LncRDNetFlow 使用流传播算法,基于包括 lncRNA 相似性、蛋白质-蛋白质相互作用、疾病相似性以及它们之间的关联在内的多种生物学信息,整合多个网络,以推断 lncRNA-疾病关联。我们的实验结果表明,LncRDNetFlow 在交叉验证中明显优于现有的最先进方法。为了进一步验证性能的可重复性,我们使用所提出的方法来识别卵巢癌、神经胶质瘤和宫颈癌的相关 lncRNA。结果令人鼓舞。在排名靠前的 lncRNA 中,许多预测的 lncRNA 已被生物学研究证实。