School of Computer Science, Guangdong University of Technology, Guangzhou, China.
Mol Genet Genomics. 2021 May;296(3):473-483. doi: 10.1007/s00438-021-01764-3. Epub 2021 Feb 15.
An increasing number of studies and experiments have demonstrated that long noncoding RNAs (lncRNAs) have a massive impact on various biological processes. Predicting potential associations between lncRNAs and diseases not only can improve our understanding of the molecular mechanisms of human diseases but also can facilitate the identification of biomarkers for disease diagnosis, treatment, and prevention. However, identifying such associations through experiments is costly and demanding, thereby prompting researchers to develop computational methods to complement these experiments. In this paper, we constructed a novel model called RWSF-BLP (a novel lncRNA-disease association prediction model using Random Walk-based multi-Similarity Fusion and Bidirectional Label Propagation), which applies an efficient random walk-based multi-similarity fusion (RWSF) method to fuse different similarity matrices and utilizes bidirectional label propagation to predict potential lncRNA-disease associations. Leave-one-out cross-validation (LOOCV) and 5-fold cross-validation (5-fold-CV) were implemented in the evaluation RWSF-BLP performance. Results showed that, RWSF-BLP has reliable AUCs of 0.9086 and 0.9115 ± 0.0044 under the framework of LOOCV and 5-fold-CV and outperformed other four canonical methods. Case studies on lung cancer and leukemia demonstrated that potential lncRNA-disease associations can be predicted through our method. Therefore, our method can accurately infer potential lncRNA-disease associations and may be a good choice in future biomedical research.
越来越多的研究和实验表明,长非编码 RNA(lncRNA)对各种生物过程有重大影响。预测 lncRNA 与疾病之间的潜在关联不仅可以提高我们对人类疾病分子机制的理解,还有助于识别疾病诊断、治疗和预防的生物标志物。然而,通过实验来识别这些关联是昂贵且费力的,因此促使研究人员开发计算方法来补充这些实验。在本文中,我们构建了一种名为 RWSF-BLP(一种使用基于随机游走的多相似性融合和双向标签传播的新型 lncRNA-疾病关联预测模型)的新型模型,该模型应用了一种高效的基于随机游走的多相似性融合(RWSF)方法来融合不同的相似性矩阵,并利用双向标签传播来预测潜在的 lncRNA-疾病关联。在评估 RWSF-BLP 性能时,我们采用了留一交叉验证(LOOCV)和 5 倍交叉验证(5-fold-CV)。结果表明,在 LOOCV 和 5-fold-CV 框架下,RWSF-BLP 的 AUC 值分别为 0.9086 和 0.9115±0.0044,优于其他四种标准方法。对肺癌和白血病的案例研究表明,通过我们的方法可以预测潜在的 lncRNA-疾病关联。因此,我们的方法可以准确推断潜在的 lncRNA-疾病关联,可能是未来生物医学研究的一个不错选择。