Liu Yanling, Yang Hong, Zheng Chu, Wang Ke, Yan Jingjing, Cao Hongyan, Zhang Yanbo
Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China.
Department of Mathematics, Changzhi Medical College, Changzhi, China.
Front Genet. 2022 Apr 13;13:862272. doi: 10.3389/fgene.2022.862272. eCollection 2022.
Long non-coding RNAs (lncRNAs) play significant roles in the disease process. Understanding the pathological mechanisms of lncRNAs during the course of various diseases will help clinicians prevent and treat diseases. With the emergence of high-throughput techniques, many biological experiments have been developed to study lncRNA-disease associations. Because experimental methods are costly, slow, and laborious, a growing number of computational models have emerged. Here, we present a new approach using network consistency projection and bi-random walk (NCP-BiRW) to infer hidden lncRNA-disease associations. First, integrated similarity networks for lncRNAs and diseases were constructed by merging similarity information. Subsequently, network consistency projection was applied to calculate space projection scores for lncRNAs and diseases, which were then introduced into a bi-random walk method for association prediction. To test model performance, we employed 5- and 10-fold cross-validation, with the area under the receiver operating characteristic curve as the evaluation indicator. The computational results showed that our method outperformed the other five advanced algorithms. In addition, the novel method was applied to another dataset in the Mammalian ncRNA-Disease Repository (MNDR) database and showed excellent performance. Finally, case studies were carried out on atherosclerosis and leukemia to confirm the effectiveness of our method in practice. In conclusion, we could infer lncRNA-disease associations using the NCP-BiRW model, which may benefit biomedical studies in the future.
长链非编码RNA(lncRNAs)在疾病过程中发挥着重要作用。了解lncRNAs在各种疾病过程中的病理机制将有助于临床医生预防和治疗疾病。随着高通量技术的出现,已经开展了许多生物学实验来研究lncRNA与疾病的关联。由于实验方法成本高、速度慢且费力,越来越多的计算模型应运而生。在此,我们提出一种使用网络一致性投影和双向随机游走(NCP-BiRW)的新方法来推断潜在的lncRNA与疾病的关联。首先,通过合并相似性信息构建lncRNAs和疾病的综合相似性网络。随后,应用网络一致性投影来计算lncRNAs和疾病的空间投影分数,然后将其引入双向随机游走方法进行关联预测。为了测试模型性能,我们采用了5折和10折交叉验证,以受试者工作特征曲线下面积作为评估指标。计算结果表明,我们的方法优于其他五种先进算法。此外,该新方法应用于哺乳动物非编码RNA-疾病知识库(MNDR)数据库中的另一个数据集,并表现出优异的性能。最后,对动脉粥样硬化和白血病进行了案例研究,以证实我们的方法在实际应用中的有效性。总之,我们可以使用NCP-BiRW模型推断lncRNA与疾病的关联,这可能在未来有益于生物医学研究。