Bamunu Mudiyanselage Thosini, Lei Xiujuan, Senanayake Nipuna, Zhang Yanqing, Pan Yi
Department of Statistics & Computer Science, University of Kelaniya, Sri Lanka.
School of Computer Science, Shaanxi Normal University, Xi'an 710119, China.
Methods. 2022 Feb;198:32-44. doi: 10.1016/j.ymeth.2021.10.008. Epub 2021 Nov 6.
Accumulated studies have discovered that circular RNAs (CircRNAs) are closely related to many complex human diseases. Due to this close relationship, CircRNAs can be used as good biomarkers for disease diagnosis and therapeutic targets for treatments. However, the number of experimentally verified circRNA-disease associations are still fewer and also conducting wet-lab experiments are constrained by the small scale and cost of time and labour. Therefore, effective computational methods are required to predict associations between circRNAs and diseases which will be promising candidates for small scale biological and clinical experiments. In this paper, we propose novel computational models based on Graph Convolution Networks (GCN) for the potential circRNA-disease association prediction. Currently most of the existing prediction methods use shallow learning algorithms. Instead, the proposed models combine the strengths of deep learning and graphs for the computation. First, they integrate multi-source similarity information into the association network. Next, models predict potential associations using graph convolution which explore this important relational knowledge of that network structure. Two circRNA-disease association prediction models, GCN based Node Classification (GCN-NC) and GCN based Link Prediction (GCN-LP) are introduced in this work and they demonstrate promising results in various experiments and outperforms other existing methods. Further, a case study proves that some of the predicted results of the novel computational models were confirmed by published literature and all top results could be verified using gene-gene interaction networks.
越来越多的研究发现,环状RNA(CircRNAs)与许多复杂的人类疾病密切相关。由于这种密切关系,CircRNAs可作为疾病诊断的良好生物标志物和治疗的靶点。然而,经实验验证的circRNA与疾病的关联数量仍然较少,而且进行湿实验室实验受到规模小、时间和劳动力成本的限制。因此,需要有效的计算方法来预测circRNAs与疾病之间的关联,这些关联将是小规模生物学和临床实验的有希望的候选者。在本文中,我们提出了基于图卷积网络(GCN)的新型计算模型,用于潜在的circRNA与疾病关联预测。目前,大多数现有的预测方法使用浅层学习算法。相反,所提出的模型将深度学习和图的优势结合起来进行计算。首先,它们将多源相似性信息整合到关联网络中。接下来,模型使用图卷积预测潜在关联,从而探索该网络结构的重要关系知识。本文介绍了两种circRNA与疾病关联预测模型,即基于GCN的节点分类(GCN-NC)和基于GCN的链接预测(GCN-LP),它们在各种实验中都显示出了有希望的结果,并且优于其他现有方法。此外,一个案例研究证明,新计算模型的一些预测结果得到了已发表文献的证实,所有顶级结果都可以使用基因-基因相互作用网络进行验证。