Miao Yan, Tang Xuan, Wang Chunyu, Sun Zhenyuan, Wang Guohua, Huang Shan
College of Computer and Control Engineering, Northeast Forestry University, Harbin, China.
Faculty of Computing, Harbin Institute of Technology, Harbin, China.
PLoS Comput Biol. 2025 Jun 19;21(6):e1013225. doi: 10.1371/journal.pcbi.1013225. eCollection 2025 Jun.
Circular RNA, a class of RNA molecules gaining widespread attentions, has been widely recognized as a potential biomarker for many diseases. In recent years, significant progress has been made in the study of the associations between circRNA and diseases. However, traditional experimental methods are often inefficient and costly, making computational models an effective alternative. Nevertheless, existing computational methods still face challenges such as data sparsity and the difficulty of confirming negative samples, which limits the accuracy of predictions. To address these challenges, a novel computational method, namely MVHGCN, is proposed based on multi-view and graph convolutional networks to predict potential associations between circRNA and diseases. MVHGCN first constructs a heterogeneous graph and generates feature descriptors by integrating multiple databases. Then it extracts different connection views of circRNA and diseases through meta-paths, maximizing the utilization of known association information, and aggregates deep feature information through graph convolutional networks. Finally, a MLP is used to predict the association scores. The experimental results show that MVHGCN significantly outperforms existing methods on benchmark datasets by 5-fold cross-validation. This research provides an effective new approach to studying the associations between circRNAs and diseases, capable of alleviating the problem of data sparsity and accurately identifying potential associations.
环状RNA是一类受到广泛关注的RNA分子,已被广泛认可为许多疾病的潜在生物标志物。近年来,环状RNA与疾病之间关联的研究取得了重大进展。然而,传统实验方法往往效率低下且成本高昂,使得计算模型成为一种有效的替代方法。尽管如此,现有的计算方法仍然面临数据稀疏和难以确定阴性样本等挑战,这限制了预测的准确性。为应对这些挑战,基于多视图和图卷积网络提出了一种新颖的计算方法——MVHGCN,用于预测环状RNA与疾病之间的潜在关联。MVHGCN首先构建一个异构图,并通过整合多个数据库生成特征描述符。然后,它通过元路径提取环状RNA和疾病的不同连接视图,最大限度地利用已知关联信息,并通过图卷积网络聚合深度特征信息。最后,使用多层感知器预测关联分数。实验结果表明,在基准数据集上通过5折交叉验证,MVHGCN显著优于现有方法。本研究为研究环状RNA与疾病之间的关联提供了一种有效的新方法,能够缓解数据稀疏问题并准确识别潜在关联。