Niu Mengting, Zou Quan, Wang Chunyu
Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan 610000, China.
Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang 324000, China.
Bioinformatics. 2022 Apr 12;38(8):2246-2253. doi: 10.1093/bioinformatics/btac079.
With the analysis of the characteristic and function of circular RNAs (circRNAs), people have realized that they play a critical role in the diseases. Exploring the relationship between circRNAs and diseases is of far-reaching significance for searching the etiopathogenesis and treatment of diseases. Nevertheless, it is inefficient to learn new associations only through biotechnology.
Consequently, we present a computational method, GMNN2CD, which employs a graph Markov neural network (GMNN) algorithm to predict unknown circRNA-disease associations. First, used verified associations, we calculate semantic similarity and Gaussian interactive profile kernel similarity (GIPs) of the disease and the GIPs of circRNA and then merge them to form a unified descriptor. After that, GMNN2CD uses a fusion feature variational map autoencoder to learn deep features and uses a label propagation map autoencoder to propagate tags based on known associations. Based on variational inference, GMNN alternate training enhances the ability of GMNN2CD to obtain high-efficiency high-dimensional features from low-dimensional representations. Finally, 5-fold cross-validation of five benchmark datasets shows that GMNN2CD is superior to the state-of-the-art methods. Furthermore, case studies have shown that GMNN2CD can detect potential associations.
The source code and data are available at https://github.com/nmt315320/GMNN2CD.git.
随着对环状RNA(circRNA)特征和功能的分析,人们已经意识到它们在疾病中起着关键作用。探索circRNA与疾病之间的关系对于寻找疾病的病因和治疗方法具有深远意义。然而,仅通过生物技术来发现新的关联效率较低。
因此,我们提出了一种计算方法GMNN2CD,它采用图马尔可夫神经网络(GMNN)算法来预测未知的circRNA-疾病关联。首先,利用已验证的关联,我们计算疾病的语义相似度和高斯交互轮廓核相似度(GIPs)以及circRNA的GIPs,然后将它们合并以形成统一的描述符。之后,GMNN2CD使用融合特征变分图自动编码器来学习深度特征,并使用标签传播图自动编码器基于已知关联来传播标签。基于变分推理,GMNN交替训练增强了GMNN2CD从低维表示中获取高效高维特征的能力。最后,对五个基准数据集进行的五折交叉验证表明,GMNN2CD优于现有方法。此外,案例研究表明GMNN2CD可以检测潜在的关联。