Cen Keliang, Xing Zheming, Wang Xuan, Wang Yadong, Li Junyi
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2556-2567. doi: 10.1109/TCBB.2024.3488281. Epub 2024 Dec 10.
Investigating the associations between circRNA and diseases is vital for comprehending the underlying mechanisms of diseases and formulating effective therapies. Computational prediction methods often rely solely on known circRNA-disease data, indirectly incorporating other biomolecules' effects by computing circRNA and disease similarities based on these molecules. However, this approach is limited, as other biomolecules also play significant roles in circRNA-disease interactions. To address this, we construct a comprehensive heterogeneous network incorporating data on human circRNAs, diseases, and other biomolecule interactions to develop a novel computational model, circ2DGNN, which is built upon a heterogeneous graph neural network. circ2DGNN directly takes heterogeneous networks as inputs and obtains the embedded representation of each node for downstream link prediction through graph representation learning. circ2DGNN employs a Transformer-like architecture, which can compute heterogeneous attention score for each edge, and perform message propagation and aggregation, using a residual connection to enhance the representation vector. It uniquely applies the same parameter matrix only to identical meta-relationships, reflecting diverse parameter spaces for different relationship types. After fine-tuning hyperparameters via five-fold cross-validation, evaluation conducted on a test dataset shows circ2DGNN outperforms existing state-of-the-art(SOTA) methods.
研究环状RNA(circRNA)与疾病之间的关联对于理解疾病的潜在机制和制定有效的治疗方法至关重要。计算预测方法通常仅依赖已知的circRNA-疾病数据,通过基于这些分子计算circRNA与疾病的相似性来间接纳入其他生物分子的影响。然而,这种方法存在局限性,因为其他生物分子在circRNA-疾病相互作用中也起着重要作用。为了解决这个问题,我们构建了一个综合的异质网络,纳入了人类circRNA、疾病和其他生物分子相互作用的数据,以开发一种新的计算模型circ2DGNN,它基于异质图神经网络构建。circ2DGNN直接将异质网络作为输入,并通过图表示学习获得每个节点的嵌入表示,用于下游链接预测。circ2DGNN采用类似Transformer的架构,该架构可以为每条边计算异质注意力分数,并执行消息传播和聚合,使用残差连接来增强表示向量。它独特地仅将相同的参数矩阵应用于相同的元关系,反映了不同关系类型的不同参数空间。通过五折交叉验证微调超参数后,在测试数据集上进行的评估表明circ2DGNN优于现有的最先进(SOTA)方法。