Li Bing, Xiao Xin, Zhang Chao, Xiao Ming, Zhang Le
College of Computer Science, Sichuan University, Chengdu, 610000, China.
Department of Thoracic Surgery, West China Hospital of Sichuan University, Chengdu, 610000, China.
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf379.
Studies on pan-cancer related genes play important roles in cancer research and precision therapy. With the richness of research data and the development of neural networks, several successful methods that take advantage of multiomics data, protein interaction networks, and graph neural networks to predict cancer genes have emerged. However, these methods also have several problems, such as ignoring potentially useful biological data and providing limited representations of higher-order information.
In this work, we propose a pan-cancer related gene predictive model, the DGHNN, which takes biological pathways into consideration, applies a deep graph and hypergraph neural network to encode the higher-order information in the protein interaction network and biological pathway, introduces skip residual connections into the deep graph and hypergraph neural network to avoid problems with training the deep neural network, and finally uses a feature tokenizer and transformer for classification. The experimental results show that the DGHNN outperforms other methods and achieves state-of-the-art model performance for pan-cancer related gene prediction.
The DGHNN is available at https://github.com/skytea/DGHNN.
泛癌相关基因的研究在癌症研究和精准治疗中发挥着重要作用。随着研究数据的丰富以及神经网络的发展,出现了几种利用多组学数据、蛋白质相互作用网络和图神经网络来预测癌症基因的成功方法。然而,这些方法也存在一些问题,例如忽略了潜在有用的生物学数据,并且对高阶信息的表示有限。
在这项工作中,我们提出了一种泛癌相关基因预测模型DGHNN,该模型考虑了生物途径,应用深度图和超图神经网络对蛋白质相互作用网络和生物途径中的高阶信息进行编码,在深度图和超图神经网络中引入跳跃残差连接以避免深度神经网络训练中的问题,最后使用特征分词器和变换器进行分类。实验结果表明,DGHNN优于其他方法,并在泛癌相关基因预测方面达到了当前最优的模型性能。