Zhang Hao, Lin Chaohuan, Chen Ying'ao, Shen Xianrui, Wang Ruizhe, Chen Yiqi, Lyu Jie
Postgraduate Training Base Alliance of Wenzhou Medical University, Wenzhou, Zhejiang, China.
Wenzhou Key Laboratory of Biophysics, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang, China.
J Cell Mol Med. 2025 Jan;29(1):e70351. doi: 10.1111/jcmm.70351.
Cancer is a complex disease driven by mutations in the genes that play critical roles in cellular processes. The identification of cancer driver genes is crucial for understanding tumorigenesis, developing targeted therapies and identifying rational drug targets. Experimental identification and validation of cancer driver genes are time-consuming and costly. Studies have demonstrated that interactions among genes are associated with similar phenotypes. Therefore, identifying cancer driver genes using molecular network-based approaches is necessary. Molecular network-based random walk-based approaches, which integrate mutation data with protein-protein interaction networks, have been widely employed in predicting cancer driver genes and demonstrated robust predictive potential. However, recent advancements in deep learning, particularly graph-based models, have provided novel opportunities for enhancing the prediction of cancer driver genes. This review aimed to comprehensively explore how machine learning methodologies, particularly network propagation, graph neural networks, autoencoders, graph embeddings, and attention mechanisms, improve the scalability and interpretability of molecular network-based cancer gene prediction.
癌症是一种复杂的疾病,由在细胞过程中起关键作用的基因突变驱动。癌症驱动基因的识别对于理解肿瘤发生、开发靶向治疗方法以及确定合理的药物靶点至关重要。癌症驱动基因的实验识别和验证既耗时又昂贵。研究表明,基因之间的相互作用与相似的表型相关。因此,有必要使用基于分子网络的方法来识别癌症驱动基因。基于分子网络的基于随机游走的方法将突变数据与蛋白质-蛋白质相互作用网络相结合,已被广泛用于预测癌症驱动基因,并显示出强大的预测潜力。然而,深度学习的最新进展,特别是基于图的模型,为增强癌症驱动基因的预测提供了新的机会。本综述旨在全面探讨机器学习方法,特别是网络传播、图神经网络、自动编码器、图嵌入和注意力机制,如何提高基于分子网络的癌症基因预测的可扩展性和可解释性。