Zhang Qingsong, Liu Fei, Lai Xin
School of Software Engineering, South China University of Technology, Guangzhou, 510006, China.
Systems and Network Medicine Lab, Biomedicine Unit, Faculty of Medicine and Health Technology, Tampere University, Tampere, 33520, Finland.
Bioinformatics. 2025 Sep 1;41(9). doi: 10.1093/bioinformatics/btaf444.
MOTIVATION: Accurate tumor subtype diagnosis is crucial for precision oncology, yet current methodologies face significant challenges. These include balancing model accuracy with interpretability and the high costs of generating multi-omics data in clinical settings. Moreover, there is a lack of validated models capable of classifying hierarchical tumor subtypes across a comprehensive pan-cancer cohort. RESULTS: We present a graph neural network, HallmarkGraph, the first biologically informed model developed to classify hierarchical tumor subtypes in human cancer. Inspired by cancer hallmarks, the model's architecture integrates transcriptome profiles and gene regulatory interactions to perform multi-label classification. We evaluate the model on a comprehensive pan-cancer cohort comprising 11 476 samples from 26 primary cancers with 405 subtypes up to eight levels. The model demonstrates exceptional performance, achieving 5-fold cross-validation accuracy between 85% and 99% for tumor subtypes labeled with increasing details of genomic information. It also shows good generalizability on a validation dataset of 887 samples, assessed using three metrics that consider tumor subtypes at individual, combined, and sample levels. Benchmarking and ablation experiments show that hallmark-based embeddings slightly influence model performance, while the integrated multilayer perceptron plays a significant role in determining classifier accuracy. Additionally, we use the SHAP method to link cancer hallmarks with genes, identifying key features that influence model decisions. Our findings present a biologically informed machine learning framework capable of tracking tumor transcriptomic trajectories and distinguishing inter- and intra-tumor heterogeneity in pan-cancer. This approach holds promise for enhancing cancer diagnostics. AVAILABILITY AND IMPLEMENTATION: HallmarkGraph is accessible at https://github.com/laixn/HallmarkGraph.
Bioinformatics. 2025-8-2
Comput Methods Programs Biomed. 2025-6-21
IEEE/ACM Trans Comput Biol Bioinform. 2024
J Cancer Res Clin Oncol. 2021-7
IEEE/ACM Trans Comput Biol Bioinform. 2021-8-20
Front Artif Intell. 2024-7-25
BMC Bioinformatics. 2024-1-15
Cancers (Basel). 2023-12-15
Nucleic Acids Res. 2024-1-5
Nucleic Acids Res. 2024-1-5