Zhao Huiyu, Miao Chuang, Zhu Yidi, Shu Yijun, Wu Xiangsong, Yin Ziming, Deng Xiao, Gong Wei, Yang Ziyi, Zou Weiwen
State Key Laboratory of Advanced Optical Communication Systems and Networks, Intelligent Microwave Lightwave Integration Innovation Center (imLic), Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai, 200240, China.
Department of General Surgery, Affiliated to Shanghai, Xinhua Hospital, Jiao Tong University School of Medicine, Shanghai, 200092, China.
BMC Cancer. 2025 Jul 16;25(1):1178. doi: 10.1186/s12885-025-14462-9.
The accurate early-stage diagnosis of gallbladder cancer (GBC) is regarded as one of the major challenges in the field of oncology. However, few studies have focused on the comprehensive classification of GBC based on multiple modalities. This study aims to develop a comprehensive diagnostic framework for GBC based on both imaging and non-imaging medical data.
This retrospective study reviewed 298 clinical patients with gallbladder disease or volunteers from two devices. A novel end-to-end interpretable diagnostic framework for GBC is proposed to handle multiple medical modalities, including CT imaging, demographics, tumor markers, coagulation function tests, and routine blood tests. To achieve better feature extraction and fusion of the imaging modality, a novel global-hybrid-local network, namely GHL-Net, has also been developed. The ensemble learning strategy is employed to fuse multi-modality data and obtain the final classification result. In addition, two interpretable methods are applied to help clinicians understand the model-based decisions. Model performance was evaluated through accuracy, precision, specificity, sensitivity, F1-score, area under the curve (AUC), and matthews correlation coefficient (MCC).
In both binary and multi-class classification scenarios, the proposed method showed better performance compared to other comparison methods in both datasets. Especially in the binary classification scenario, the proposed method achieved the highest accuracy, sensitivity, specificity, precision, F1-score, ROC-AUC, PR-AUC, and MCC of 95.24%, 93.55%, 96.87%, 96.67%, 95.08%, 0.9591, 0.9636, and 0.9051, respectively. The visualization results obtained based on the interpretable methods also demonstrated a high clinical relevance of the intermediate decision-making processes. Ablation studies then provided an in-depth understanding of our methodology.
The machine learning-based framework can effectively improve the accuracy of GBC diagnosis and is expected to have a more significant impact in other cancer diagnosis scenarios.
胆囊癌(GBC)的准确早期诊断被视为肿瘤学领域的主要挑战之一。然而,很少有研究专注于基于多种模式对GBC进行综合分类。本研究旨在基于影像学和非影像学医学数据开发一个用于GBC的综合诊断框架。
这项回顾性研究回顾了298例患有胆囊疾病的临床患者或来自两台设备的志愿者。提出了一种用于GBC的新型端到端可解释诊断框架,以处理多种医学模式,包括CT成像、人口统计学、肿瘤标志物、凝血功能测试和血常规检查。为了实现更好的成像模式特征提取和融合,还开发了一种新型的全局-混合-局部网络,即GHL-Net。采用集成学习策略融合多模式数据并获得最终分类结果。此外,应用了两种可解释方法来帮助临床医生理解基于模型的决策。通过准确率、精确率、特异性、灵敏度、F1分数、曲线下面积(AUC)和马修斯相关系数(MCC)评估模型性能。
在二元和多类分类场景中,与其他比较方法相比,所提出的方法在两个数据集中均表现出更好的性能。特别是在二元分类场景中,所提出的方法分别实现了95.24%、93.55%、96.87%、96.67%、95.08%、0.9591、0.9636和0.9051的最高准确率、灵敏度、特异性、精确率、F1分数、ROC-AUC、PR-AUC和MCC。基于可解释方法获得的可视化结果也证明了中间决策过程具有高度的临床相关性。消融研究随后对我们的方法有了更深入的理解。
基于机器学习的框架可以有效提高GBC诊断的准确性,并有望在其他癌症诊断场景中产生更显著的影响。