Suppr超能文献

一种基于机器学习的端到端可解释框架,用于利用多模态医学数据对胆囊癌进行早期诊断。

An end-to-end interpretable machine-learning-based framework for early-stage diagnosis of gallbladder cancer using multi-modality medical data.

作者信息

Zhao Huiyu, Miao Chuang, Zhu Yidi, Shu Yijun, Wu Xiangsong, Yin Ziming, Deng Xiao, Gong Wei, Yang Ziyi, Zou Weiwen

机构信息

State Key Laboratory of Advanced Optical Communication Systems and Networks, Intelligent Microwave Lightwave Integration Innovation Center (imLic), Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai, 200240, China.

Department of General Surgery, Affiliated to Shanghai, Xinhua Hospital, Jiao Tong University School of Medicine, Shanghai, 200092, China.

出版信息

BMC Cancer. 2025 Jul 16;25(1):1178. doi: 10.1186/s12885-025-14462-9.

Abstract

BACKGROUND

The accurate early-stage diagnosis of gallbladder cancer (GBC) is regarded as one of the major challenges in the field of oncology. However, few studies have focused on the comprehensive classification of GBC based on multiple modalities. This study aims to develop a comprehensive diagnostic framework for GBC based on both imaging and non-imaging medical data.

METHODS

This retrospective study reviewed 298 clinical patients with gallbladder disease or volunteers from two devices. A novel end-to-end interpretable diagnostic framework for GBC is proposed to handle multiple medical modalities, including CT imaging, demographics, tumor markers, coagulation function tests, and routine blood tests. To achieve better feature extraction and fusion of the imaging modality, a novel global-hybrid-local network, namely GHL-Net, has also been developed. The ensemble learning strategy is employed to fuse multi-modality data and obtain the final classification result. In addition, two interpretable methods are applied to help clinicians understand the model-based decisions. Model performance was evaluated through accuracy, precision, specificity, sensitivity, F1-score, area under the curve (AUC), and matthews correlation coefficient (MCC).

RESULTS

In both binary and multi-class classification scenarios, the proposed method showed better performance compared to other comparison methods in both datasets. Especially in the binary classification scenario, the proposed method achieved the highest accuracy, sensitivity, specificity, precision, F1-score, ROC-AUC, PR-AUC, and MCC of 95.24%, 93.55%, 96.87%, 96.67%, 95.08%, 0.9591, 0.9636, and 0.9051, respectively. The visualization results obtained based on the interpretable methods also demonstrated a high clinical relevance of the intermediate decision-making processes. Ablation studies then provided an in-depth understanding of our methodology.

CONCLUSION

The machine learning-based framework can effectively improve the accuracy of GBC diagnosis and is expected to have a more significant impact in other cancer diagnosis scenarios.

摘要

背景

胆囊癌(GBC)的准确早期诊断被视为肿瘤学领域的主要挑战之一。然而,很少有研究专注于基于多种模式对GBC进行综合分类。本研究旨在基于影像学和非影像学医学数据开发一个用于GBC的综合诊断框架。

方法

这项回顾性研究回顾了298例患有胆囊疾病的临床患者或来自两台设备的志愿者。提出了一种用于GBC的新型端到端可解释诊断框架,以处理多种医学模式,包括CT成像、人口统计学、肿瘤标志物、凝血功能测试和血常规检查。为了实现更好的成像模式特征提取和融合,还开发了一种新型的全局-混合-局部网络,即GHL-Net。采用集成学习策略融合多模式数据并获得最终分类结果。此外,应用了两种可解释方法来帮助临床医生理解基于模型的决策。通过准确率、精确率、特异性、灵敏度、F1分数、曲线下面积(AUC)和马修斯相关系数(MCC)评估模型性能。

结果

在二元和多类分类场景中,与其他比较方法相比,所提出的方法在两个数据集中均表现出更好的性能。特别是在二元分类场景中,所提出的方法分别实现了95.24%、93.55%、96.87%、96.67%、95.08%、0.9591、0.9636和0.9051的最高准确率、灵敏度、特异性、精确率、F1分数、ROC-AUC、PR-AUC和MCC。基于可解释方法获得的可视化结果也证明了中间决策过程具有高度的临床相关性。消融研究随后对我们的方法有了更深入的理解。

结论

基于机器学习的框架可以有效提高GBC诊断的准确性,并有望在其他癌症诊断场景中产生更显著的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9df0/12265321/e00c5d8ff7a2/12885_2025_14462_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验