• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于可解释医学视觉问答的反事实因果效应干预

Counterfactual Causal-Effect Intervention for Interpretable Medical Visual Question Answering.

作者信息

Cai Linqin, Fang Haodu, Xu Nuoying, Ren Bo

出版信息

IEEE Trans Med Imaging. 2024 Dec;43(12):4430-4441. doi: 10.1109/TMI.2024.3425533. Epub 2024 Dec 2.

DOI:10.1109/TMI.2024.3425533
PMID:38980786
Abstract

Medical Visual Question Answering (VQA-Med) is a challenging task that involves answering clinical questions related to medical images. However, most current VQA-Med methods ignore the causal correlation between specific lesion or abnormality features and answers, while also failing to provide accurate explanations for their decisions. To explore the interpretability of VQA-Med, this paper proposes a novel CCIS-MVQA model for VQA-Med based on a counterfactual causal-effect intervention strategy. This model consists of the modified ResNet for image feature extraction, a GloVe decoder for question feature extraction, a bilinear attention network for vision and language feature fusion, and an interpretability generator for producing the interpretability and prediction results. The proposed CCIS-MVQA introduces a layer-wise relevance propagation method to automatically generate counterfactual samples. Additionally, CCIS-MVQA applies counterfactual causal reasoning throughout the training phase to enhance interpretability and generalization. Extensive experiments on three benchmark datasets show that the proposed CCIS-MVQA model outperforms the state-of-the-art methods. Enough visualization results are produced to analyze the interpretability and performance of CCIS-MVQA.

摘要

医学视觉问答(VQA-Med)是一项具有挑战性的任务,涉及回答与医学图像相关的临床问题。然而,当前大多数VQA-Med方法都忽略了特定病变或异常特征与答案之间的因果关系,同时也未能为其决策提供准确的解释。为了探索VQA-Med的可解释性,本文基于反事实因果效应干预策略,提出了一种用于VQA-Med的新型CCIS-MVQA模型。该模型由用于图像特征提取的改进ResNet、用于问题特征提取的GloVe解码器、用于视觉和语言特征融合的双线性注意力网络以及用于生成可解释性和预测结果的可解释性生成器组成。所提出的CCIS-MVQA引入了一种逐层相关性传播方法来自动生成反事实样本。此外,CCIS-MVQA在整个训练阶段应用反事实因果推理,以增强可解释性和泛化能力。在三个基准数据集上进行的大量实验表明,所提出的CCIS-MVQA模型优于现有最先进的方法。生成了足够的可视化结果来分析CCIS-MVQA的可解释性和性能。

相似文献

1
Counterfactual Causal-Effect Intervention for Interpretable Medical Visual Question Answering.用于可解释医学视觉问答的反事实因果效应干预
IEEE Trans Med Imaging. 2024 Dec;43(12):4430-4441. doi: 10.1109/TMI.2024.3425533. Epub 2024 Dec 2.
2
[Image-aware generative medical visual question answering based on image caption prompts].基于图像字幕提示的图像感知生成式医学视觉问答
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2025 Jun 25;42(3):560-566. doi: 10.7507/1001-5515.202412040.
3
iACP-DPNet: a dual-pooling causal dilated convolutional network for interpretable anticancer peptide identification.iACP-DPNet:一种用于可解释抗癌肽识别的双池因果扩张卷积网络。
Funct Integr Genomics. 2025 Jul 4;25(1):147. doi: 10.1007/s10142-025-01641-x.
4
Prophet: Prompting Large Language Models With Complementary Answer Heuristics for Knowledge-Based Visual Question Answering.Prophet:通过互补答案启发式方法提示大语言模型以进行基于知识的视觉问答
IEEE Trans Pattern Anal Mach Intell. 2025 Aug;47(8):6797-6808. doi: 10.1109/TPAMI.2025.3562422.
5
BPI-MVQA: a bi-branch model for medical visual question answering.BPI-MVQA:一种用于医学视觉问答的双分支模型。
BMC Med Imaging. 2022 Apr 29;22(1):79. doi: 10.1186/s12880-022-00800-x.
6
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
7
LGF-Net: A multi-scale feature fusion network for thyroid nodule ultrasound image classification.LGF-Net:一种用于甲状腺结节超声图像分类的多尺度特征融合网络。
J Appl Clin Med Phys. 2025 Aug;26(8):e70149. doi: 10.1002/acm2.70149.
8
Short-Term Memory Impairment短期记忆障碍
9
Sexual Harassment and Prevention Training性骚扰与预防培训
10
TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.TLTNet:一种新颖的跨尺度级联分层Transformer 网络,用于增强视网膜血管分割。
Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.

引用本文的文献

1
A novel UNet-SegNet and vision transformer architectures for efficient segmentation and classification in medical imaging.一种用于医学成像中高效分割和分类的新型UNet-SegNet和视觉Transformer架构。
Phys Eng Sci Med. 2025 Jul 8. doi: 10.1007/s13246-025-01564-8.
2
Optimizing the early diagnosis of neurological disorders through the application of machine learning for predictive analytics in medical imaging.通过应用机器学习进行医学成像预测分析来优化神经系统疾病的早期诊断。
Sci Rep. 2025 Jul 2;15(1):22488. doi: 10.1038/s41598-025-05888-z.
3
Research on multi-algorithm and explainable AI techniques for predictive modeling of acute spinal cord injury using multimodal data.
使用多模态数据对急性脊髓损伤进行预测建模的多算法与可解释人工智能技术研究。
Sci Rep. 2025 May 29;15(1):18832. doi: 10.1038/s41598-025-93006-4.
4
Diabetic retinopathy detection based on mobile maxout network and weber local descriptor feature selection using retinal fundus image.基于移动最大池化网络和视网膜眼底图像的韦伯局部描述符特征选择的糖尿病视网膜病变检测
Sci Rep. 2025 Apr 22;15(1):13858. doi: 10.1038/s41598-025-97675-z.
5
New Machine Learning Method for Medical Image and Microarray Data Analysis for Heart Disease Classification.用于心脏病分类的医学图像和微阵列数据分析的新型机器学习方法。
J Imaging Inform Med. 2025 Apr 1. doi: 10.1007/s10278-025-01492-9.
6
Explainable AI in medical imaging: an interpretable and collaborative federated learning model for brain tumor classification.医学成像中的可解释人工智能:一种用于脑肿瘤分类的可解释且协作的联邦学习模型。
Front Oncol. 2025 Feb 27;15:1535478. doi: 10.3389/fonc.2025.1535478. eCollection 2025.
7
Precision and efficiency in skin cancer segmentation through a dual encoder deep learning model.通过双编码器深度学习模型实现皮肤癌分割的精准性与效率
Sci Rep. 2025 Feb 9;15(1):4815. doi: 10.1038/s41598-025-88753-3.
8
Quantum-inspired seagull optimised deep belief network approach for cardiovascular disease prediction.用于心血管疾病预测的量子启发式海鸥优化深度信念网络方法
PeerJ Comput Sci. 2024 Dec 13;10:e2498. doi: 10.7717/peerj-cs.2498. eCollection 2024.
9
Improved lung nodule segmentation with a squeeze excitation dilated attention based residual UNet.基于挤压激励扩张注意力的残差U-Net改进肺结节分割
Sci Rep. 2025 Jan 30;15(1):3770. doi: 10.1038/s41598-025-85199-5.
10
Enhancing image-based diagnosis of gastrointestinal tract diseases through deep learning with EfficientNet and advanced data augmentation techniques.通过使用 EfficientNet 和先进的数据增强技术进行深度学习,提高基于图像的胃肠道疾病诊断能力。
BMC Med Imaging. 2024 Nov 12;24(1):306. doi: 10.1186/s12880-024-01479-y.