用于可解释医学视觉问答的反事实因果效应干预

Counterfactual Causal-Effect Intervention for Interpretable Medical Visual Question Answering.

作者信息

Cai Linqin, Fang Haodu, Xu Nuoying, Ren Bo

出版信息

IEEE Trans Med Imaging. 2024 Dec;43(12):4430-4441. doi: 10.1109/TMI.2024.3425533. Epub 2024 Dec 2.

DOI:10.1109/TMI.2024.3425533

Abstract

Medical Visual Question Answering (VQA-Med) is a challenging task that involves answering clinical questions related to medical images. However, most current VQA-Med methods ignore the causal correlation between specific lesion or abnormality features and answers, while also failing to provide accurate explanations for their decisions. To explore the interpretability of VQA-Med, this paper proposes a novel CCIS-MVQA model for VQA-Med based on a counterfactual causal-effect intervention strategy. This model consists of the modified ResNet for image feature extraction, a GloVe decoder for question feature extraction, a bilinear attention network for vision and language feature fusion, and an interpretability generator for producing the interpretability and prediction results. The proposed CCIS-MVQA introduces a layer-wise relevance propagation method to automatically generate counterfactual samples. Additionally, CCIS-MVQA applies counterfactual causal reasoning throughout the training phase to enhance interpretability and generalization. Extensive experiments on three benchmark datasets show that the proposed CCIS-MVQA model outperforms the state-of-the-art methods. Enough visualization results are produced to analyze the interpretability and performance of CCIS-MVQA.

摘要

医学视觉问答（VQA-Med）是一项具有挑战性的任务，涉及回答与医学图像相关的临床问题。然而，当前大多数VQA-Med方法都忽略了特定病变或异常特征与答案之间的因果关系，同时也未能为其决策提供准确的解释。为了探索VQA-Med的可解释性，本文基于反事实因果效应干预策略，提出了一种用于VQA-Med的新型CCIS-MVQA模型。该模型由用于图像特征提取的改进ResNet、用于问题特征提取的GloVe解码器、用于视觉和语言特征融合的双线性注意力网络以及用于生成可解释性和预测结果的可解释性生成器组成。所提出的CCIS-MVQA引入了一种逐层相关性传播方法来自动生成反事实样本。此外，CCIS-MVQA在整个训练阶段应用反事实因果推理，以增强可解释性和泛化能力。在三个基准数据集上进行的大量实验表明，所提出的CCIS-MVQA模型优于现有最先进的方法。生成了足够的可视化结果来分析CCIS-MVQA的可解释性和性能。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

用于可解释医学视觉问答的反事实因果效应干预

Counterfactual Causal-Effect Intervention for Interpretable Medical Visual Question Answering.

作者信息

出版信息

相似文献

引用本文的文献

用于可解释医学视觉问答的反事实因果效应干预

Counterfactual Causal-Effect Intervention for Interpretable Medical Visual Question Answering.

作者信息

出版信息

相似文献

引用本文的文献