基于问题的医学影像视觉问答模型。

A Question-Centric Model for Visual Question Answering in Medical Imaging.

出版信息

IEEE Trans Med Imaging. 2020 Sep;39(9):2856-2868. doi: 10.1109/TMI.2020.2978284. Epub 2020 Mar 4.

DOI:10.1109/TMI.2020.2978284

Abstract

Deep learning methods have proven extremely effective at performing a variety of medical image analysis tasks. With their potential use in clinical routine, their lack of transparency has however been one of their few weak points, raising concerns regarding their behavior and failure modes. While most research to infer model behavior has focused on indirect strategies that estimate prediction uncertainties and visualize model support in the input image space, the ability to explicitly query a prediction model regarding its image content offers a more direct way to determine the behavior of trained models. To this end, we present a novel Visual Question Answering approach that allows an image to be queried by means of a written question. Experiments on a variety of medical and natural image datasets show that by fusing image and question features in a novel way, the proposed approach achieves an equal or higher accuracy compared to current methods.

摘要

深度学习方法在执行各种医学图像分析任务方面已被证明非常有效。随着它们在临床常规中的潜在应用，其缺乏透明度已成为它们为数不多的弱点之一，这引发了人们对其行为和故障模式的担忧。虽然大多数研究推断模型行为的重点都集中在间接策略上，这些策略可以估计预测不确定性并在输入图像空间中可视化模型支持，但能够针对图像内容明确查询预测模型提供了一种更直接的方法来确定训练模型的行为。为此，我们提出了一种新颖的视觉问答方法，通过书面问题来查询图像。在各种医学和自然图像数据集上的实验表明，通过以新颖的方式融合图像和问题特征，所提出的方法与当前方法相比实现了相等或更高的准确性。

相似文献

A Question-Centric Model for Visual Question Answering in Medical Imaging.基于问题的医学影像视觉问答模型。

IEEE Trans Med Imaging. 2020 Sep;39(9):2856-2868. doi: 10.1109/TMI.2020.2978284. Epub 2020 Mar 4.

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge.基于属性和外部知识的图像字幕和视觉问答。

IEEE Trans Pattern Anal Mach Intell. 2018 Jun;40(6):1367-1381. doi: 10.1109/TPAMI.2017.2708709. Epub 2017 May 26.

Multitask Learning for Visual Question Answering.用于视觉问答的多任务学习

IEEE Trans Neural Netw Learn Syst. 2023 Mar;34(3):1380-1394. doi: 10.1109/TNNLS.2021.3105284. Epub 2023 Feb 28.

Anomaly Matters: An Anomaly-Oriented Model for Medical Visual Question Answering.异常情况至关重要：一种面向异常情况的医学视觉问答模型。

IEEE Trans Med Imaging. 2022 Nov;41(11):3385-3397. doi: 10.1109/TMI.2022.3185113. Epub 2022 Oct 27.

Multi-Modal Explicit Sparse Attention Networks for Visual Question Answering.多模态显式稀疏注意力网络的视觉问答。

Sensors (Basel). 2020 Nov 26;20(23):6758. doi: 10.3390/s20236758.

A framework for ontology-based question answering with application to parasite immunology.一个基于本体的问答框架及其在寄生虫免疫学中的应用。

J Biomed Semantics. 2015 Jul 17;6:31. doi: 10.1186/s13326-015-0029-x. eCollection 2015.

External features enriched model for biomedical question answering.生物医学问答的外部特征丰富模型。

BMC Bioinformatics. 2021 May 26;22(1):272. doi: 10.1186/s12859-021-04176-7.

Medical image classification using synergic deep learning.基于协同深度学习的医学图像分类。

Med Image Anal. 2019 May;54:10-19. doi: 10.1016/j.media.2019.02.010. Epub 2019 Feb 18.

A boosting framework for visuality-preserving distance metric learning and its application to medical image retrieval.一种保持视觉保真度的距离度量学习的提升框架及其在医学图像检索中的应用。

IEEE Trans Pattern Anal Mach Intell. 2010 Jan;32(1):30-44. doi: 10.1109/TPAMI.2008.273.

Multi-View Visual Question Answering with Active Viewpoint Selection.多视图视觉问答与主动视点选择。

Sensors (Basel). 2020 Apr 17;20(8):2281. doi: 10.3390/s20082281.

引用本文的文献

Visual explainable artificial intelligence for graph-based visual question answering and scene graph curation.用于基于图的视觉问答和场景图管理的可视化可解释人工智能。

Vis Comput Ind Biomed Art. 2025 Apr 7;8(1):9. doi: 10.1186/s42492-025-00185-y.

Vision-Language Model for Visual Question Answering in Medical Imagery.用于医学图像视觉问答的视觉语言模型。

Bioengineering (Basel). 2023 Mar 20;10(3):380. doi: 10.3390/bioengineering10030380.

MedFuseNet: An attention-based multimodal deep learning model for visual question answering in the medical domain.MedFuseNet：一种基于注意力的多模态深度学习模型，用于医学领域的视觉问答。

Sci Rep. 2021 Oct 6;11(1):19826. doi: 10.1038/s41598-021-98390-1.

Application of Dynamic Fragmentation Methods in Multimedia Databases: A Review.动态碎片化方法在多媒体数据库中的应用：综述

Entropy (Basel). 2020 Nov 30;22(12):1352. doi: 10.3390/e22121352.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于问题的医学影像视觉问答模型。

A Question-Centric Model for Visual Question Answering in Medical Imaging.

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献