文献检索，用中文搜 PubMed

应用&插件

Zotero 插件浏览器插件 Mac 客户端 Windows 客户端微信小程序

定价

高级版会员购买积分包购买API积分包

服务

文献检索文档翻译深度研究 API 文档 MCP 服务

关于我们

关于 Suppr 公司介绍联系我们用户协议隐私条款

关注我们

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

粤ICP备2023148730 号-1Suppr @ 2026

Large vision language models (LVLMs) have achieved superior performance on natural image and text tasks, inspiring extensive fine-tuning research. However, their robustness against hallucination in clinical contexts remains understudied. We propose the Medical Visual Hallucination Test (MedVH), a novel evaluation framework assessing hallucination tendencies in both medical-specific and general-purpose LVLMs. MedVH encompasses six tasks targeting medical hallucinations, including two traditional tasks and four novel tasks formatted as multi-choice visual question answering and long response generation. Our extensive experiments with six evaluation metrics reveal that medical LVLMs, despite promising performance on standard medical tasks, are particularly susceptible to hallucinations-often more so than general models. This raises significant concerns about domain-specific model reliability. For real-world applications, medical LVLMs must accurately integrate medical knowledge while maintaining robust reasoning to prevent hallucination. We explore mitigation methods without model-specific fine-tuning, including prompt engineering and collaboration between general and domain-specific models. Our work provides a foundation for future evaluation studies. The dataset is available at PhysioNet: https://physionet.org/content/medvh.

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

MedVH：迈向医学背景下大型视觉语言模型幻觉的系统评估

MedVH: Toward Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

MedVH：迈向医学背景下大型视觉语言模型幻觉的系统评估

MedVH: Toward Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context.

作者信息

机构信息

出版信息

相似文献

本文引用的文献