Sun Zhaoyi, Yim Wen-Wai, Uzuner Özlem, Xia Fei, Yetisgen Meliha
Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, 98195, USA.
Health AI, Microsoft, Redmond, WA, 98052, USA.
J Biomed Inform. 2025 Jul 22:104866. doi: 10.1016/j.jbi.2025.104866.
This review aims to explore the potential and challenges of using Natural Language Processing (NLP) to detect, correct, and mitigate medically inaccurate information, including errors, misinformation, and hallucination. By unifying these concepts, the review emphasizes their shared methodological foundations and their distinct implications for healthcare. Our goal is to advance patient safety, improve public health communication, and support the development of more reliable and transparent NLP applications in healthcare.
We conducted a scoping review following PRISMA-ScR guidelines, covering studies from 2020 to 2024 across five databases. Studies were selected based on their use of NLP to address medically inaccurate information and were categorized by topic, tasks, document types, datasets, models, and evaluation metrics.
NLP has shown potential in addressing medically inaccurate information on the following tasks: (1) error detection, (2) error correction, (3) misinformation detection, (4) misinformation correction, (5) hallucination detection, and (6) hallucination mitigation. However, challenges remain with data privacy, context dependency, and evaluation standards.
This review highlights the advancements in applying NLP to tackle medically inaccurate information while underscoring the need to address persistent challenges. Future efforts should focus on developing real-world datasets, refining contextual methods, and improving hallucination management to ensure reliable and transparent healthcare applications.