Liu Xinyao, Xin Junchang, Shen Qi, Huang Zhihong, Wang Zhiqiong
College of Medicine and Biological Information Engineering, Northeastern University, 110819, China.
College of Computer Science and Engineering, Northeastern University, 110819, China.
Comput Med Imaging Graph. 2025 Mar;120:102486. doi: 10.1016/j.compmedimag.2024.102486. Epub 2025 Jan 4.
With the increasing popularity of medical imaging and its expanding applications, posing significant challenges for radiologists. Radiologists need to spend substantial time and effort to review images and manually writing reports every day. To address these challenges and speed up the process of patient care, researchers have employed deep learning methods to automatically generate medical reports. In recent years, researchers have been increasingly focusing on this task and a large amount of related work has emerged. Although there have been some review articles summarizing the state of the art in this field, their discussions remain relatively limited. Therefore, this paper provides a comprehensive review of the latest advancements in automatic medical report generation, focusing on four key aspects: (1) describing the problem of automatic medical report generation, (2) introducing datasets of different modalities, (3) thoroughly analyzing existing evaluation metrics, (4) classifying existing studies into five categories: retrieval-based, domain knowledge-based, attention-based, reinforcement learning-based, large language models-based, and merged model. In addition, we point out the problems in this field and discuss the directions of future challenges. We hope that this review provides a thorough understanding of automatic medical report generation and encourages the continued development in this area.
随着医学成像的日益普及及其应用范围的不断扩大,给放射科医生带来了重大挑战。放射科医生每天需要花费大量时间和精力来查看图像并手动撰写报告。为应对这些挑战并加快患者护理流程,研究人员采用深度学习方法来自动生成医学报告。近年来,研究人员越来越关注这项任务,大量相关工作不断涌现。尽管已有一些综述文章总结了该领域的最新进展,但其讨论仍相对有限。因此,本文对自动医学报告生成的最新进展进行了全面综述,重点关注四个关键方面:(1)描述自动医学报告生成问题;(2)介绍不同模态的数据集;(3)深入分析现有评估指标;(4)将现有研究分为五类:基于检索的、基于领域知识的、基于注意力的、基于强化学习的、基于大语言模型的以及融合模型。此外,我们指出了该领域存在的问题,并讨论了未来面临挑战的方向。我们希望这篇综述能让读者对自动医学报告生成有全面的了解,并推动该领域的持续发展。