• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从视觉到文本:医学诊断和放射科报告生成中自然图像字幕的全面综述。

From vision to text: A comprehensive review of natural image captioning in medical diagnosis and radiology report generation.

机构信息

ETSI Informáticos, Universidad Politécnica de Madrid, 28660 Boadilla del Monte, Madrid, Spain.

Ontology Engineering Group, Departamento de Inteligencia Artificial, ETSI Informáticos, Universidad Politécnica de Madrid, 28660 Boadilla del Monte, Madrid, Spain; Departamento de Sistemas Informáticos, ETSI Sistemas Informáticos, Universidad Politécnica de Madrid, 28031 Madrid, Spain.

出版信息

Med Image Anal. 2024 Oct;97:103264. doi: 10.1016/j.media.2024.103264. Epub 2024 Jul 8.

DOI:10.1016/j.media.2024.103264
PMID:39013207
Abstract

Natural Image Captioning (NIC) is an interdisciplinary research area that lies within the intersection of Computer Vision (CV) and Natural Language Processing (NLP). Several works have been presented on the subject, ranging from the early template-based approaches to the more recent deep learning-based methods. This paper conducts a survey in the area of NIC, especially focusing on its applications for Medical Image Captioning (MIC) and Diagnostic Captioning (DC) in the field of radiology. A review of the state-of-the-art is conducted summarizing key research works in NIC and DC to provide a wide overview on the subject. These works include existing NIC and MIC models, datasets, evaluation metrics, and previous reviews in the specialized literature. The revised work is thoroughly analyzed and discussed, highlighting the limitations of existing approaches and their potential implications in real clinical practice. Similarly, future potential research lines are outlined on the basis of the detected limitations.

摘要

自然图像字幕生成(NIC)是计算机视觉(CV)和自然语言处理(NLP)交叉领域的一个跨学科研究领域。已经有一些关于这个主题的工作,从早期的基于模板的方法到最近的基于深度学习的方法都有涉及。本文对 NIC 领域进行了调查,特别是侧重于其在放射学领域的医学图像字幕生成(MIC)和诊断字幕生成(DC)中的应用。对该领域的最新技术进行了综述,总结了 NIC 和 DC 中的关键研究工作,提供了该主题的广泛概述。这些工作包括现有的 NIC 和 MIC 模型、数据集、评估指标以及专业文献中的以前的综述。对修改后的工作进行了深入分析和讨论,强调了现有方法的局限性及其在实际临床实践中的潜在影响。同样,根据检测到的局限性,概述了未来潜在的研究方向。

相似文献

1
From vision to text: A comprehensive review of natural image captioning in medical diagnosis and radiology report generation.从视觉到文本:医学诊断和放射科报告生成中自然图像字幕的全面综述。
Med Image Anal. 2024 Oct;97:103264. doi: 10.1016/j.media.2024.103264. Epub 2024 Jul 8.
2
From Show to Tell: A Survey on Deep Learning-Based Image Captioning.从展示到讲述:基于深度学习的图像字幕研究综述
IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):539-559. doi: 10.1109/TPAMI.2022.3148210. Epub 2022 Dec 5.
3
Automatic captioning for medical imaging (MIC): a rapid review of literature.医学成像自动字幕(MIC):文献快速综述
Artif Intell Rev. 2023;56(5):4019-4076. doi: 10.1007/s10462-022-10270-w. Epub 2022 Sep 17.
4
Chinese Image Caption Generation via Visual Attention and Topic Modeling.基于视觉注意和主题建模的中文图像字幕生成。
IEEE Trans Cybern. 2022 Feb;52(2):1247-1257. doi: 10.1109/TCYB.2020.2997034. Epub 2022 Feb 16.
5
Arabic Captioning for Images of Clothing Using Deep Learning.基于深度学习的服装图像阿拉伯语字幕生成。
Sensors (Basel). 2023 Apr 7;23(8):3783. doi: 10.3390/s23083783.
6
Deep learning in generating radiology reports: A survey.深度学习在生成放射学报告中的应用:综述。
Artif Intell Med. 2020 Jun;106:101878. doi: 10.1016/j.artmed.2020.101878. Epub 2020 May 15.
7
A Survey on Learning Objects' Relationship for Image Captioning.面向图像字幕的学习对象关系调查。
Comput Intell Neurosci. 2023 May 29;2023:8600853. doi: 10.1155/2023/8600853. eCollection 2023.
8
Review of Natural Language Processing in Radiology.医学影像学中的自然语言处理综述
Neuroimaging Clin N Am. 2020 Nov;30(4):447-458. doi: 10.1016/j.nic.2020.08.001.
9
A survey on automatic generation of medical imaging reports based on deep learning.基于深度学习的医学影像报告自动生成研究综述。
Biomed Eng Online. 2023 May 18;22(1):48. doi: 10.1186/s12938-023-01113-y.
10
Recent progress in transformer-based medical image analysis.基于变压器的医学图像分析的最新进展。
Comput Biol Med. 2023 Sep;164:107268. doi: 10.1016/j.compbiomed.2023.107268. Epub 2023 Jul 20.

引用本文的文献

1
A Clinically-Informed Framework for Evaluating Vision-Language Models in Radiology Report Generation: Taxonomy of Errors and Risk-Aware Metric.一种用于评估放射学报告生成中视觉语言模型的临床信息框架:错误分类与风险感知指标
medRxiv. 2025 Jul 14:2025.07.13.25331222. doi: 10.1101/2025.07.13.25331222.
2
Keyword-based AI assistance in the generation of radiology reports: A pilot study.基于关键词的人工智能辅助生成放射学报告:一项试点研究。
NPJ Digit Med. 2025 Aug 1;8(1):490. doi: 10.1038/s41746-025-01889-4.
3
Advancements in Radiology Report Generation: A Comprehensive Analysis.
放射学报告生成的进展:全面分析
Bioengineering (Basel). 2025 Jun 25;12(7):693. doi: 10.3390/bioengineering12070693.
4
Revolutionizing Radiology with Natural Language Processing and Chatbot Technologies: A Narrative Umbrella Review on Current Trends and Future Directions.利用自然语言处理和聊天机器人技术革新放射学:当前趋势与未来方向的叙述性综述
J Clin Med. 2024 Dec 2;13(23):7337. doi: 10.3390/jcm13237337.
5
A dental intraoral image dataset of gingivitis for image captioning.用于图像字幕的牙龈炎口腔内牙齿图像数据集。
Data Brief. 2024 Sep 19;57:110960. doi: 10.1016/j.dib.2024.110960. eCollection 2024 Dec.