• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估自动胸部X光放射学报告生成的进展。

Evaluating progress in automatic chest X-ray radiology report generation.

作者信息

Yu Feiyang, Endo Mark, Krishnan Rayan, Pan Ian, Tsai Andy, Reis Eduardo Pontes, Fonseca Eduardo Kaiser Ururahy Nunes, Lee Henrique Min Ho, Abad Zahra Shakeri Hossein, Ng Andrew Y, Langlotz Curtis P, Venugopal Vasantha Kumar, Rajpurkar Pranav

机构信息

Department of Computer Science, Stanford University, Stanford, CA 94305, USA.

Department of Radiology, Brigham and Women's Hospital, Boston, MA 02115, USA.

出版信息

Patterns (N Y). 2023 Aug 3;4(9):100802. doi: 10.1016/j.patter.2023.100802. eCollection 2023 Sep 8.

DOI:10.1016/j.patter.2023.100802
PMID:37720336
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10499844/
Abstract

Artificial intelligence (AI) models for automatic generation of narrative radiology reports from images have the potential to enhance efficiency and reduce the workload of radiologists. However, evaluating the correctness of these reports requires metrics that can capture clinically pertinent differences. In this study, we investigate the alignment between automated metrics and radiologists' scoring of errors in report generation. We address the limitations of existing metrics by proposing new metrics, RadGraph F1 and RadCliQ, which demonstrate stronger correlation with radiologists' evaluations. In addition, we analyze the failure modes of the metrics to understand their limitations and provide guidance for metric selection and interpretation. This study establishes RadGraph F1 and RadCliQ as meaningful metrics for guiding future research in radiology report generation.

摘要

用于根据图像自动生成叙述性放射学报告的人工智能(AI)模型有提高效率和减轻放射科医生工作量的潜力。然而,评估这些报告的正确性需要能够捕捉临床相关差异的指标。在本研究中,我们调查了自动指标与放射科医生对报告生成错误的评分之间的一致性。我们通过提出新的指标RadGraph F1和RadCliQ来解决现有指标的局限性,这两个指标与放射科医生的评估显示出更强的相关性。此外,我们分析了指标的失败模式以了解其局限性,并为指标的选择和解释提供指导。本研究将RadGraph F1和RadCliQ确立为指导未来放射学报告生成研究的有意义的指标。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/15608d7b479d/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/cebf43eb7b3b/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/bde5cc6bfa6c/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/e850090c9715/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/33d56f4a848e/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/15608d7b479d/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/cebf43eb7b3b/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/bde5cc6bfa6c/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/e850090c9715/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/33d56f4a848e/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6a5/10499844/15608d7b479d/gr4.jpg

相似文献

1
Evaluating progress in automatic chest X-ray radiology report generation.评估自动胸部X光放射学报告生成的进展。
Patterns (N Y). 2023 Aug 3;4(9):100802. doi: 10.1016/j.patter.2023.100802. eCollection 2023 Sep 8.
2
Leveraging Professional Radiologists' Expertise to Enhance LLMs' Evaluation for Radiology Reports.利用专业放射科医生的专业知识提升大语言模型对放射学报告的评估能力。
ArXiv. 2024 Feb 17:arXiv:2401.16578v3.
3
Thoracic Radiologists' Versus Computer Scientists' Perspectives on the Future of Artificial Intelligence in Radiology.胸科放射科医生与计算机科学家对放射学人工智能未来的看法。
J Thorac Imaging. 2020 Jul;35(4):255-259. doi: 10.1097/RTI.0000000000000453.
4
Evaluating diagnostic content of AI-generated radiology reports of chest X-rays.评估人工智能生成的胸部 X 光射线诊断报告的诊断内容。
Artif Intell Med. 2021 Jun;116:102075. doi: 10.1016/j.artmed.2021.102075. Epub 2021 Apr 15.
5
Radiology report generation with a learned knowledge base and multi-modal alignment.基于学习知识库和多模态对齐的放射学报告生成
Med Image Anal. 2023 May;86:102798. doi: 10.1016/j.media.2023.102798. Epub 2023 Mar 23.
6
Knowledge matters: Chest radiology report generation with general and specific knowledge.知识很重要:使用通用和特定知识生成胸部放射学报告。
Med Image Anal. 2022 Aug;80:102510. doi: 10.1016/j.media.2022.102510. Epub 2022 Jun 9.
7
The effect of an artificial intelligence algorithm on chest X-ray interpretation of radiology residents.人工智能算法对放射科住院医师胸部 X 光片判读的影响。
Br J Radiol. 2022 Oct 1;95(1139):20210688. doi: 10.1259/bjr.20210688. Epub 2022 Oct 5.
8
How to prepare for a bright future of radiology in Europe.如何为欧洲放射学的光明未来做好准备。
Insights Imaging. 2023 Oct 10;14(1):168. doi: 10.1186/s13244-023-01525-3.
9
Radiologists' perceptions on AI integration: An in-depth survey study.放射科医生对人工智能整合的看法:一项深入的调查研究。
Eur J Radiol. 2024 Aug;177:111590. doi: 10.1016/j.ejrad.2024.111590. Epub 2024 Jun 27.
10
Stakeholders' perspectives on the future of artificial intelligence in radiology: a scoping review.利益相关者对放射学人工智能未来的看法:范围综述。
Eur Radiol. 2022 Mar;32(3):1477-1495. doi: 10.1007/s00330-021-08214-z. Epub 2021 Sep 21.

引用本文的文献

1
From large language models to multimodal AI: a scoping review on the potential of generative AI in medicine.从大语言模型到多模态人工智能:关于生成式人工智能在医学领域潜力的范围综述
Biomed Eng Lett. 2025 Aug 22;15(5):845-863. doi: 10.1007/s13534-025-00497-1. eCollection 2025 Sep.
2
Towards generalist foundation model for radiology by leveraging web-scale 2D&3D medical data.通过利用网络规模的二维和三维医学数据构建放射学通用基础模型。
Nat Commun. 2025 Aug 23;16(1):7866. doi: 10.1038/s41467-025-62385-7.
3
Advancements in Radiology Report Generation: A Comprehensive Analysis.

本文引用的文献

1
Improving chest X-ray report generation by leveraging warm starting.利用热启动提高胸部 X 光报告生成
Artif Intell Med. 2023 Oct;144:102633. doi: 10.1016/j.artmed.2023.102633. Epub 2023 Aug 19.
2
The Current and Future State of AI Interpretation of Medical Images.医学图像人工智能解读的现状与未来发展态势
N Engl J Med. 2023 May 25;388(21):1981-1990. doi: 10.1056/NEJMra2301725.
3
Expert-level detection of pathologies from unannotated chest X-ray images via self-supervised learning.通过自监督学习对未经注释的胸部 X 光图像中的病理学进行专家级检测。
放射学报告生成的进展:全面分析
Bioengineering (Basel). 2025 Jun 25;12(7):693. doi: 10.3390/bioengineering12070693.
4
Opportunities and challenges in lung cancer care in the era of large language models and vision language models.大语言模型和视觉语言模型时代肺癌护理中的机遇与挑战
Transl Lung Cancer Res. 2025 May 30;14(5):1830-1847. doi: 10.21037/tlcr-24-801. Epub 2025 May 23.
5
Efficiency and Quality of Generative AI-Assisted Radiograph Reporting.生成式人工智能辅助X线片报告的效率与质量
JAMA Netw Open. 2025 Jun 2;8(6):e2513921. doi: 10.1001/jamanetworkopen.2025.13921.
6
Large Language Models in Medicine: Clinical Applications, Technical Challenges, and Ethical Considerations.医学领域的大语言模型:临床应用、技术挑战与伦理考量
Healthc Inform Res. 2025 Apr;31(2):114-124. doi: 10.4258/hir.2025.31.2.114. Epub 2025 Apr 30.
7
The Evolution of Radiology Image Annotation in the Era of Large Language Models.大语言模型时代放射学图像标注的演变
Radiol Artif Intell. 2025 Jul;7(4):e240631. doi: 10.1148/ryai.240631.
8
A clinically accessible small multimodal radiology model and evaluation metric for chest X-ray findings.一种用于胸部X光检查结果的临床可用的小型多模态放射学模型及评估指标。
Nat Commun. 2025 Apr 1;16(1):3108. doi: 10.1038/s41467-025-58344-x.
9
A vision attention driven Language framework for medical report generation.一种用于医学报告生成的视觉注意力驱动语言框架。
Sci Rep. 2025 Mar 28;15(1):10704. doi: 10.1038/s41598-025-95666-8.
10
Multimodal generative AI for medical image interpretation.用于医学图像解读的多模态生成式人工智能。
Nature. 2025 Mar;639(8056):888-896. doi: 10.1038/s41586-025-08675-y. Epub 2025 Mar 26.
Nat Biomed Eng. 2022 Dec;6(12):1399-1406. doi: 10.1038/s41551-022-00936-9. Epub 2022 Sep 15.
4
Transfer learning enables prediction of myocardial injury from continuous single-lead electrocardiography.迁移学习可实现从连续单导联心电图预测心肌损伤。
J Am Med Inform Assoc. 2022 Oct 7;29(11):1908-1918. doi: 10.1093/jamia/ocac135.
5
Knowledge matters: Chest radiology report generation with general and specific knowledge.知识很重要:使用通用和特定知识生成胸部放射学报告。
Med Image Anal. 2022 Aug;80:102510. doi: 10.1016/j.media.2022.102510. Epub 2022 Jun 9.
6
Leveraging Deep Representations of Radiology Reports in Survival Analysis for Predicting Heart Failure Patient Mortality.在生存分析中利用放射学报告的深度表征预测心力衰竭患者死亡率
Proc Conf. 2021 Jun;2021:4533-4538. doi: 10.18653/v1/2021.naacl-main.358.
7
AI in health and medicine.人工智能在医疗中的应用。
Nat Med. 2022 Jan;28(1):31-38. doi: 10.1038/s41591-021-01614-0. Epub 2022 Jan 20.
8
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
9
Deep learning in generating radiology reports: A survey.深度学习在生成放射学报告中的应用:综述。
Artif Intell Med. 2020 Jun;106:101878. doi: 10.1016/j.artmed.2020.101878. Epub 2020 May 15.
10
MIMIC-CXR, a de-identified publicly available database of chest radiographs with free-text reports.MIMIC-CXR,一个去标识化的、公开可用的、包含自由文本报告的胸部 X 光数据库。
Sci Data. 2019 Dec 12;6(1):317. doi: 10.1038/s41597-019-0322-0.