Multimodal generative AI for medical image interpretation.

Authors

Rao Vishwanatha M, Hla Michael, Moor Michael, Adithan Subathra, Kwak Stephen, Topol Eric J, Rajpurkar Pranav

Affiliations

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

Publication information

Nature. 2025 Mar;639(8056):888-896. doi: 10.1038/s41586-025-08675-y. Epub 2025 Mar 26.

DOI:10.1038/s41586-025-08675-y

PMID:40140592

Abstract

Accurately interpreting medical images and generating insightful narrative reports is indispensable for patient care but places heavy burdens on clinical experts. Advances in artificial intelligence (AI), especially in an area that we refer to as multimodal generative medical image interpretation (GenMI), create opportunities to automate parts of this complex process. In this Perspective, we synthesize progress and challenges in developing AI systems for generation of medical reports from images. We focus extensively on radiology as a domain with enormous reporting needs and research efforts. In addition to analysing the strengths and applications of new models for medical report generation, we advocate for a novel paradigm to deploy GenMI in a manner that empowers clinicians and their patients. Initial research suggests that GenMI could one day match human expert performance in generating reports across disciplines, such as radiology, pathology and dermatology. However, formidable obstacles remain in validating model accuracy, ensuring transparency and eliciting nuanced impressions. If carefully implemented, GenMI could meaningfully assist clinicians in improving quality of care, enhancing medical education, reducing workloads, expanding specialty access and providing real-time expertise. Overall, we highlight opportunities alongside key challenges for developing multimodal generative AI that complements human experts for reliable medical report writing.

Similar articles

Multimodal generative AI for medical image interpretation.

Nature. 2025 Mar;639(8056):888-896. doi: 10.1038/s41586-025-08675-y. Epub 2025 Mar 26.

Generative artificial intelligence to produce high-fidelity blastocyst-stage embryo images.

Hum Reprod. 2024 Jun 3;39(6):1197-1207. doi: 10.1093/humrep/deae064.

Image-Based Generative Artificial Intelligence in Radiology: Comprehensive Updates.

Korean J Radiol. 2024 Nov;25(11):959-981. doi: 10.3348/kjr.2024.0392.

Assessing GPT-4 multimodal performance in radiological image analysis.

Eur Radiol. 2025 Apr;35(4):1959-1965. doi: 10.1007/s00330-024-11035-5. Epub 2024 Aug 30.

Exploring prospects, hurdles, and road ahead for generative artificial intelligence in orthopedic education and training.

BMC Med Educ. 2024 Dec 28;24(1):1544. doi: 10.1186/s12909-024-06592-8.

Fitness for Purpose of Text-to-Image Generative Artificial Intelligence Image Creation in Medical Imaging.

J Nucl Med Technol. 2025 Mar 5;53(1):63-67. doi: 10.2967/jnmt.124.268402.

Using Artificial Intelligence to Revise ACR TI-RADS Risk Stratification of Thyroid Nodules: Diagnostic Accuracy and Utility.

Radiology. 2019 Jul;292(1):112-119. doi: 10.1148/radiol.2019182128. Epub 2019 May 21.

Revolutionizing Digital Pathology With the Power of Generative Artificial Intelligence and Foundation Models.

Lab Invest. 2023 Nov;103(11):100255. doi: 10.1016/j.labinv.2023.100255. Epub 2023 Sep 26.

Generative AI in healthcare: an implementation science informed translational path on application, integration and governance.

Implement Sci. 2024 Mar 15;19(1):27. doi: 10.1186/s13012-024-01357-9.

Harnessing the Power of Generative Artificial Intelligence in Pathology Education: Opportunities, Challenges, and Future Directions.

Arch Pathol Lab Med. 2025 Feb 1;149(2):142-151. doi: 10.5858/arpa.2024-0187-RA.

Cited by

Deep-learning triage of 3D pathology datasets for comprehensive and efficient pathologist assessments.

bioRxiv. 2025 Jul 22:2025.07.20.665804. doi: 10.1101/2025.07.20.665804.

Interpretation of AI-Generated vs. Human-Made Images.

J Imaging. 2025 Jul 7;11(7):227. doi: 10.3390/jimaging11070227.

Efficiency and Quality of Generative AI-Assisted Radiograph Reporting.

JAMA Netw Open. 2025 Jun 2;8(6):e2513921. doi: 10.1001/jamanetworkopen.2025.13921.

References

The MAIDA initiative: establishing a framework for global medical-imaging data sharing.

Lancet Digit Health. 2024 Jan;6(1):e6-e8. doi: 10.1016/S2589-7500(23)00222-4. Epub 2023 Nov 15.

The future landscape of large language models in medicine.

Commun Med (Lond). 2023 Oct 10;3(1):141. doi: 10.1038/s43856-023-00370-1.

Generative Artificial Intelligence for Chest Radiograph Interpretation in the Emergency Department.

JAMA Netw Open. 2023 Oct 2;6(10):e2336100. doi: 10.1001/jamanetworkopen.2023.36100.

Improving chest X-ray report generation by leveraging warm starting.

Artif Intell Med. 2023 Oct;144:102633. doi: 10.1016/j.artmed.2023.102633. Epub 2023 Aug 19.

Evaluating progress in automatic chest X-ray radiology report generation.

Patterns (N Y). 2023 Aug 3;4(9):100802. doi: 10.1016/j.patter.2023.100802. eCollection 2023 Sep 8.

Automatic comprehensive radiological reports for clinical acute stroke MRIs.

Commun Med (Lond). 2023 Jul 10;3(1):95. doi: 10.1038/s43856-023-00327-4.

Bias in AI-based models for medical applications: challenges and mitigation strategies.

NPJ Digit Med. 2023 Jun 14;6(1):113. doi: 10.1038/s41746-023-00858-z.

Foundation models for generalist medical artificial intelligence.

Nature. 2023 Apr;616(7956):259-265. doi: 10.1038/s41586-023-05881-4. Epub 2023 Apr 12.

Attributed Abnormality Graph Embedding for Clinically Accurate X-Ray Report Generation.

IEEE Trans Med Imaging. 2023 Aug;42(8):2211-2222. doi: 10.1109/TMI.2023.3245608. Epub 2023 Aug 1.

A Novel Deep Learning Model for Medical Report Generation by Inter-Intra Information Calibration.

IEEE J Biomed Health Inform. 2023 Oct;27(10):5110-5121. doi: 10.1109/JBHI.2023.3236661. Epub 2023 Oct 5.
