Rao Vishwanatha M, Hla Michael, Moor Michael, Adithan Subathra, Kwak Stephen, Topol Eric J, Rajpurkar Pranav
Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
Nature. 2025 Mar;639(8056):888-896. doi: 10.1038/s41586-025-08675-y. Epub 2025 Mar 26.
Accurately interpreting medical images and generating insightful narrative reports is indispensable for patient care but places heavy burdens on clinical experts. Advances in artificial intelligence (AI), especially in an area that we refer to as multimodal generative medical image interpretation (GenMI), create opportunities to automate parts of this complex process. In this Perspective, we synthesize progress and challenges in developing AI systems for generation of medical reports from images. We focus extensively on radiology as a domain with enormous reporting needs and research efforts. In addition to analysing the strengths and applications of new models for medical report generation, we advocate for a novel paradigm to deploy GenMI in a manner that empowers clinicians and their patients. Initial research suggests that GenMI could one day match human expert performance in generating reports across disciplines, such as radiology, pathology and dermatology. However, formidable obstacles remain in validating model accuracy, ensuring transparency and eliciting nuanced impressions. If carefully implemented, GenMI could meaningfully assist clinicians in improving quality of care, enhancing medical education, reducing workloads, expanding specialty access and providing real-time expertise. Overall, we highlight opportunities alongside key challenges for developing multimodal generative AI that complements human experts for reliable medical report writing.
准确解读医学图像并生成有深刻见解的叙述性报告对患者护理至关重要,但给临床专家带来了沉重负担。人工智能(AI)的进展,特别是在我们称之为多模态生成式医学图像解读(GenMI)的领域,为自动化这一复杂过程的部分环节创造了机会。在这篇观点文章中,我们总结了开发用于从图像生成医学报告的人工智能系统方面的进展和挑战。我们广泛关注放射学领域,因为该领域有巨大的报告需求且研究工作众多。除了分析用于生成医学报告的新模型的优势和应用外,我们倡导一种新的范式,以一种赋予临床医生及其患者权力的方式来部署GenMI。初步研究表明,GenMI有朝一日在跨学科(如放射学、病理学和皮肤病学)生成报告方面可能与人类专家的表现相媲美。然而,在验证模型准确性、确保透明度以及引出细微差别方面仍存在巨大障碍。如果谨慎实施,GenMI可以切实帮助临床医生提高护理质量、加强医学教育、减轻工作量、扩大专科服务可及性并提供实时专业知识。总体而言,我们强调了开发多模态生成式人工智能以补充人类专家进行可靠医学报告撰写的机会以及关键挑战。