Li Hanzhou Hanssen, Moon John T, Kumar Sampath, Ricci Julian, Sim Nathan, Bercu Zachary L, Newsome Janice, Trivedi Hari M, Gichoya Judy W
Division of Interventional Radiology and Image-Guided Medicine, Department of Radiology and Imaging Science, Emory University School of Medicine, Atlanta, Georgia.
J Vasc Interv Radiol. 2025 Apr;36(4):696-703.e1. doi: 10.1016/j.jvir.2025.01.002. Epub 2025 Jan 9.
This study assessed the feasibility of using large language models such as GPT-4 (OpenAI, San Francisco, California) to summarize interventional radiology procedural reports to improve layperson understanding and to translate medical texts into multiple languages. Two hundred reports from 8 categories were summarized using GPT-4. Readability was assessed with Flesch-Kincaid reading level (FKRL) and Flesch reading ease score (FRES). Accuracy was assessed by 8 interventional radiologists. Summaries were translated into Spanish, Korean, Chinese, and Swahili, and their accuracy was assessed by 8 bilingual interventional radiologists. The original reports' FKRL of 10.7 and FRES of 41.9 improved to 7.0 and 73.0, respectively. Summaries were mostly accurate, with minimal misinformation. Translation increased the number of instances of misinformation but did not significantly increase critically wrong information. Layperson comprehension scores improved significantly from 2.5 to 4.3 out of 5 after summarization. Overall, GPT-4 enhanced report readability and comprehension, suggesting potential for broader application in improving patient communication.
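For readers unfamiliar with the two readability metrics, the following is a minimal sketch of the standard published Flesch formulas. The abstract does not state which tool the authors used to compute the scores, so the function names and the vowel-group syllable counter here are illustrative assumptions; the counter is a crude heuristic and will not reproduce the reported values exactly.

```python
import re


def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Standard Flesch reading ease score: higher means easier to read."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Standard Flesch-Kincaid grade level: approximate US school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def count_syllables(word: str) -> int:
    """Crude heuristic (assumption): one syllable per run of vowels, minimum 1."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def text_stats(text: str) -> tuple[int, int, int]:
    """Return (words, sentences, syllables) using simple regex tokenization."""
    words = re.findall(r"[A-Za-z]+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    syllables = sum(count_syllables(w) for w in words)
    return len(words), len(sentences), syllables


if __name__ == "__main__":
    sample = "The cat sat. The dog ran."
    w, s, syl = text_stats(sample)
    print(round(flesch_reading_ease(w, s, syl), 2))
    print(round(flesch_kincaid_grade(w, s, syl), 2))
```

Both formulas depend only on average sentence length and average syllables per word, so shorter sentences and simpler words raise the FRES and lower the FKRL, which is the direction of change the study reports after summarization.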