文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

From jargon to clarity: Improving the readability of foot and ankle radiology reports with an artificial intelligence large language model.

作者信息

Butler James J, Harrington Michael C, Tong Yixuan, Rosenbaum Andrew J, Samsonov Alan P, Walls Raymond J, Kennedy John G

机构信息

Foot and Ankle Division, Department of Orthopaedic Surgery, NYU Langone Health, 171 Delancey St, 2nd floor, New York City, USA.

Department of Orthopedic Surgery, Albany Medical Center, Albany, New York, USA.

出版信息

Foot Ankle Surg. 2024 Jun;30(4):331-337. doi: 10.1016/j.fas.2024.01.008. Epub 2024 Feb 5.


DOI:10.1016/j.fas.2024.01.008
PMID:38336501
Abstract

BACKGROUND: The purpose of this study was to evaluate the efficacy of an Artificial Intelligence Large Language Model (AI-LLM) at improving the readability foot and ankle orthopedic radiology reports. METHODS: The radiology reports from 100 foot or ankle X-Rays, 100 computed tomography (CT) scans and 100 magnetic resonance imaging (MRI) scans were randomly sampled from the institution's database. The following prompt command was inserted into the AI-LLM: "Explain this radiology report to a patient in layman's terms in the second person: [Report Text]". The mean report length, Flesch reading ease score (FRES) and Flesch-Kincaid reading level (FKRL) were evaluated for both the original radiology report and the AI-LLM generated report. The accuracy of the information contained within the AI-LLM report was assessed via a 5-point Likert scale. Additionally, any "hallucinations" generated by the AI-LLM report were recorded. RESULTS: There was a statistically significant improvement in mean FRES scores in the AI-LLM generated X-Ray report (33.8 ± 6.8 to 72.7 ± 5.4), CT report (27.8 ± 4.6 to 67.5 ± 4.9) and MRI report (20.3 ± 7.2 to 66.9 ± 3.9), all p < 0.001. There was also a statistically significant improvement in mean FKRL scores in the AI-LLM generated X-Ray report (12.2 ± 1.1 to 8.5 ± 0.4), CT report (15.4 ± 2.0 to 8.4 ± 0.6) and MRI report (14.1 ± 1.6 to 8.5 ± 0.5), all p < 0.001. Superior FRES scores were observed in the AI-LLM generated X-Ray report compared to the AI-LLM generated CT report and MRI report, p < 0.001. The mean Likert score for the AI-LLM generated X-Ray report, CT report and MRI report was 4.0 ± 0.3, 3.9 ± 0.4, and 3.9 ± 0.4, respectively. The rate of hallucinations in the AI-LLM generated X-Ray report, CT report and MRI report was 4%, 7% and 6%, respectively. CONCLUSION: AI-LLM was an efficacious tool for improving the readability of foot and ankle radiological reports across multiple imaging modalities. Superior FRES scores together with superior Likert scores were observed in the X-Ray AI-LLM reports compared to the CT and MRI AI-LLM reports. This study demonstrates the potential use of AI-LLMs as a new patient-centric approach for enhancing patient understanding of their foot and ankle radiology reports. Jel Classifications: IV.

摘要

相似文献

[1]
From jargon to clarity: Improving the readability of foot and ankle radiology reports with an artificial intelligence large language model.

Foot Ankle Surg. 2024-6

[2]
From technical to understandable: Artificial Intelligence Large Language Models improve the readability of knee radiology reports.

Knee Surg Sports Traumatol Arthrosc. 2024-5

[3]
Decoding Radiology Reports: Artificial Intelligence-Large Language Models Can Improve the Readability of Hand and Wrist Orthopedic Radiology Reports.

Hand (N Y). 2024-8-13

[4]
Decoding radiology reports: Potential application of OpenAI ChatGPT to enhance patient understanding of diagnostic reports.

Clin Imaging. 2023-9

[5]
Decoding medical jargon: The use of AI language models (ChatGPT-4, BARD, microsoft copilot) in radiology reports.

Patient Educ Couns. 2024-9

[6]
Assessing the Application of Large Language Models in Generating Dermatologic Patient Education Materials According to Reading Level: Qualitative Study.

JMIR Dermatol. 2024-5-16

[7]
Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.

Cureus. 2024-8-28

[8]
Harnessing artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in generating clinician-level bariatric surgery recommendations.

Surg Obes Relat Dis. 2024-7

[9]
Patient-centered radiology reports with generative artificial intelligence: adding value to radiology reporting.

Sci Rep. 2024-6-8

[10]
Quality of Large Language Model Responses to Radiation Oncology Patient Care Questions.

JAMA Netw Open. 2024-4-1

引用本文的文献

[1]
Systematic Review on Large Language Models in Orthopaedic Surgery.

J Clin Med. 2025-8-20

[2]
Structured Transformation of Unstructured Prostate MRI Reports Using Large Language Models.

Tomography. 2025-6-17

[3]
Large Language Models in Spine Surgery: A Promising Technology.

HSS J. 2025-5-29

[4]
Evaluation of a large language model to simplify discharge summaries and provide cardiological lifestyle recommendations.

Commun Med (Lond). 2025-5-29

[5]
Evaluation of large language models in generating pulmonary nodule follow-up recommendations.

Eur J Radiol Open. 2025-4-30

[6]
Detecting New Lesions Using a Large Language Model: Applications in Real-World Multiple Sclerosis Datasets.

Ann Neurol. 2025-8

[7]
Artificial Intelligence-Empowered Radiology-Current Status and Critical Review.

Diagnostics (Basel). 2025-1-24

[8]
Evaluation of large language models under different training background in Chinese medical examination: a comparative study.

Front Artif Intell. 2024-12-4

[9]
Generative Large Language Models in Electronic Health Records for Patient Care Since 2023: A Systematic Review.

medRxiv. 2024-8-19

[10]
Conceptual review of outcome metrics and measures used in clinical evaluation of artificial intelligence in radiology.

Radiol Med. 2024-11

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索