Artificial Intelligence Language Models to Translate Professional Radiology Mammography Reports Into Plain Language - Impact on Interpretability and Perception by Patients.

Authors

Pisarcik Dusan, Kissling Marc, Heimer Jakob, Farkas Monika, Leo Cornelia, Kubik-Huch Rahel A, Euler André

Affiliations

Department of Radiology, Kantonsspital Baden, affiliated Hospital for Research and Teaching of the Faculty of Medicine of the University of Zurich, Baden, Switzerland (D.P., M.K., J.H., M.F., R.A.K.H., A.E.).

Department of Gynecology, Interdisciplinary Breast Center, Kantonsspital Baden, affiliated Hospital for Research and Teaching of the Faculty of Medicine of the University of Zurich, Baden, Switzerland (C.L.).

Publication information

Acad Radiol. 2025 Sep;32(9):4988-4996. doi: 10.1016/j.acra.2025.05.065. Epub 2025 Jun 19.

DOI: 10.1016/j.acra.2025.05.065
PMID: 40537381

Abstract

RATIONALE AND OBJECTIVES

This study used a survey to evaluate the interpretability and patient perception of AI-translated mammography and sonography reports, focusing on comprehensibility, follow-up recommendations, and conveyed empathy.

MATERIALS AND METHODS

In this observational study, three fictional mammography and sonography reports with BI-RADS categories 3, 4, and 5 were created. Each report was translated into plain language several times by each of three large language models (LLMs: ChatGPT-4, ChatGPT-4o, Google Gemini). In a first step, two experts in breast imaging selected the best translation for each combination of BI-RADS category and LLM, judging factual correctness, completeness, and quality. In a second step, female participants compared the selected translations and rated each one for comprehensibility, follow-up recommendations, conveyed empathy, and additional value, using a survey with Likert scales. Statistical analysis included cumulative link mixed models for the ratings and the Plackett-Luce model for ranking preferences.
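
The Plackett-Luce model mentioned above treats a ranking as a sequence of choices: the top item is picked from all items with probability proportional to its "worth", the second from the remaining items, and so on. The following is a minimal sketch of how such worths can be estimated with the standard minorization-maximization (MM) updates. It is not the authors' analysis: the abstract does not state which software was used, the function name `fit_plackett_luce` is illustrative, and the rankings and the item names `model_a`, `model_b`, `model_c` are invented for the example.

```python
from collections import defaultdict


def fit_plackett_luce(rankings, n_iter=200):
    """Estimate Plackett-Luce worths with MM updates.

    rankings: list of tuples, each ordering item names from best to worst.
    Returns a dict of worths normalised to sum to 1.
    Assumes every item is ranked above last place at least once.
    """
    items = sorted({item for r in rankings for item in r})
    worth = {item: 1.0 / len(items) for item in items}

    # wins[i]: number of rankings in which item i is placed above last.
    wins = defaultdict(int)
    for r in rankings:
        for item in r[:-1]:
            wins[item] += 1

    for _ in range(n_iter):
        # denom[i]: sum over all choice stages in which i was still
        # available of 1 / (total worth of the items still available).
        denom = defaultdict(float)
        for r in rankings:
            for stage in range(len(r) - 1):
                remaining = r[stage:]
                inv = 1.0 / sum(worth[i] for i in remaining)
                for i in remaining:
                    denom[i] += inv
        updated = {i: wins[i] / denom[i] for i in items}
        total = sum(updated.values())
        worth = {i: v / total for i, v in updated.items()}
    return worth


# Hypothetical rankings from ten raters (invented data, not from the study).
rankings = (
    [("model_a", "model_b", "model_c")] * 5
    + [("model_b", "model_a", "model_c")] * 3
    + [("model_a", "model_c", "model_b")] * 1
    + [("model_c", "model_a", "model_b")] * 1
)
worths = fit_plackett_luce(rankings)
for name, value in sorted(worths.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {value:.3f}")
```

The fitted worths sum to 1, so each can be read as the estimated probability that the item is ranked first when all items are compared.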

RESULTS

Forty women participated in the survey. GPT-4 and GPT-4o were rated significantly higher than Gemini across all categories (P<.001). Participants older than 50 years rated the reports significantly higher than participants aged 18-29 years (P<.05). Higher education predicted lower ratings (P=.02). Participants with no prior mammography gave higher scores (P=.03), and AI experience had no effect (P=.88). Ranking analysis showed GPT-4o as the most preferred (P=.48), followed by GPT-4 (P=.37), with Gemini ranked last (P=.15); these three values sum to 1 and appear to be estimated preference probabilities from the Plackett-Luce model rather than significance levels.
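
The three values reported for the ranking analysis (.48, .37, .15) sum to 1, which is what normalised Plackett-Luce worths would do. Under that reading, which is an assumption and is not confirmed by the abstract, the worths imply pairwise preference probabilities and probabilities for complete rankings. The helper names `pairwise_preference` and `ranking_probability` below are illustrative.

```python
from itertools import permutations

# Values from the abstract, read here as Plackett-Luce worths (an assumption).
worths = {"GPT-4o": 0.48, "GPT-4": 0.37, "Gemini": 0.15}


def pairwise_preference(worths, a, b):
    """Probability that item a is preferred to item b in a head-to-head choice."""
    return worths[a] / (worths[a] + worths[b])


def ranking_probability(worths, order):
    """Probability of observing the full ranking `order` (best to worst)."""
    remaining = list(order)
    prob = 1.0
    while len(remaining) > 1:
        # Choose the next-best item from those still unranked.
        prob *= worths[remaining[0]] / sum(worths[i] for i in remaining)
        remaining.pop(0)
    return prob


p_pair = pairwise_preference(worths, "GPT-4o", "Gemini")
p_rank = ranking_probability(worths, ("GPT-4o", "GPT-4", "Gemini"))
print(f"P(GPT-4o preferred to Gemini) = {p_pair:.3f}")
print(f"P(GPT-4o > GPT-4 > Gemini)    = {p_rank:.3f}")

# Sanity check: the probabilities of all six possible rankings sum to 1.
total = sum(ranking_probability(worths, order) for order in permutations(worths))
```

Under this reading, GPT-4o would be preferred to Gemini in roughly three of four head-to-head comparisons, while the gap between GPT-4o and GPT-4 is much smaller.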

CONCLUSION

Patient preference differed among AI-translated radiology reports. Compared to a traditional report using radiological language, AI-translated reports add value for patients, enhance comprehensibility and empathy and therefore hold the potential to improve patient communication in breast imaging.
