• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用人工智能翻译眼科医学术语:一项比较理解研究。

Translating ophthalmic medical jargon with artificial intelligence: a comparative comprehension study.

作者信息

Balas Michael, Kaplan Alexander J, Esmail Kaisra, Saleh Solin, Sharma Rahul A, Yan Peng, Arjmand Parnian

机构信息

Department of Ophthalmology and Vision Sciences, University of Toronto, Toronto, ON, Canada.

Department of Ophthalmology and Vision Sciences, University of Toronto, Toronto, ON, Canada; Kensington Eye Institute, Toronto, ON, Canada.

出版信息

Can J Ophthalmol. 2024 Dec 9. doi: 10.1016/j.jcjo.2024.11.003.

DOI:10.1016/j.jcjo.2024.11.003
PMID:39667413
Abstract

OBJECTIVE

Our goal was to evaluate the efficacy of OpenAI's ChatGPT-4.0 large language model (LLM) in translating technical ophthalmology terminology into more comprehensible language for allied health care professionals and compare it with other LLMs.

DESIGN

Observational cross-sectional study.

PARTICIPANTS

Five ophthalmologists each contributed three clinical encounter notes, totaling 15 reports for analysis.

METHODS

Notes were translated into more comprehensible language using ChatGPT-4.0, ChatGPT-4o, Claude 3 Sonnet, and Google Gemini. Ten family physicians, masked to whether the note was original or translated by an LLM, independently evaluated both sets using Likert scales to assess comprehension and utility for clinical decision-making. Readability was evaluated using Flesch Reading Ease and Flesch-Kincaid Grade Level scores. Five ophthalmologist raters compared performance between LLMs and identified translation errors.

RESULTS

LLM translations significantly outperformed the original notes in terms of comprehension (mean score of 4.7/5.0 vs 3.7/5.0; p < 0.001) and perceived usefulness (mean score of 4.6/5.0 vs 3.8/5.0; p < 0.005). Readability analysis demonstrated mildly increased linguistic complexity in the translated notes. ChatGPT-4.0 was preferred in 8 of 15 cases, ChatGPT-4o in 4, Gemini in 3, and Claude 3 Sonnet in 0 cases. All models exhibited some translation errors, but ChatGPT-4o and ChatGPT-4.0 had fewer inaccuracies.

CONCLUSIONS

ChatGPT-4.0 can significantly enhance the comprehensibility of ophthalmic notes, facilitating better interprofessional communication and suggesting a promising role for LLMs in medical translation. However, the results also underscore the need for ongoing refinement and careful implementation of such technologies. Further research is needed to validate these findings across a broader range of specialties and languages.

摘要

目的

我们的目标是评估OpenAI的ChatGPT-4.0大语言模型(LLM)将眼科专业术语翻译成更通俗易懂的语言供联合医疗保健专业人员使用的效果,并将其与其他大语言模型进行比较。

设计

观察性横断面研究。

参与者

五位眼科医生每人提供三份临床会诊记录,共15份报告用于分析。

方法

使用ChatGPT-4.0、ChatGPT-4o、Claude 3 Sonnet和谷歌Gemini将记录翻译成更通俗易懂的语言。十位家庭医生在不知道记录是原始记录还是由大语言模型翻译的情况下,使用李克特量表独立评估这两组记录,以评估其可理解性和对临床决策的实用性。使用弗莱什易读性和弗莱什-金凯德年级水平分数评估可读性。五位眼科医生评分者比较了大语言模型之间的表现并识别翻译错误。

结果

在可理解性方面(平均得分4.7/5.0对3.7/5.0;p<0.001)和感知有用性方面(平均得分4.6/5.0对3.8/5.0;p<0.005),大语言模型的翻译明显优于原始记录。可读性分析表明翻译后的记录在语言复杂性上略有增加。在15个案例中,ChatGPT-4.0在8个案例中更受青睐,ChatGPT-4o在4个案例中更受青睐,Gemini在3个案例中更受青睐,Claude 3 Sonnet在0个案例中更受青睐。所有模型都出现了一些翻译错误,但ChatGPT-4o和ChatGPT-4.0的不准确之处较少。

结论

ChatGPT-4.0可以显著提高眼科记录的可理解性,促进更好的跨专业交流,并表明大语言模型在医学翻译中具有广阔前景。然而,结果也强调了持续改进和谨慎应用此类技术的必要性。需要进一步研究以在更广泛的专业和语言范围内验证这些发现。

相似文献

1
Translating ophthalmic medical jargon with artificial intelligence: a comparative comprehension study.利用人工智能翻译眼科医学术语:一项比较理解研究。
Can J Ophthalmol. 2024 Dec 9. doi: 10.1016/j.jcjo.2024.11.003.
2
Enhancing the Readability of Online Patient Education Materials Using Large Language Models: Cross-Sectional Study.使用大语言模型提高在线患者教育材料的可读性:横断面研究。
J Med Internet Res. 2025 Jun 4;27:e69955. doi: 10.2196/69955.
3
Benchmarking the performance of large language models in uveitis: a comparative analysis of ChatGPT-3.5, ChatGPT-4.0, Google Gemini, and Anthropic Claude3.葡萄膜炎中大型语言模型性能的基准测试:ChatGPT-3.5、ChatGPT-4.0、谷歌Gemini和Anthropic Claude3的比较分析
Eye (Lond). 2025 Apr;39(6):1132-1137. doi: 10.1038/s41433-024-03545-9. Epub 2024 Dec 17.
4
A structured evaluation of LLM-generated step-by-step instructions in cadaveric brachial plexus dissection.对大语言模型生成的尸体臂丛神经解剖分步指导的结构化评估。
BMC Med Educ. 2025 Jul 1;25(1):903. doi: 10.1186/s12909-025-07493-0.
5
Evaluating a Large Language Model in Translating Patient Instructions to Spanish Using a Standardized Framework.使用标准化框架评估大型语言模型在将患者指导说明翻译成西班牙语方面的表现。
JAMA Pediatr. 2025 Jul 7. doi: 10.1001/jamapediatrics.2025.1729.
6
Accuracy of ChatGPT, Gemini, Copilot, and Claude to Blepharoplasty-Related Questions.ChatGPT、Gemini、Copilot和Claude对双眼皮手术相关问题的回答准确性。
Aesthetic Plast Surg. 2025 Jul 21. doi: 10.1007/s00266-025-05071-9.
7
Accuracy of large language models in generating differential diagnosis from clinical presentation and imaging findings in pediatric cases.大型语言模型根据儿科病例的临床表现和影像学检查结果生成鉴别诊断的准确性。
Pediatr Radiol. 2025 Jul 12. doi: 10.1007/s00247-025-06317-z.
8
Enhancing Magnetic Resonance Imaging (MRI) Report Comprehension in Spinal Trauma: Readability Analysis of AI-Generated Explanations for Thoracolumbar Fractures.提高脊柱创伤磁共振成像(MRI)报告的理解:胸腰椎骨折人工智能生成解释的可读性分析
JMIR AI. 2025 Jul 1;4:e69654. doi: 10.2196/69654.
9
Is Information About Musculoskeletal Malignancies From Large Language Models or Web Resources at a Suitable Reading Level for Patients?来自大语言模型或网络资源的关于肌肉骨骼恶性肿瘤的信息对患者来说是否处于合适的阅读水平?
Clin Orthop Relat Res. 2025 Feb 1;483(2):306-315. doi: 10.1097/CORR.0000000000003263. Epub 2024 Sep 25.
10
Artificial Intelligence in Peripheral Artery Disease Education: A Battle Between ChatGPT and Google Gemini.外周动脉疾病教育中的人工智能:ChatGPT与谷歌Gemini的较量
Cureus. 2025 Jun 1;17(6):e85174. doi: 10.7759/cureus.85174. eCollection 2025 Jun.