• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChatGPT 和谷歌翻译在儿科出院医嘱翻译中的性能。

Performance of ChatGPT and Google Translate for Pediatric Discharge Instruction Translation.

机构信息

Boston Combined Residency Program in Pediatrics at Boston Children's Hospital and Boston Medical Center, Boston, Massachusetts.

Departments of Pediatrics.

出版信息

Pediatrics. 2024 Jul 1;154(1). doi: 10.1542/peds.2023-065573.

DOI:10.1542/peds.2023-065573
PMID:38860299
Abstract

BACKGROUND AND OBJECTIVES

Patients who speak languages other than English face barriers to equitable healthcare delivery. Machine translation systems, including emerging large language models, have the potential to expand access to translation services, but their merits and limitations in clinical practice remain poorly defined. We aimed to assess the performance of Google Translate and ChatGPT for multilingual translation of pediatric discharge instructions.

METHODS

Twenty standardized discharge instructions for pediatric conditions were translated into Spanish, Brazilian Portuguese, and Haitian Creole by professional translation services, Google Translate and ChatGPT-4.0, and evaluated for adequacy (preserved information), fluency (grammatical correctness), meaning (preserved connotation), and severity (clinical harm), along with assessment of overall preference. Domain-level ratings and preferred translation source were summarized with descriptive statistics and compared with professional translations.

RESULTS

Google Translate and ChatGPT demonstrated similar domain-level ratings to professional translations for Spanish and Portuguese. For Haitian Creole, compared with both Google Translate and ChatGPT, professional translations demonstrated significantly greater adequacy, fluency meaning, and severity scores. ChatGPT (33.3%, P < .001) and Google Translate (23.3%, P = .024) contained more potentially clinically significant errors (severity score ≤3) for Haitian Creole than professional translations (8.3%). Professional Haitian Creole (48.3%) and Portuguese (43.3%), but not Spanish (15%), translations were most frequently preferred among translation sources.

CONCLUSIONS

Machine translation platforms have comparable performance to professional translations for Spanish and Portuguese but shortcomings in quality, accuracy, and preference persist for Haitian Creole. Diverse multilingual training data are needed, along with regulations ensuring safe and equitable applications of machine translation in clinical practice.

摘要

背景与目的

讲英语以外其他语言的患者在获得公平医疗服务方面面临障碍。机器翻译系统,包括新兴的大型语言模型,有可能扩大翻译服务的获取途径,但它们在临床实践中的优缺点仍未得到明确界定。我们旨在评估 Google Translate 和 ChatGPT 在翻译儿科出院指导方面的多语言翻译表现。

方法

由专业翻译服务、Google Translate 和 ChatGPT-4.0 将 20 份标准化儿科疾病的出院指导翻译成西班牙语、巴西葡萄牙语和海地克里奥尔语,评估其充分性(保留信息)、流畅性(语法正确性)、意义(保留内涵)和严重程度(临床危害),以及整体偏好。用描述性统计对域级评分和首选翻译来源进行总结,并与专业翻译进行比较。

结果

Google Translate 和 ChatGPT 在西班牙语和葡萄牙语方面与专业翻译的域级评分相似。对于海地克里奥尔语,与 Google Translate 和 ChatGPT 相比,专业翻译的充分性、流畅性、意义和严重程度评分显著更高。ChatGPT(33.3%,P<.001)和 Google Translate(23.3%,P=.024)比专业翻译(8.3%)包含更多潜在临床意义的错误(严重程度评分≤3)。专业海地克里奥尔语(48.3%)和葡萄牙语(43.3%),但不是西班牙语(15%)的翻译源在翻译源中最常被首选。

结论

机器翻译平台在西班牙语和葡萄牙语方面的表现与专业翻译相当,但在海地克里奥尔语方面,质量、准确性和偏好方面的不足仍然存在。需要更多的多语言培训数据,以及确保机器翻译在临床实践中安全和公平应用的法规。

相似文献

1
Performance of ChatGPT and Google Translate for Pediatric Discharge Instruction Translation.ChatGPT 和谷歌翻译在儿科出院医嘱翻译中的性能。
Pediatrics. 2024 Jul 1;154(1). doi: 10.1542/peds.2023-065573.
2
A Pragmatic Assessment of Google Translate for Emergency Department Instructions.Google 翻译在急诊科医嘱中的实用性评估。
J Gen Intern Med. 2021 Nov;36(11):3361-3365. doi: 10.1007/s11606-021-06666-z. Epub 2021 Mar 5.
3
Evaluating the Accuracy of Google Translate for Diabetes Education Material.评估谷歌翻译在糖尿病教育材料方面的准确性。
JMIR Diabetes. 2016 Jun 28;1(1):e3. doi: 10.2196/diabetes.5848.
4
Accuracy of Google Translate in translating the directions and counseling points for top-selling drugs from English to Arabic, Chinese, and Spanish.谷歌翻译将畅销药物的使用说明和咨询要点从英语翻译成阿拉伯语、中文和西班牙语的准确性。
Am J Health Syst Pharm. 2021 Nov 9;78(22):2053-2058. doi: 10.1093/ajhp/zxab224.
5
Machine vs. Radiologist-Based Translations of RadLex: Implications for Multi-language Report Interoperability.机器翻译与放射科医生翻译 RadLex 之比较:对多语言报告互操作性的影响。
J Digit Imaging. 2022 Jun;35(3):660-665. doi: 10.1007/s10278-022-00597-9. Epub 2022 Feb 15.
6
Data extraction from machine-translated versus original language randomized trial reports: a comparative study.从机器翻译与原语言随机试验报告中提取数据:一项比较研究。
Syst Rev. 2013 Nov 7;2:97. doi: 10.1186/2046-4053-2-97.
7
Translating Discharge Instructions for Limited English-Proficient Families: Strategies and Barriers.为英语水平有限的家庭翻译出院指导:策略与障碍
Hosp Pediatr. 2019 Oct;9(10):779-787. doi: 10.1542/hpeds.2019-0055.
8
Transcending Language Barriers: Can ChatGPT Be the Key to Enhancing Multilingual Accessibility in Health Care?跨越语言障碍:ChatGPT能否成为增强医疗保健多语言可及性的关键?
J Am Coll Radiol. 2024 Dec;21(12):1888-1895. doi: 10.1016/j.jacr.2024.05.009. Epub 2024 Jun 14.
9
Translation into Spanish and field-testing of a new score for evaluating psoriasis severity: The Simplified Psoriasis Index (SPI).西班牙语翻译及新的银屑病严重程度评估评分的现场测试:简化银屑病指数(SPI)。
Actas Dermosifiliogr. 2022 Apr;113(4):363-369. doi: 10.1016/j.ad.2021.11.004. Epub 2021 Nov 19.
10
Performance of an online translation tool when applied to patient educational material.在线翻译工具在患者教育材料中的应用性能。
J Hosp Med. 2011 Nov;6(9):519-25. doi: 10.1002/jhm.898. Epub 2011 Oct 27.

引用本文的文献

1
Evaluation of the accuracy and safety of machine translation of patient-specific discharge instructions: a comparative analysis.患者特定出院指导机器翻译的准确性和安全性评估:一项对比分析。
BMJ Qual Saf. 2025 Jul 9. doi: 10.1136/bmjqs-2024-018384.
2
Evaluating a Large Language Model in Translating Patient Instructions to Spanish Using a Standardized Framework.使用标准化框架评估大型语言模型在将患者指导说明翻译成西班牙语方面的表现。
JAMA Pediatr. 2025 Jul 7. doi: 10.1001/jamapediatrics.2025.1729.
3
Potential role of large language models and personalized medicine to innovate cardiac rehabilitation.
大语言模型和个性化医疗在创新心脏康复中的潜在作用。
World J Clin Cases. 2025 Jul 6;13(19):98095. doi: 10.12998/wjcc.v13.i19.98095.
4
Using AI to Translate and Simplify Spanish Orthopedic Medical Text: Instrument Validation Study.使用人工智能翻译和简化西班牙语骨科医学文本:仪器验证研究。
JMIR AI. 2025 Mar 21;4:e70222. doi: 10.2196/70222.
5
Anatomical variations of the ilioinguinal, iliohypogastric, and genitofemoral nerves: a systematic scoping review of cadaver studies.髂腹股沟神经、髂腹下神经和生殖股神经的解剖变异:尸体研究的系统综述
Hernia. 2025 May 30;29(1):191. doi: 10.1007/s10029-025-03388-7.
6
Artificial Intelligence as a Language Barrier Application in a Simulated Health Care Setting: A Proof-of-Concept Study.人工智能在模拟医疗环境中作为语言障碍应用的概念验证研究。
Pediatr Emerg Care. 2025 Jun 1;41(6):481-485. doi: 10.1097/PEC.0000000000003369. Epub 2025 Mar 4.
7
A proof-of-concept study for patient use of open notes with large language models.一项关于患者使用带有大语言模型的开放病历的概念验证研究。
JAMIA Open. 2025 Apr 9;8(2):ooaf021. doi: 10.1093/jamiaopen/ooaf021. eCollection 2025 Apr.
8
ChatGPT-4o and 4o1 Preview as Dietary Support Tools in a Real-World Medicated Obesity Program: A Prospective Comparative Analysis.ChatGPT-4o和4o1预览版作为现实世界药物性肥胖项目中的饮食支持工具:一项前瞻性比较分析。
Healthcare (Basel). 2025 Mar 16;13(6):647. doi: 10.3390/healthcare13060647.
9
Usability of technological tools to overcome language barriers in healthcare- a scoping review.用于克服医疗保健领域语言障碍的技术工具的可用性——一项范围综述
Arch Public Health. 2025 Feb 25;83(1):52. doi: 10.1186/s13690-025-01543-1.
10
Artificial intelligence in clinical settings: a systematic review of its role in language translation and interpretation.临床环境中的人工智能:对其在语言翻译和口译中作用的系统评价
Ann Transl Med. 2024 Dec 24;12(6):117. doi: 10.21037/atm-24-162. Epub 2024 Dec 17.