• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一项关于人工智能机器翻译工具的系统性多模态评估,以促进在国际上获取重症监护教育资源。

A systematic multimodal assessment of AI machine translation tools for enhancing access to critical care education internationally.

作者信息

Chen Christine L, Dong Yue, Castillo-Zambrano Claudia, Bencheqroun Hassan, Barwise Amelia, Hoffman Adria, Nalaie Keivan, Qiu Yishu, Boulekbache Oualid, Niven Alexander S

机构信息

Division of Internal Medicine, Mayo Clinic, 200 First St. SW, Rochester, MN, 55905, USA.

Division of Pulmonary and Critical Care Medicine, Mayo Clinic, 200 First St. SW, Rochester, MN, 55905, USA.

出版信息

BMC Med Educ. 2025 Jul 8;25(1):1022. doi: 10.1186/s12909-025-07452-9.

DOI:10.1186/s12909-025-07452-9
PMID:40629322
Abstract

BACKGROUND

Language barriers pose a significant barrier to expanding access to critical care education worldwide. Machine translation (MT) offers significant promise to increase accessibility to critical care content, and has rapidly evolved using newer artificial intelligence frameworks and large language models. The best approach to systematically apply and evaluate these tools, however, remains unclear.

METHODS

We developed a multimodal method to evaluate translations of critical care content used as part of an established international critical care education program. Four freely-available MT tools were selected (DeepL™, Google Gemini™, Google Translate™, Microsoft CoPilot™) and used to translate selected phrases and paragraphs into Chinese (Mandarin), Spanish, and Ukrainian. A human translation performed by a professional medical translator was used for comparison. These translations were compared using 1) blinded bilingual clinician evaluations using anchored Likert domains of fluency, adequacy, and meaning; 2) automated BiLingual Evaluation Understudy (BLEU) scores; and 3) validated system usability scale to assess the ease of use of MT tools. Blinded bilingual clinician evaluations were calculated as individual domains and averaged composite scores.

RESULTS

Blinded clinician composite scores were highest for human translation (Chinese), Google Gemini (Spanish), and Microsoft CoPilot (Ukrainian). Microsoft CoPilot (Chinese) and Google Translate (Spanish and Ukrainian) earned the lowest scores. All Chinese and Spanish versions received "understandable to good" or "high quality" BLEU scores, while Ukrainian overall scored "hard to get the gist" except using Microsoft CoPilot. Usability scores were highest with DeepL (Chinese), Google Gemini (Spanish), and Google Translate (Ukrainian), and lower with Microsoft CoPilot (Chinese and Ukrainian) and Google Translate (Spanish).

CONCLUSION

No single MT tool performed best across all metrics and languages, highlighting the importance of routine assessment of these tools during educational activities given their rapid ongoing evolution. We offer a multimodal evaluation methodology to aid this assessment as medical educators expand their use of MT in international educational programs.

摘要

背景

语言障碍对在全球范围内扩大重症监护教育的可及性构成了重大障碍。机器翻译(MT)为增加重症监护内容的可及性带来了巨大希望,并且已使用更新的人工智能框架和大语言模型迅速发展。然而,系统应用和评估这些工具的最佳方法仍不明确。

方法

我们开发了一种多模态方法,以评估作为既定国际重症监护教育计划一部分使用的重症监护内容的翻译。选择了四个免费的机器翻译工具(DeepL™、谷歌Gemini™、谷歌翻译™、微软Copilot™),并用于将选定的短语和段落翻译成中文(普通话)、西班牙语和乌克兰语。由专业医学翻译人员进行的人工翻译用于比较。使用以下方法对这些翻译进行比较:1)采用流畅性、充分性和意义的锚定李克特量表进行盲法双语临床医生评估;2)自动双语评估替代指标(BLEU)得分;3)经过验证的系统可用性量表,以评估机器翻译工具的易用性。盲法双语临床医生评估按各个领域计算,并得出平均综合得分。

结果

人工翻译(中文)、谷歌Gemini(西班牙语)和微软Copilot(乌克兰语)的盲法临床医生综合得分最高。微软Copilot(中文)以及谷歌翻译(西班牙语和乌克兰语)得分最低。所有中文和西班牙语版本的BLEU得分均为“可理解到良好”或“高质量”,而乌克兰语版本总体得分“难以理解主旨”,不过使用微软Copilot时除外。DeepL(中文)、谷歌Gemini(西班牙语)和谷歌翻译(乌克兰语)的可用性得分最高,微软Copilot(中文和乌克兰语)以及谷歌翻译(西班牙语)的可用性得分较低。

结论

没有单一的机器翻译工具在所有指标和语言上都表现最佳,鉴于这些工具正在迅速发展,这凸显了在教育活动中对其进行常规评估的重要性。随着医学教育工作者在国际教育计划中扩大对机器翻译的使用,我们提供了一种多模态评估方法来辅助这一评估。

相似文献

1
A systematic multimodal assessment of AI machine translation tools for enhancing access to critical care education internationally.一项关于人工智能机器翻译工具的系统性多模态评估,以促进在国际上获取重症监护教育资源。
BMC Med Educ. 2025 Jul 8;25(1):1022. doi: 10.1186/s12909-025-07452-9.
2
Using AI to Translate and Simplify Spanish Orthopedic Medical Text: Instrument Validation Study.使用人工智能翻译和简化西班牙语骨科医学文本:仪器验证研究。
JMIR AI. 2025 Mar 21;4:e70222. doi: 10.2196/70222.
3
Performance of 3 Conversational Generative Artificial Intelligence Models for Computing Maximum Safe Doses of Local Anesthetics: Comparative Analysis.用于计算局部麻醉药最大安全剂量的3种对话式生成人工智能模型的性能:比较分析
JMIR AI. 2025 May 13;4:e66796. doi: 10.2196/66796.
4
Evaluation of the accuracy and safety of machine translation of patient-specific discharge instructions: a comparative analysis.患者特定出院指导机器翻译的准确性和安全性评估:一项对比分析。
BMJ Qual Saf. 2025 Jul 9. doi: 10.1136/bmjqs-2024-018384.
5
Evaluating a Large Language Model in Translating Patient Instructions to Spanish Using a Standardized Framework.使用标准化框架评估大型语言模型在将患者指导说明翻译成西班牙语方面的表现。
JAMA Pediatr. 2025 Jul 7. doi: 10.1001/jamapediatrics.2025.1729.
6
Leveraging Generative AI to Overcome Language Barriers in Healthcare.利用生成式人工智能克服医疗保健中的语言障碍。
Stud Health Technol Inform. 2025 Jun 26;328:86-90. doi: 10.3233/SHTI250678.
7
Artificial Intelligence Shows Limited Success in Improving Readability Levels of Spanish-language Orthopaedic Patient Education Materials.人工智能在提高西班牙语骨科患者教育材料的可读性方面成效有限。
Clin Orthop Relat Res. 2025 Feb 11. doi: 10.1097/CORR.0000000000003413.
8
Can Artificial Intelligence Improve the Readability of Patient Education Materials?人工智能能否提高患者教育材料的可读性?
Clin Orthop Relat Res. 2023 Nov 1;481(11):2260-2267. doi: 10.1097/CORR.0000000000002668. Epub 2023 Apr 28.
9
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
10
Melatonin for the promotion of sleep in adults in the intensive care unit.褪黑素用于促进重症监护病房成年患者的睡眠。
Cochrane Database Syst Rev. 2018 May 10;5(5):CD012455. doi: 10.1002/14651858.CD012455.pub2.

本文引用的文献

1
Assessing GPT and DeepL for terminology translation in the medical domain: A comparative study on the human phenotype ontology.评估GPT和DeepL在医学领域术语翻译中的表现:关于人类表型本体的比较研究。
BMC Med Inform Decis Mak. 2025 Jul 1;25(1):237. doi: 10.1186/s12911-025-03075-8.
2
Translation of Ontological Concepts from English into German Using Commercial Translation Software and Expert Evaluation.使用商业翻译软件和专家评估将本体论概念从英语翻译成德语。
Stud Health Technol Inform. 2024 Jan 25;310:89-93. doi: 10.3233/SHTI230933.
3
The Use of Machine Translation for Outreach and Health Communication in Epidemiology and Public Health: Scoping Review.
机器翻译在流行病学和公共卫生中的外展和健康传播中的应用:范围综述。
JMIR Public Health Surveill. 2023 Nov 20;9:e50814. doi: 10.2196/50814.
4
Opportunities, Challenges, and Future Directions of Generative Artificial Intelligence in Medical Education: Scoping Review.生成式人工智能在医学教育中的机遇、挑战与未来方向:范围综述
JMIR Med Educ. 2023 Oct 20;9:e48785. doi: 10.2196/48785.
5
Risks and Benefits of Large Language Models for the Environment.大型语言模型对环境的风险与益处
Environ Sci Technol. 2023 Mar 7;57(9):3464-3466. doi: 10.1021/acs.est.3c01106. Epub 2023 Feb 23.
6
Global critical care: a call to action.全球危重病护理:行动呼吁。
Crit Care. 2023 Jan 20;27(1):28. doi: 10.1186/s13054-022-04296-3.
7
Automatic Generation of German Translation Candidates for SNOMED CT Textual Descriptions.医学系统命名法临床术语(SNOMED CT)文本描述的德语翻译候选自动生成
Stud Health Technol Inform. 2021 May 27;281:178-182. doi: 10.3233/SHTI210144.
8
Checklist for Early Recognition and Treatment of Acute Illness and Injury: An Exploratory Multicenter International Quality-Improvement Study in the ICUs With Variable Resources.急性疾病和损伤早期识别与治疗核对清单:资源多变的 ICU 中一项探索性多中心国际质量改进研究。
Crit Care Med. 2021 Jun 1;49(6):e598-e612. doi: 10.1097/CCM.0000000000004937.
9
Disadvantages in preparing and publishing scientific papers caused by the dominance of the English language in science: The case of Colombian researchers in biological sciences.由于英语在科学领域的主导地位而导致的科学论文编写和发表方面的劣势:以哥伦比亚生物科学研究人员为例。
PLoS One. 2020 Sep 16;15(9):e0238372. doi: 10.1371/journal.pone.0238372. eCollection 2020.
10
Digital transformation in healthcare - architectures of present and future information technologies.医疗保健领域的数字化转型——现有和未来信息技术的架构。
Clin Chem Lab Med. 2019 Feb 25;57(3):328-335. doi: 10.1515/cclm-2018-0658.