• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大语言模型能帮助提高儿科用药剂量的准确性吗?

Can large language models assist with pediatric dosing accuracy?

作者信息

Levin Chedva, Orkaby Brurya, Kerner Erika, Saban Mor

机构信息

Faculty of School of Life and Health Sciences, Nursing Department, The Jerusalem College of Technology-Lev Academic Center, Jerusalem, Israel.

The Department of Vascular Surgery, The Chaim Sheba Medical Center, Tel Hashomer, Ramat Gan, Tel Aviv, Israel.

出版信息

Pediatr Res. 2025 Mar 8. doi: 10.1038/s41390-025-03980-8.

DOI:10.1038/s41390-025-03980-8
PMID:40057653
Abstract

BACKGROUND AND OBJECTIVE

Medication errors in pediatric care remain a significant healthcare challenge despite technological advancements, necessitating innovative approaches. This study aims to evaluate Large Language Models' (LLMs) potential in reducing pediatric medication dosage calculation errors compared to experienced nurses.

METHODS

This cross-sectional study (June-August 2024) involved 101 nurses from pediatric and neonatal departments and three LLMs (ChatGPT-4o, Claude-3.0, Llama 3 8B). Participants completed a nine-question survey on pediatric medication calculations. Primary outcomes were accuracy and response time. Secondary measures included seniority and group membership on accuracy.

RESULTS

Significant differences (P < 0.001) were observed between nurses and LLMs. Nurses averaged 93.14 ± 9.39 accuracy. Claude-3.0 and ChatGPT-4o achieved 100 accuracy, while Llama 3 8B was 66 accurate. LLMs were faster (15.7-75.12 seconds) than nurses (1621.2 ± 8379.3 s). The Generalized Linear Model analysis revealed task performance was significantly influenced by duration (Wald χ² = 27,881.261, p < 0.001) and interaction between relative seniority and group membership (Wald χ² = 3,938.250, p < 0.001), with participants achieving a mean total grade of 91.03 (SD = 13.87).

CONCLUSIONS

Claude-3.0 and ChatGPT-4o demonstrated perfect accuracy and rapid calculation capabilities, showing promise in reducing pediatric medication dosage errors. Further research is needed to explore their integration into practice.

IMPACT

Key Message Large Language Models (LLMs) like ChatGPT-4o and Claude-3.0 demonstrate perfect accuracy and significantly faster response times in pediatric medication dosage calculations, showing potential to reduce errors and save time. Addition to Existing Literature This study provides novel insights by quantitatively comparing LLM performance with experienced nurses, contributing to the understanding of AI's role in improving medication safety. Impact The findings emphasize the value of LLMs as supplemental tools in healthcare, particularly in high-stakes pediatric care, where they can reduce calculation errors and improve clinical efficiency.

摘要

背景与目的

尽管技术不断进步,但儿科护理中的用药错误仍是一个重大的医疗挑战,因此需要创新方法。本研究旨在评估大语言模型(LLMs)与经验丰富的护士相比,在减少儿科用药剂量计算错误方面的潜力。

方法

这项横断面研究(2024年6月至8月)涉及来自儿科和新生儿科的101名护士以及三个大语言模型(ChatGPT-4o、Claude-3.0、Llama 3 8B)。参与者完成了一项关于儿科用药计算的九个问题的调查。主要结果是准确性和响应时间。次要指标包括资历和准确性方面的组成员身份。

结果

护士与大语言模型之间存在显著差异(P < 0.001)。护士的平均准确率为93.14 ± 9.39。Claude-3.0和ChatGPT-4o的准确率达到100%,而Llama 3 8B的准确率为66%。大语言模型比护士更快(15.7 - 75.12秒)(护士为1621.2 ± 8379.3秒)。广义线性模型分析显示,任务表现受到持续时间(Wald χ² = 27,881.261,p < 0.001)以及相对资历和组成员身份之间的交互作用(Wald χ² = 3,938.250,p < 0.001)的显著影响,参与者的平均总分为91.03(标准差 = 13.87)。

结论

Claude-3.0和ChatGPT-4o展示了完美的准确性和快速计算能力,在减少儿科用药剂量错误方面显示出前景。需要进一步研究以探索将它们整合到实践中的方法。

影响

关键信息 ChatGPT-4o和Claude-3.0等大语言模型在儿科用药剂量计算中展示了完美的准确性和显著更快的响应时间,显示出减少错误和节省时间的潜力。对现有文献的补充 本研究通过将大语言模型的性能与经验丰富的护士进行定量比较,提供了新的见解,有助于理解人工智能在提高用药安全性方面的作用。影响 研究结果强调了大语言模型作为医疗保健补充工具(特别是在高风险的儿科护理中,它们可以减少计算错误并提高临床效率) 的价值。

相似文献

1
Can large language models assist with pediatric dosing accuracy?大语言模型能帮助提高儿科用药剂量的准确性吗?
Pediatr Res. 2025 Mar 8. doi: 10.1038/s41390-025-03980-8.
2
Evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System Atlas 5 edition.评估大语言模型在与《乳腺影像报告和数据系统》第5版相关问题上的文本和视觉诊断能力。
Diagn Interv Radiol. 2025 Mar 3;31(2):111-129. doi: 10.4274/dir.2024.242876. Epub 2024 Sep 9.
3
Benchmarking LLM chatbots' oncological knowledge with the Turkish Society of Medical Oncology's annual board examination questions.用土耳其医学肿瘤学会年度委员会考试问题对大型语言模型聊天机器人的肿瘤学知识进行基准测试。
BMC Cancer. 2025 Feb 4;25(1):197. doi: 10.1186/s12885-025-13596-0.
4
Evaluating the Efficacy of Large Language Models in Generating Medical Documentation: A Comparative Study of ChatGPT-4, ChatGPT-4o, and Claude.评估大语言模型在生成医学文档方面的功效:ChatGPT-4、ChatGPT-4o和Claude的比较研究
Aesthetic Plast Surg. 2025 Apr 14. doi: 10.1007/s00266-025-04842-8.
5
Accuracy of Large Language Models for Infective Endocarditis Prophylaxis in Dental Procedures.大型语言模型在牙科手术中预防感染性心内膜炎的准确性。
Int Dent J. 2025 Feb;75(1):206-212. doi: 10.1016/j.identj.2024.09.033. Epub 2024 Oct 12.
6
Assessment of decision-making with locally run and web-based large language models versus human board recommendations in otorhinolaryngology, head and neck surgery.在耳鼻喉科、头颈外科中,评估本地运行和基于网络的大语言模型与人类委员会建议的决策情况。
Eur Arch Otorhinolaryngol. 2025 Mar;282(3):1593-1607. doi: 10.1007/s00405-024-09153-3. Epub 2025 Jan 10.
7
Can large language models be new supportive tools in coronary computed tomography angiography reporting?大语言模型能否成为冠状动脉 CT 血管造影报告的新辅助工具?
Clin Imaging. 2024 Oct;114:110271. doi: 10.1016/j.clinimag.2024.110271. Epub 2024 Aug 31.
8
Unlocking the potential of advanced large language models in medication review and reconciliation: A proof-of-concept investigation.挖掘先进大语言模型在用药审查与核对中的潜力:一项概念验证研究。
Explor Res Clin Soc Pharm. 2024 Aug 17;15:100492. doi: 10.1016/j.rcsop.2024.100492. eCollection 2024 Sep.
9
Assessing the feasibility of ChatGPT-4o and Claude 3-Opus in thyroid nodule classification based on ultrasound images.评估ChatGPT-4o和Claude 3-Opus基于超声图像进行甲状腺结节分类的可行性。
Endocrine. 2025 Mar;87(3):1041-1049. doi: 10.1007/s12020-024-04066-x. Epub 2024 Oct 11.
10
AI in Home Care-Evaluation of Large Language Models for Future Training of Informal Caregivers: Observational Comparative Case Study.家庭护理中的人工智能——对用于未来非正式护理人员培训的大语言模型的评估:观察性比较案例研究
J Med Internet Res. 2025 Apr 28;27:e70703. doi: 10.2196/70703.

引用本文的文献

1
Artificial intelligence in pediatric healthcare: current applications, potential, and implementation considerations.人工智能在儿科医疗保健中的应用:当前应用、潜力及实施考量
Clin Exp Pediatr. 2025 Sep;68(9):641-651. doi: 10.3345/cep.2025.00962. Epub 2025 Jun 25.

本文引用的文献

1
Large language models in health care: Development, applications, and challenges.医疗保健领域的大语言模型:发展、应用与挑战。
Health Care Sci. 2023 Jul 24;2(4):255-263. doi: 10.1002/hcs2.61. eCollection 2023 Aug.
2
Measuring the Impact of AI in the Diagnosis of Hospitalized Patients: A Randomized Clinical Vignette Survey Study.测量人工智能在住院患者诊断中的影响:一项随机临床病例调查研究。
JAMA. 2023 Dec 19;330(23):2275-2284. doi: 10.1001/jama.2023.22295.
3
Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine.
注意力并非全部所需:在医疗保健和医学中使用大型语言模型所涉及的复杂伦理问题。
EBioMedicine. 2023 Apr;90:104512. doi: 10.1016/j.ebiom.2023.104512. Epub 2023 Mar 15.
4
To explain or not to explain?-Artificial intelligence explainability in clinical decision support systems.解释还是不解释?——临床决策支持系统中的人工智能可解释性
PLOS Digit Health. 2022 Feb 17;1(2):e0000016. doi: 10.1371/journal.pdig.0000016. eCollection 2022 Feb.
5
Impact of Implementing Electronic Health Records on Medication Safety at an HIMSS Stage 6 Hospital: The Pharmacist's Perspective.在一家医疗卫生信息与管理系统协会(HIMSS)6级医院实施电子健康记录对用药安全的影响:药剂师的观点
Can J Hosp Pharm. 2022 Oct 3;75(4):267-275. doi: 10.4212/cjhp.3223. eCollection 2022 Fall.
6
Medication Errors in Pediatrics: Proposals to Improve the Quality and Safety of Care Through Clinical Risk Management.儿科用药错误:通过临床风险管理提高医疗质量和安全性的建议。
Front Med (Lausanne). 2022 Jan 14;8:814100. doi: 10.3389/fmed.2021.814100. eCollection 2021.
7
Prevalence of Medication Errors Among Paediatric Inpatients: Systematic Review and Meta-Analysis.儿科住院患者用药错误的发生率:系统评价和荟萃分析。
Drug Saf. 2019 Nov;42(11):1329-1342. doi: 10.1007/s40264-019-00850-1.
8
Errors and causes of communication failures from hospital information systems to electronic health record: A record-review study.从医院信息系统到电子健康记录的沟通失败的错误和原因:一项记录回顾研究。
Int J Med Inform. 2018 Nov;119:47-53. doi: 10.1016/j.ijmedinf.2018.09.004. Epub 2018 Sep 6.
9
Interventions to reduce pediatric medication errors: a systematic review.减少儿科用药错误的干预措施:一项系统综述
Pediatrics. 2014 Aug;134(2):338-60. doi: 10.1542/peds.2013-3531. Epub 2014 Jul 14.
10
Frequency of pediatric medication administration errors and contributing factors.儿科用药错误的发生率及相关因素。
J Nurs Care Qual. 2011 Apr-Jun;26(2):136-43. doi: 10.1097/NCQ.0b013e3182031006.