The evaluation of the performance of ChatGPT in the management of labor analgesia.

Affiliations

Department of Anesthesiology, El Camino Health, 2500 Grant Road, Mountain View, California 94040, USA.

Department of Anesthesiology, Perioperative and Pain Medicine, Stanford University School of Medicine, 300 Pasteur Drive, Room H3580, MC 5640, Stanford 94305, CA, USA.

Publication information

J Clin Anesth. 2024 Nov;98:111582. doi: 10.1016/j.jclinane.2024.111582. Epub 2024 Aug 20.

DOI: 10.1016/j.jclinane.2024.111582
PMID: 39167880

Abstract

ChatGPT4 is a leading large language model (LLM) chatbot released by OpenAI in 2023. ChatGPT4 can respond to free-text queries, answer questions and make suggestions regarding virtually any topic. ChatGPT4 has successfully answered anesthesia and even obstetric anesthesia knowledge-based questions with reasonable accuracy. However, ChatGPT4 has yet to be challenged in obstetric anesthesia clinical decision-making.

STUDY OBJECTIVE

In this study, we evaluated the performance of ChatGPT4 in the management of clinical labor analgesia scenarios compared to expert obstetric anesthesiologists.

INTERVENTION

Eight clinical questions with progressively increasing medical complexity were posed to ChatGPT4.

MEASUREMENTS

The ChatGPT4 responses were rated by seven expert obstetric anesthesiologists based on safety, accuracy and completeness of each response using a five-point Likert rating scale.

MAIN RESULTS

ChatGPT4 was deemed safe in 73% of responses to the presented obstetric anesthesia clinical scenarios (27% of responses were deemed unsafe). None of the ChatGPT4 responses were unanimously deemed to be safe by all seven expert obstetric anesthesiologists. Moreover, ChatGPT4 responses were overall partly accurate (score 4 out of 5) and somewhat incomplete (score 3.5 out of 5).

CONCLUSIONS

In summary, approximately one quarter of all responses by ChatGPT4 were deemed unsafe by expert obstetric anesthesiologists. These findings may suggest the need for more fine-tuning and training of LLMs such as ChatGPT4 specifically for clinical decision making in obstetric anesthesia or other specialized medical fields. These LLMs may come to play an important future role in assisting obstetric anesthesiologists in clinical decision making and enhancing overall patient care.
