• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

比较 ChatGPT 和一位麻醉医生对常见患者问题的回答:对一组麻醉医生进行的探索性横断面调查。

Comparing ChatGPT and a Single Anesthesiologist's Responses to Common Patient Questions: An Exploratory Cross-Sectional Survey of a Panel of Anesthesiologists.

机构信息

Department of Anesthesia and Pain Medicine, Johns Hopkins All Children's Hospital, 601 5th St South, Suite C725, St Petersburg, FL, 33701, USA.

Epidemiology and Biostatistics Shared Resource, Institute for Clinical and Translational Research, Johns Hopkins All Children's Hospital, St Petersburg, FL, USA.

出版信息

J Med Syst. 2024 Aug 22;48(1):77. doi: 10.1007/s10916-024-02100-z.

DOI:10.1007/s10916-024-02100-z
PMID:39172169
Abstract

Increased patient access to electronic medical records and resources has resulted in higher volumes of health-related questions posed to clinical staff, while physicians' rising clinical workloads have resulted in less time for comprehensive, thoughtful responses to patient questions. Artificial intelligence chatbots powered by large language models (LLMs) such as ChatGPT could help anesthesiologists efficiently respond to electronic patient inquiries, but their ability to do so is unclear. A cross-sectional exploratory survey-based study comprised of 100 anesthesia-related patient question/response sets based on two fictitious simple clinical scenarios was performed. Each question was answered by an independent board-certified anesthesiologist and ChatGPT (GPT-3.5 model, August 3, 2023 version). The responses were randomized and evaluated via survey by three blinded board-certified anesthesiologists for various quality and empathy measures. On a 5-point Likert scale, ChatGPT received similar overall quality ratings (4.2 vs. 4.1, p = .81) and significantly higher overall empathy ratings (3.7 vs. 3.4, p < .01) compared to the anesthesiologist. ChatGPT underperformed the anesthesiologist regarding rate of responses in agreement with scientific consensus (96.6% vs. 99.3%, p = .02) and possibility of harm (4.7% vs. 1.7%, p = .04), but performed similarly in other measures (percentage of responses with inappropriate/incorrect information (5.7% vs. 2.7%, p = .07) and missing information (10.0% vs. 7.0%, p = .19)). In conclusion, LLMs show great potential in healthcare, but additional improvement is needed to decrease the risk of patient harm and reduce the need for close physician oversight. Further research with more complex clinical scenarios, clinicians, and live patients is necessary to validate their role in healthcare.

摘要

患者对电子病历和资源的访问增加导致向临床工作人员提出的与健康相关的问题数量增加,而医生不断增加的临床工作量导致他们用于全面、深思熟虑地回答患者问题的时间减少。由大型语言模型(LLM)驱动的人工智能聊天机器人(如 ChatGPT)可以帮助麻醉师高效地回复电子患者咨询,但它们是否有能力做到这一点尚不清楚。进行了一项基于横断面探索性调查的研究,该研究由基于两个虚构简单临床场景的 100 个麻醉相关患者问题/回答集组成。每个问题都由一名独立的、经过 board 认证的麻醉师和 ChatGPT(GPT-3.5 模型,2023 年 8 月 3 日版本)回答。通过三名经过 board 认证的麻醉师进行的随机调查评估了这些回答,以评估各种质量和同理心指标。在 5 分制的 Likert 量表上,ChatGPT 获得了类似的整体质量评分(4.2 与 4.1,p=0.81)和明显更高的整体同理心评分(3.7 与 3.4,p<0.01)与麻醉师相比。ChatGPT 在与科学共识一致的回答率(96.6% 与 99.3%,p=0.02)和可能造成伤害的可能性(4.7% 与 1.7%,p=0.04)方面表现不如麻醉师,但在其他指标上表现相似(回答中存在不适当/不正确信息的百分比(5.7% 与 2.7%,p=0.07)和缺少信息的百分比(10.0% 与 7.0%,p=0.19))。总之,大型语言模型在医疗保健领域具有巨大潜力,但需要进一步改进,以降低患者伤害的风险并减少对医生密切监督的需求。需要进行更多具有复杂临床场景、临床医生和真实患者的研究,以验证它们在医疗保健中的作用。

相似文献

1
Comparing ChatGPT and a Single Anesthesiologist's Responses to Common Patient Questions: An Exploratory Cross-Sectional Survey of a Panel of Anesthesiologists.比较 ChatGPT 和一位麻醉医生对常见患者问题的回答:对一组麻醉医生进行的探索性横断面调查。
J Med Syst. 2024 Aug 22;48(1):77. doi: 10.1007/s10916-024-02100-z.
2
"Doctor ChatGPT, Can You Help Me?" The Patient's Perspective: Cross-Sectional Study.“医生 ChatGPT,你能帮我吗?”患者视角:横断面研究。
J Med Internet Res. 2024 Oct 1;26:e58831. doi: 10.2196/58831.
3
Physician Versus Large Language Model Chatbot Responses to Web-Based Questions From Autistic Patients in Chinese: Cross-Sectional Comparative Analysis.中文自闭症患者网络问诊中,医生与大型语言模型聊天机器人回复的对比分析:横断面研究。
J Med Internet Res. 2024 Apr 30;26:e54706. doi: 10.2196/54706.
4
Is the information provided by large language models valid in educating patients about adolescent idiopathic scoliosis? An evaluation of content, clarity, and empathy : The perspective of the European Spine Study Group.大语言模型提供的信息在对患者进行青少年特发性脊柱侧凸教育方面是否有效?内容、清晰度和同理心的评估:欧洲脊柱研究小组的观点
Spine Deform. 2025 Mar;13(2):361-372. doi: 10.1007/s43390-024-00955-3. Epub 2024 Nov 4.
5
Performance of Large Language Models in Patient Complaint Resolution: Web-Based Cross-Sectional Survey.大型语言模型在患者投诉解决中的表现:基于网络的横断面调查。
J Med Internet Res. 2024 Aug 9;26:e56413. doi: 10.2196/56413.
6
Large Language Models and Empathy: Systematic Review.大语言模型与同理心:系统综述
J Med Internet Res. 2024 Dec 11;26:e52597. doi: 10.2196/52597.
7
A Comparative Study of Responses to Retina Questions from Either Experts, Expert-Edited Large Language Models, or Expert-Edited Large Language Models Alone.专家、经过专家编辑的大语言模型或仅经过专家编辑的大语言模型对视网膜问题回答的比较研究。
Ophthalmol Sci. 2024 Feb 6;4(4):100485. doi: 10.1016/j.xops.2024.100485. eCollection 2024 Jul-Aug.
8
The performance of artificial intelligence chatbot large language models to address skeletal biology and bone health queries.人工智能聊天机器人大型语言模型在解决骨骼生物学和骨骼健康问题方面的表现。
J Bone Miner Res. 2024 Mar 22;39(2):106-115. doi: 10.1093/jbmr/zjad007.
9
Artificial Intelligence for Anesthesiology Board-Style Examination Questions: Role of Large Language Models.人工智能在麻醉学 board 式考试问题中的应用:大语言模型的作用。
J Cardiothorac Vasc Anesth. 2024 May;38(5):1251-1259. doi: 10.1053/j.jvca.2024.01.032. Epub 2024 Feb 1.
10
Evaluation of the quality and quantity of artificial intelligence-generated responses about anesthesia and surgery: using ChatGPT 3.5 and 4.0.关于麻醉和手术的人工智能生成回复的质量和数量评估:使用ChatGPT 3.5和4.0
Front Med (Lausanne). 2024 Jul 11;11:1400153. doi: 10.3389/fmed.2024.1400153. eCollection 2024.

引用本文的文献

1
Comparison of Multiple State-of-the-Art Large Language Models for Patient Education Prior to CT and MRI Examinations.CT和MRI检查前用于患者教育的多种先进大语言模型的比较
J Pers Med. 2025 Jun 5;15(6):235. doi: 10.3390/jpm15060235.
2
ChatOCT: Embedded Clinical Decision Support Systems for Optical Coherence Tomography in Offline and Resource-Limited Settings.ChatOCT:用于离线和资源受限环境下光学相干断层扫描的嵌入式临床决策支持系统
J Med Syst. 2025 May 7;49(1):59. doi: 10.1007/s10916-025-02188-x.

本文引用的文献

1
Could ChatGPT-4 pass an anaesthesiology board examination? Follow-up assessment of a comprehensive set of board examination practice questions.ChatGPT-4能通过麻醉学委员会考试吗?对一套全面的委员会考试练习题的后续评估。
Br J Anaesth. 2024 Jan;132(1):172-174. doi: 10.1016/j.bja.2023.10.025. Epub 2023 Nov 22.
2
Quantitative evaluation of ChatGPT versus Bard responses to anaesthesia-related queries.ChatGPT与Bard对麻醉相关问题回答的定量评估。
Br J Anaesth. 2024 Jan;132(1):169-171. doi: 10.1016/j.bja.2023.09.030. Epub 2023 Nov 7.
3
Creation and Adoption of Large Language Models in Medicine.
医学领域中大型语言模型的创建与采用。
JAMA. 2023 Sep 5;330(9):866-869. doi: 10.1001/jama.2023.14217.
4
Large language models in medicine.医学中的大型语言模型。
Nat Med. 2023 Aug;29(8):1930-1940. doi: 10.1038/s41591-023-02448-8. Epub 2023 Jul 17.
5
Large language models encode clinical knowledge.大语言模型编码临床知识。
Nature. 2023 Aug;620(7972):172-180. doi: 10.1038/s41586-023-06291-2. Epub 2023 Jul 12.
6
Assessment of ChatGPT success with specialty medical knowledge using anaesthesiology board examination practice questions.使用麻醉学委员会考试练习题评估ChatGPT在专业医学知识方面的表现。
Br J Anaesth. 2023 Aug;131(2):e31-e34. doi: 10.1016/j.bja.2023.04.017. Epub 2023 May 18.
7
Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum.比较医生和人工智能聊天机器人对发布在公共社交媒体论坛上的患者问题的回复。
JAMA Intern Med. 2023 Jun 1;183(6):589-596. doi: 10.1001/jamainternmed.2023.1838.
8
Benefits, Limits, and Risks of GPT-4 as an AI Chatbot for Medicine.GPT-4作为医学人工智能聊天机器人的益处、局限性和风险
N Engl J Med. 2023 Mar 30;388(13):1233-1239. doi: 10.1056/NEJMsr2214184.
9
How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment.ChatGPT在美国医师执照考试(USMLE)中的表现如何?大语言模型对医学教育和知识评估的影响。
JMIR Med Educ. 2023 Feb 8;9:e45312. doi: 10.2196/45312.
10
Patient Safety Issues From Information Overload in Electronic Medical Records.电子病历信息过载引发的患者安全问题。
J Patient Saf. 2022 Sep 1;18(6):e999-e1003. doi: 10.1097/PTS.0000000000001002. Epub 2022 Apr 7.