Automating Responses to Patient Portal Messages Using Generative AI

Authors

Kaur Amarpreet, Budko Alexander, Liu Katrina, Eaton Eric, Steitz Bryan D, Johnson Kevin B

Affiliations

Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania.

School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania.

Publication information

Appl Clin Inform. 2025 May;16(3):718-731. doi: 10.1055/a-2565-9155. Epub 2025 Mar 25.

DOI:10.1055/a-2565-9155

PMID:40132987

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12310298/

Abstract

Patient portals bridge patient and provider communications but exacerbate physician and nursing burnout. Large language models (LLMs) can generate message responses that are viewed favorably by health care professionals/providers (HCPs); however, prior studies have not included diverse message types or new prompt-engineering strategies.

Our goal is to investigate and compare the quality and precision of GPT-generated message responses versus real doctor responses across the spectrum of message types within a patient portal.

We used prompt-engineering techniques to craft synthetic provider responses tailored to adult primary care patients. We enrolled a sample of primary care providers in a cross-sectional study to compare authentic patient portal message responses with synthetic ones generated by GPT-3.5-turbo, July 2023 version (GPT). The survey assessed each response's empathy, relevance, medical accuracy, and readability on a scale from 0 to 5. Respondents were also asked to identify which responses were GPT-generated and which were provider-generated. Mean scores for all metrics were computed for subsequent analysis.

A total of 49 HCPs participated in the survey (59% completion rate), comprising 16 physicians and 32 advanced practice providers (APPs). Compared with responses written by real doctors, GPT-generated responses scored significantly higher on two of the four metrics: empathy (p < 0.05) and readability (p < 0.05). No statistically significant difference was observed for relevance or accuracy (p > 0.05). Although readability scores were significantly different, the absolute difference was small, and the clinical significance of this finding remains uncertain.

Our findings affirm the potential of GPT-generated message responses to achieve levels of empathy, relevance, and readability comparable to those of typical responses crafted by HCPs. Additional studies should be done within provider workflows, with careful evaluation of patient attitudes and concerns about the ethics and quality of generated responses in all settings.
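The abstract states that mean scores were computed for each of the four metrics. A minimal Python sketch of that step follows. The record layout, the function names, and every rating value are hypothetical and made up for illustration; they are not data from the study, and the statistical test behind the reported p-values is not named in the abstract, so none is shown.

```python
from statistics import mean

# The four metrics named in the abstract, each rated on a 0-5 scale.
METRICS = ("empathy", "relevance", "accuracy", "readability")

# Hypothetical ratings, invented for illustration only (NOT study data).
# "source" marks whether the rated response was GPT- or provider-written.
ratings = [
    {"source": "gpt", "empathy": 4, "relevance": 4, "accuracy": 4, "readability": 5},
    {"source": "gpt", "empathy": 5, "relevance": 3, "accuracy": 4, "readability": 4},
    {"source": "provider", "empathy": 3, "relevance": 4, "accuracy": 4, "readability": 4},
    {"source": "provider", "empathy": 4, "relevance": 4, "accuracy": 5, "readability": 3},
]


def mean_scores(rows, source):
    """Return the mean rating per metric for one response source."""
    subset = [row for row in rows if row["source"] == source]
    return {metric: mean(row[metric] for row in subset) for metric in METRICS}


def mean_differences(rows):
    """Return mean(GPT) minus mean(provider) for each metric."""
    gpt = mean_scores(rows, "gpt")
    provider = mean_scores(rows, "provider")
    return {metric: gpt[metric] - provider[metric] for metric in METRICS}


if __name__ == "__main__":
    for metric, diff in mean_differences(ratings).items():
        print(f"{metric}: {diff:+.2f}")
```

A positive difference means the GPT responses received a higher mean rating on that metric in the toy data. Judging whether such a difference is significant would require a test suited to the survey design, which this sketch leaves out.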
