• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开发一个由GPT-4驱动的虚拟模拟患者和沟通训练平台,供医学生练习与患者讨论异常乳房X光检查结果:多阶段研究。

Development of a GPT-4-Powered Virtual Simulated Patient and Communication Training Platform for Medical Students to Practice Discussing Abnormal Mammogram Results With Patients: Multiphase Study.

作者信息

Weisman Dan, Sugarman Alanna, Huang Yue Ming, Gelberg Lillian, Ganz Patricia A, Comulada Warren Scott

机构信息

UCLA Simulation Center, University of California, Los Angeles, Los Angeles, CA, United States.

David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, United States.

出版信息

JMIR Form Res. 2025 Apr 17;9:e65670. doi: 10.2196/65670.

DOI:10.2196/65670
PMID:40246299
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12046251/
Abstract

BACKGROUND

Standardized patients (SPs) prepare medical students for difficult conversations with patients. Despite their value, SP-based simulation training is constrained by available resources and competing clinical demands. Researchers are turning to artificial intelligence and large language models, such as generative pretrained transformers, to create communication training that incorporates virtual simulated patients (VSPs). GPT-4 is a large language model advance allowing developers to design virtual simulation scenarios using text-based prompts instead of relying on branching path simulations with prescripted dialogue. These nascent developmental practices have not taken root in the literature to guide other researchers in developing their own simulations.

OBJECTIVE

This study aims to describe our developmental process and lessons learned for creating a GPT-4-driven VSP. We designed the VSP to help medical student learners rehearse discussing abnormal mammography results with a patient as a primary care physician (PCP). We aimed to assess GPT-4's ability to generate appropriate VSP responses to learners during spoken conversations and provide appropriate feedback on learner performance.

METHODS

A research team comprised of physicians, a medical student, an educator, an SP program director, a learning experience designer, and a health care researcher conducted the study. A formative phase with in-depth knowledge user interviews informed development, followed by a development phase to create the virtual training module. The team conducted interviews with 5 medical students, 5 PCPs, and 5 breast cancer survivors. They then developed a VSP using simulation authoring software and provided the GPT-4-enabled VSP with an initial prompt consisting of a scenario description, emotional state, and expectations for learner dialogue. It was iteratively refined through an agile design process involving repeated cycles of testing, documenting issues, and revising the prompt. As an exploratory feature, the simulation used GPT-4 to provide written feedback to learners about their performance communicating with the VSP and their adherence to guidelines for difficult conversations.

RESULTS

In-depth interviews helped establish the appropriate timing, mode of communication, and protocol for conversations between PCPs and patients during the breast cancer screening process. The scenario simulated a telephone call between a physician and patient to discuss the abnormal results of a diagnostic mammogram that that indicated a need for a biopsy. Preliminary testing was promising. The VSP asked sensible questions about their mammography results and responded to learner inquiries using a voice replete with appropriate emotional inflections. GPT-4 generated performance feedback that successfully identified strengths and areas for improvement using relevant quotes from the learner-VSP conversation, but it occasionally misidentified learner adherence to communication protocols.

CONCLUSIONS

GPT-4 streamlined development and facilitated more dynamic, humanlike interactions between learners and the VSP compared to branching path simulations. For the next steps, we will pilot-test the VSP with medical students to evaluate its feasibility and acceptability.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e563/12046251/db32bfead503/formative_v9i1e65670_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e563/12046251/2f77b3471ed5/formative_v9i1e65670_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e563/12046251/14bcfcafd502/formative_v9i1e65670_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e563/12046251/db32bfead503/formative_v9i1e65670_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e563/12046251/2f77b3471ed5/formative_v9i1e65670_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e563/12046251/14bcfcafd502/formative_v9i1e65670_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e563/12046251/db32bfead503/formative_v9i1e65670_fig3.jpg
摘要

背景

标准化病人(SPs)帮助医学生为与患者进行困难对话做好准备。尽管它们具有价值,但基于标准化病人的模拟培训受到可用资源和相互竞争的临床需求的限制。研究人员正在转向人工智能和大语言模型,如生成式预训练变换器,以创建包含虚拟模拟病人(VSPs)的沟通培训。GPT-4是一种先进的大语言模型,使开发者能够使用基于文本的提示来设计虚拟模拟场景,而不是依赖带有预设对话的分支路径模拟。这些新兴的开发实践尚未在文献中扎根,无法指导其他研究人员开发自己的模拟。

目的

本研究旨在描述我们创建由GPT-4驱动的虚拟模拟病人的开发过程和经验教训。我们设计虚拟模拟病人是为了帮助医学生学习者作为初级保健医生(PCP)排练与患者讨论异常乳房X光检查结果的过程。我们旨在评估GPT-4在口语对话中对学习者生成适当的虚拟模拟病人反应的能力,并对学习者的表现提供适当的反馈。

方法

一个由医生、一名医学生、一名教育工作者、一名标准化病人项目主任、一名学习体验设计师和一名医疗保健研究人员组成的研究团队进行了这项研究。一个形成阶段,通过深入的知识用户访谈为开发提供信息,随后是一个创建虚拟培训模块的开发阶段。该团队采访了5名医学生、5名初级保健医生和5名乳腺癌幸存者。然后,他们使用模拟创作软件开发了一个虚拟模拟病人,并为启用GPT-4的虚拟模拟病人提供了一个初始提示,包括场景描述、情绪状态和对学习者对话的期望。通过一个敏捷设计过程进行迭代优化,该过程包括反复的测试、记录问题和修改提示。作为一个探索性特征,该模拟使用GPT-4为学习者提供关于他们与虚拟模拟病人沟通表现以及他们对困难对话指南遵守情况的书面反馈。

结果

深入访谈有助于确定初级保健医生和患者在乳腺癌筛查过程中对话的适当时间、沟通方式和协议。该场景模拟了医生和患者之间的电话通话,以讨论诊断性乳房X光检查的异常结果,该结果表明需要进行活检。初步测试很有前景。虚拟模拟病人询问了关于他们乳房X光检查结果的合理问题,并使用充满适当情感变化的声音回答学习者的询问。GPT-4生成的表现反馈使用学习者与虚拟模拟病人对话中的相关引述成功识别了优势和改进领域,但它偶尔会错误识别学习者对沟通协议的遵守情况。

结论

与分支路径模拟相比,GPT-4简化了开发过程,并促进了学习者与虚拟模拟病人之间更动态、更像人类的互动。对于下一步,我们将对医学生进行虚拟模拟病人的试点测试,以评估其可行性和可接受性。

相似文献

1
Development of a GPT-4-Powered Virtual Simulated Patient and Communication Training Platform for Medical Students to Practice Discussing Abnormal Mammogram Results With Patients: Multiphase Study.开发一个由GPT-4驱动的虚拟模拟患者和沟通训练平台,供医学生练习与患者讨论异常乳房X光检查结果:多阶段研究。
JMIR Form Res. 2025 Apr 17;9:e65670. doi: 10.2196/65670.
2
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
3
Sexual Harassment and Prevention Training性骚扰与预防培训
4
The effectiveness of using non-traditional teaching methods to prepare student health care professionals for the delivery of mental state examination: a systematic review.使用非传统教学方法培养学生医护专业人员进行精神状态检查的有效性:一项系统综述。
JBI Database System Rev Implement Rep. 2015 Aug 14;13(7):177-212. doi: 10.11124/jbisrir-2015-2263.
5
A Web-Based Training Intervention for Primary Care Providers on Preparing Patients for Cancer Treatment Decisions and Conversations About Clinical Trials: Evaluation of a Pilot Study Using Mixed Methods and Follow-Up.一项针对初级保健提供者的基于网络的培训干预措施,内容为帮助患者为癌症治疗决策及关于临床试验的对话做好准备:一项采用混合方法和随访的试点研究评估
JMIR Med Educ. 2025 Jul 17;11:e66892. doi: 10.2196/66892.
6
Interventions for interpersonal communication about end of life care between health practitioners and affected people.干预健康从业者与受影响者之间关于临终关怀的人际沟通。
Cochrane Database Syst Rev. 2022 Jul 8;7(7):CD013116. doi: 10.1002/14651858.CD013116.pub2.
7
The experience of adults who choose watchful waiting or active surveillance as an approach to medical treatment: a qualitative systematic review.选择观察等待或主动监测作为治疗方法的成年人的经历:一项定性系统评价。
JBI Database System Rev Implement Rep. 2016 Feb;14(2):174-255. doi: 10.11124/jbisrir-2016-2270.
8
Adapting Safety Plans for Autistic Adults with Involvement from the Autism Community.在自闭症群体的参与下为成年自闭症患者调整安全计划。
Autism Adulthood. 2025 May 28;7(3):293-302. doi: 10.1089/aut.2023.0124. eCollection 2025 Jun.
9
Can We Enhance Shared Decision-making for Periacetabular Osteotomy Surgery? A Qualitative Study of Patient Experiences.我们能否加强髋臼周围截骨术的共同决策?一项关于患者体验的定性研究。
Clin Orthop Relat Res. 2025 Jan 1;483(1):120-136. doi: 10.1097/CORR.0000000000003198. Epub 2024 Jul 23.
10
Neonatal Nurses' Understanding of the Factors That Enhance and Hinder Early Communication Between Preterm Infants and Their Parents: A Narrative Inquiry Study.新生儿护士对促进和阻碍早产儿与其父母早期沟通因素的理解:一项叙事探究研究。
Int J Lang Commun Disord. 2025 Jul-Aug;60(4):e70093. doi: 10.1111/1460-6984.70093.

引用本文的文献

1
DIALOGUE: A Generative AI-Based Pre-Post Simulation Study to Enhance Diagnostic Communication in Medical Students Through Virtual Type 2 Diabetes Scenarios.对话:一项基于生成式人工智能的模拟前后研究,通过虚拟2型糖尿病场景增强医学生的诊断沟通能力。
Eur J Investig Health Psychol Educ. 2025 Aug 7;15(8):152. doi: 10.3390/ejihpe15080152.
2
Empowering tomorrow's public health researchers and clinicians to develop digital health interventions using chatbots, virtual reality, and other AI technologies.助力未来的公共卫生研究人员和临床医生利用聊天机器人、虚拟现实及其他人工智能技术开发数字健康干预措施。
Front Public Health. 2025 Jul 8;13:1577076. doi: 10.3389/fpubh.2025.1577076. eCollection 2025.

本文引用的文献

1
The TRIPOD-LLM reporting guideline for studies using large language models.使用大语言模型的研究的TRIPOD-LLM报告指南。
Nat Med. 2025 Jan;31(1):60-69. doi: 10.1038/s41591-024-03425-5. Epub 2025 Jan 8.
2
A Language Model-Powered Simulated Patient With Automated Feedback for History Taking: Prospective Study.基于语言模型的模拟患者与自动化反馈的病史采集:前瞻性研究。
JMIR Med Educ. 2024 Aug 16;10:e59213. doi: 10.2196/59213.
3
Assessing GPT-4's Performance in Delivering Medical Advice: Comparative Analysis With Human Experts.
评估 GPT-4 提供医疗建议的表现:与人类专家的比较分析。
JMIR Med Educ. 2024 Jul 8;10:e51282. doi: 10.2196/51282.
4
The effect of using a large language model to respond to patient messages.使用大语言模型回复患者信息的效果。
Lancet Digit Health. 2024 Jun;6(6):e379-e381. doi: 10.1016/S2589-7500(24)00060-8. Epub 2024 Apr 24.
5
Evaluating large language models as agents in the clinic.评估大型语言模型作为临床中的智能体。
NPJ Digit Med. 2024 Apr 3;7(1):84. doi: 10.1038/s41746-024-01083-y.
6
Utilizing generative conversational artificial intelligence to create simulated patient encounters: a pilot study for anaesthesia training.利用生成式对话人工智能创建模拟患者就诊:麻醉培训的初步研究。
Postgrad Med J. 2024 Mar 18;100(1182):237-241. doi: 10.1093/postmj/qgad137.
7
A Generative Pretrained Transformer (GPT)-Powered Chatbot as a Simulated Patient to Practice History Taking: Prospective, Mixed Methods Study.基于生成式预训练转换器(GPT)的聊天机器人作为模拟患者进行病史采集的实践研究:前瞻性混合方法研究。
JMIR Med Educ. 2024 Jan 16;10:e53961. doi: 10.2196/53961.
8
Embodied Conversational Agents for Chronic Diseases: Scoping Review.具身对话代理在慢性病中的应用:范围综述。
J Med Internet Res. 2024 Jan 9;26:e47134. doi: 10.2196/47134.
9
Assessing the potential of GPT-4 to perpetuate racial and gender biases in health care: a model evaluation study.评估 GPT-4 在医疗保健中延续种族和性别偏见的潜力:一项模型评估研究。
Lancet Digit Health. 2024 Jan;6(1):e12-e22. doi: 10.1016/S2589-7500(23)00225-X.
10
Evaluation of the performance of GPT-3.5 and GPT-4 on the Polish Medical Final Examination.评估 GPT-3.5 和 GPT-4 在波兰医学期末考试中的表现。
Sci Rep. 2023 Nov 22;13(1):20512. doi: 10.1038/s41598-023-46995-z.