• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生成式人工智能初步患者安全分类系统的开发。

Development of a Preliminary Patient Safety Classification System for Generative AI.

作者信息

Hose Bat-Zion, Handley Jessica L, Biro Joshua, Reddy Sahithi, Krevat Seth, Hettinger Aaron Zachary, Ratwani Raj M

机构信息

National Center for Human Factors in Healthcare, MedStar Health Research Institute, Washington, District of Columbia, USA

Georgetown University Medical Center, Washington, District of Columbia, USA.

出版信息

BMJ Qual Saf. 2025 Jan 28;34(2):130-132. doi: 10.1136/bmjqs-2024-017918.

DOI:10.1136/bmjqs-2024-017918
PMID:39753358
Abstract

Generative artificial intelligence (AI) technologies have the potential to revolutionise healthcare delivery but require classification and monitoring of patient safety risks. To address this need, we developed and evaluated a preliminary classification system for categorising generative AI patient safety errors. Our classification system is organised around two AI system stages (input and output) with specific error types by stage. We applied our classification system to two generative AI applications to assess its effectiveness in categorising safety issues: patient-facing conversational large language models (LLMs) and an ambient digital scribe (ADS) system for clinical documentation. In the LLM analysis, we identified 45 errors across 27 patient medical queries, with omission being the most common (42% of errors). Of the identified errors, 50% were categorised as low clinical significance, 25% as moderate clinical significance and 25% as high clinical significance. Similarly, in the ADS simulation, we identified 66 errors across 11 patient visits, with omission being the most common (83% of errors). Of the identified errors, 55% were categorised as low clinical significance and 45% were categorised as moderate clinical significance. These findings demonstrate the classification system's utility in categorising output errors from two different AI healthcare applications, providing a starting point for developing a robust process to better understand AI-enabled errors.

摘要

生成式人工智能(AI)技术有潜力彻底改变医疗服务的提供方式,但需要对患者安全风险进行分类和监测。为满足这一需求,我们开发并评估了一个用于对生成式人工智能患者安全错误进行分类的初步分类系统。我们的分类系统围绕人工智能系统的两个阶段(输入和输出)进行组织,并按阶段划分了特定的错误类型。我们将分类系统应用于两个生成式人工智能应用程序,以评估其在对安全问题进行分类方面的有效性:面向患者的对话式大语言模型(LLMs)和用于临床文档记录的环境数字抄写员(ADS)系统。在大语言模型分析中,我们在27个患者医疗查询中识别出了45个错误,其中遗漏最为常见(占错误的42%)。在已识别的错误中,50%被归类为临床意义较低,25%为中等临床意义,25%为高临床意义。同样,在环境数字抄写员模拟中,我们在11次患者就诊中识别出了66个错误,其中遗漏最为常见(占错误的83%)。在已识别的错误中,55%被归类为临床意义较低,45%被归类为中等临床意义。这些发现证明了该分类系统在对两种不同的人工智能医疗应用程序的输出错误进行分类方面的实用性,为开发一个强大的流程以更好地理解人工智能导致的错误提供了一个起点。

相似文献

1
Development of a Preliminary Patient Safety Classification System for Generative AI.生成式人工智能初步患者安全分类系统的开发。
BMJ Qual Saf. 2025 Jan 28;34(2):130-132. doi: 10.1136/bmjqs-2024-017918.
2
Accuracy and Safety of AI-Enabled Scribe Technology: Instrument Validation Study.人工智能辅助抄写技术的准确性与安全性:仪器验证研究
J Med Internet Res. 2025 Jan 27;27:e64993. doi: 10.2196/64993.
3
Using ChatGPT-4 to Create Structured Medical Notes From Audio Recordings of Physician-Patient Encounters: Comparative Study.利用 ChatGPT-4 从医患对话的音频记录中创建结构化的医疗记录:比较研究。
J Med Internet Res. 2024 Apr 22;26:e54419. doi: 10.2196/54419.
4
Ethical Application of Generative Artificial Intelligence in Medicine.生成式人工智能在医学中的伦理应用
Arthroscopy. 2025 Apr;41(4):874-885. doi: 10.1016/j.arthro.2024.12.011. Epub 2024 Dec 15.
5
Artificial Intelligence Scribe and Large Language Model Technology in Healthcare Documentation: Advantages, Limitations, and Recommendations.医疗记录中的人工智能抄写员和大语言模型技术:优势、局限性与建议
Plast Reconstr Surg Glob Open. 2025 Jan 16;13(1):e6450. doi: 10.1097/GOX.0000000000006450. eCollection 2025 Jan.
6
Perspective review: Will generative AI make common data models obsolete in future analyses of distributed data networks?观点综述:生成式人工智能会使通用数据模型在分布式数据网络的未来分析中过时吗?
Ther Adv Drug Saf. 2025 Apr 21;16:20420986251332743. doi: 10.1177/20420986251332743. eCollection 2025.
7
Generative Large Language Model-Powered Conversational AI App for Personalized Risk Assessment: Case Study in COVID-19.用于个性化风险评估的生成式大语言模型驱动的对话式人工智能应用程序:COVID-19案例研究
JMIR AI. 2025 Mar 27;4:e67363. doi: 10.2196/67363.
8
Artificial Intelligence in Health Care: A Rallying Cry for Critical Clinical Research and Ethical Thinking.医疗保健中的人工智能:呼吁开展关键临床研究与进行伦理思考
Clin Oncol (R Coll Radiol). 2025 May;41:103798. doi: 10.1016/j.clon.2025.103798. Epub 2025 Mar 8.
9
Detecting Algorithmic Errors and Patient Harms for AI-Enabled Medical Devices in Randomized Controlled Trials: Protocol for a Systematic Review.在随机对照试验中检测人工智能医疗设备的算法错误和患者伤害:系统评价方案。
JMIR Res Protoc. 2024 Jun 28;13:e51614. doi: 10.2196/51614.
10
Leveraging Generative AI and Large Language Models: A Comprehensive Roadmap for Healthcare Integration.利用生成式人工智能和大语言模型:医疗保健整合综合路线图。
Healthcare (Basel). 2023 Oct 20;11(20):2776. doi: 10.3390/healthcare11202776.

引用本文的文献

1
Use of ChatGPT for Urinary Symptom Management Among People With Spinal Cord Injury or Disease: Qualitative Study.脊髓损伤或疾病患者使用ChatGPT进行泌尿系统症状管理:定性研究
JMIR Rehabil Assist Technol. 2025 May 29;12:e70339. doi: 10.2196/70339.