• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

针对安全关键环境中人工智能的负责任部署,基于经验得出的评估要求。

Empirically derived evaluation requirements for responsible deployments of AI in safety-critical settings.

作者信息

Morey Dane A, Rayo Michael F, Woods David D

机构信息

The Ohio State University, Columbus, OH, USA.

出版信息

NPJ Digit Med. 2025 Jun 18;8(1):374. doi: 10.1038/s41746-025-01784-y.

DOI:10.1038/s41746-025-01784-y
PMID:40533485
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12177047/
Abstract

Processes to assure the safe, effective, and responsible deployment of artificial intelligence (AI) in safety-critical settings are urgently needed. Here we show a procedure to empirically evaluate the impacts of AI augmentation as a basis for responsible deployment. We evaluated three augmentative AI technologies nurses used to recognize imminent patient emergencies, including combinations of AI recommendations and explanations. The evaluation involved 450 nursing students and 12 licensed nurses assessing 10 historical patient cases. With each technology, nurses' performance was both improved and degraded when the AI algorithm was most correct and misleading, respectively. Our findings caution that AI capabilities alone do not guarantee a safe and effective joint human-AI system. We propose two minimum requirements for evaluating AI in safety-critical settings: (1) empirically measure the performance of people and AI together and (2) examine a range of challenging cases which produce a range of strong, mediocre, and poor AI performance.

摘要

迫切需要制定相关流程,以确保人工智能(AI)在安全关键环境中的安全、有效和负责任的部署。在此,我们展示了一种通过实证评估AI增强影响的程序,作为负责任部署的基础。我们评估了护士用于识别即将发生的患者紧急情况的三种增强型AI技术,包括AI建议和解释的组合。该评估涉及450名护理专业学生和12名执业护士,他们对10个历史患者病例进行了评估。对于每种技术,当AI算法最正确和最具误导性时,护士的表现分别得到了改善和下降。我们的研究结果提醒人们,仅靠AI能力并不能保证一个安全有效的人机联合系统。我们提出了在安全关键环境中评估AI的两个最低要求:(1)通过实证测量人与AI的共同表现;(2)检查一系列具有挑战性的案例,这些案例会产生一系列优秀、中等和较差的AI表现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d55/12177047/b139440383b7/41746_2025_1784_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d55/12177047/7a6bf93dd936/41746_2025_1784_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d55/12177047/d8a9a085ac9b/41746_2025_1784_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d55/12177047/b139440383b7/41746_2025_1784_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d55/12177047/7a6bf93dd936/41746_2025_1784_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d55/12177047/d8a9a085ac9b/41746_2025_1784_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d55/12177047/b139440383b7/41746_2025_1784_Fig3_HTML.jpg

相似文献

1
Empirically derived evaluation requirements for responsible deployments of AI in safety-critical settings.针对安全关键环境中人工智能的负责任部署,基于经验得出的评估要求。
NPJ Digit Med. 2025 Jun 18;8(1):374. doi: 10.1038/s41746-025-01784-y.
2
Stigma Management Strategies of Autistic Social Media Users.自闭症社交媒体用户的污名管理策略
Autism Adulthood. 2025 May 28;7(3):273-282. doi: 10.1089/aut.2023.0095. eCollection 2025 Jun.
3
Surveillance for Violent Deaths - National Violent Death Reporting System, 50 States, the District of Columbia, and Puerto Rico, 2022.暴力死亡监测——2022年全国暴力死亡报告系统,50个州、哥伦比亚特区和波多黎各
MMWR Surveill Summ. 2025 Jun 12;74(5):1-42. doi: 10.15585/mmwr.ss7405a1.
4
Trust, Trustworthiness, and the Future of Medical AI: Outcomes of an Interdisciplinary Expert Workshop.信任、可信度与医学人工智能的未来:跨学科专家研讨会成果
J Med Internet Res. 2025 Jun 2;27:e71236. doi: 10.2196/71236.
5
Pharmacological and electronic cigarette interventions for smoking cessation in adults: component network meta-analyses.药物和电子烟干预成人戒烟的效果:成分网络荟萃分析。
Cochrane Database Syst Rev. 2023 Sep 12;9(9):CD015226. doi: 10.1002/14651858.CD015226.pub2.
6
Adapting Safety Plans for Autistic Adults with Involvement from the Autism Community.在自闭症群体的参与下为成年自闭症患者调整安全计划。
Autism Adulthood. 2025 May 28;7(3):293-302. doi: 10.1089/aut.2023.0124. eCollection 2025 Jun.
7
Embracing artificial intelligence in nursing: exploring the relationship between artificial intelligence-related attitudes, creative self-efficacy, and clinical reasoning competency among nurses.护理领域接纳人工智能:探讨护士对人工智能的态度、创造性自我效能感与临床推理能力之间的关系。
BMC Nurs. 2025 Jun 20;24(1):661. doi: 10.1186/s12912-025-03306-3.
8
Transcriptome analysis and artificial intelligence for predicting lymph node metastasis of esophageal squamous cell carcinoma.用于预测食管鳞状细胞癌淋巴结转移的转录组分析与人工智能
J Thorac Dis. 2025 May 30;17(5):3283-3296. doi: 10.21037/jtd-2025-662. Epub 2025 May 28.
9
Expectations and Requirements of Surgical Staff for an AI-Supported Clinical Decision Support System for Older Patients: Qualitative Study.外科医护人员对用于老年患者的人工智能支持临床决策支持系统的期望与要求:定性研究
JMIR Aging. 2024 Dec 17;7:e57899. doi: 10.2196/57899.
10
Navigating artificial intelligence in home healthcare: challenges and opportunities in nursing wound care.在家居医疗保健中运用人工智能:护理伤口护理的挑战与机遇
BMC Nurs. 2025 Jun 19;24(1):660. doi: 10.1186/s12912-025-03348-7.

本文引用的文献

1
When combinations of humans and AI are useful: A systematic review and meta-analysis.人机结合何时有用:系统评价与荟萃分析
Nat Hum Behav. 2024 Dec;8(12):2293-2303. doi: 10.1038/s41562-024-02024-1. Epub 2024 Oct 28.
2
Early Warning Scores With and Without Artificial Intelligence.早期预警评分与人工智能。
JAMA Netw Open. 2024 Oct 1;7(10):e2438986. doi: 10.1001/jamanetworkopen.2024.38986.
3
Eye tracking insights into physician behaviour with safe and unsafe explainable AI recommendations.通过安全和不安全的可解释人工智能建议对医生行为的眼动追踪洞察。
NPJ Digit Med. 2024 Aug 2;7(1):202. doi: 10.1038/s41746-024-01200-x.
4
Effectiveness of an Artificial Intelligence-Enabled Intervention for Detecting Clinical Deterioration.人工智能干预在临床恶化检测中的有效性。
JAMA Intern Med. 2024 May 1;184(5):557-562. doi: 10.1001/jamainternmed.2024.0084.
5
Heterogeneity and predictors of the effects of AI assistance on radiologists.人工智能辅助对放射科医生影响的异质性和预测因素。
Nat Med. 2024 Mar;30(3):837-849. doi: 10.1038/s41591-024-02850-w. Epub 2024 Mar 19.
6
Automation Bias and Assistive AI: Risk of Harm From AI-Driven Clinical Decision Support.自动化偏差与辅助性人工智能:人工智能驱动的临床决策支持带来的伤害风险
JAMA. 2023 Dec 19;330(23):2255-2257. doi: 10.1001/jama.2023.22557.
7
Measuring the Impact of AI in the Diagnosis of Hospitalized Patients: A Randomized Clinical Vignette Survey Study.测量人工智能在住院患者诊断中的影响:一项随机临床病例调查研究。
JAMA. 2023 Dec 19;330(23):2275-2284. doi: 10.1001/jama.2023.22295.
8
Optimized glycemic control of type 2 diabetes with reinforcement learning: a proof-of-concept trial.基于强化学习的 2 型糖尿病优化血糖控制:一项概念验证试验。
Nat Med. 2023 Oct;29(10):2633-2642. doi: 10.1038/s41591-023-02552-9. Epub 2023 Sep 14.
9
Ironies of artificial intelligence.人工智能的讽刺。
Ergonomics. 2023 Nov;66(11):1656-1668. doi: 10.1080/00140139.2023.2243404. Epub 2023 Aug 6.
10
Evaluation of machine learning-based models for prediction of clinical deterioration: A systematic literature review.基于机器学习的临床恶化预测模型评估:系统文献回顾。
Int J Med Inform. 2023 Jul;175:105084. doi: 10.1016/j.ijmedinf.2023.105084. Epub 2023 Apr 25.