
Beyond Binary Decisions: Evaluating the Effects of AI Error Type on Trust and Performance in AI-Assisted Tasks.

Authors

Kim Jin Yong, Lester Corey, Yang X Jessie

Affiliation

University of Michigan, USA.

Publication

Hum Factors. 2025 Mar 19:187208251326795. doi: 10.1177/00187208251326795.

DOI: 10.1177/00187208251326795
PMID: 40104968
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12273520/
Abstract

Objective

We investigated how various error patterns from an AI aid in the nonbinary decision scenario influence human operators' trust in the AI system and their task performance.

Background

Existing research on trust in automation/autonomy predominantly uses the signal detection theory (SDT) to model autonomy performance. The SDT classifies the world into binary states and hence oversimplifies the interaction observed in real-world scenarios. Allowing multi-class classification of the world reveals intriguing error patterns previously unexplored in prior literature.

Method

Thirty-five participants completed 60 trials of a simulated mental rotation task assisted by an AI with 70-80% reliability. Participants' trust in and dependence on the AI system and their performance were measured. By combining participants' initial performance and the AI aid's performance, five distinct patterns emerged. Mixed-effects models were built to examine the effects of different patterns on trust adjustment, performance, and reaction time.

Results

Varying error patterns from AI impacted performance, reaction times, and trust. Some AI errors provided false reassurance, misleading operators into believing their incorrect decisions were correct, worsening performance and trust. Paradoxically, some AI errors prompted safety checks and verifications, which, despite causing a moderate decrease in trust, ultimately enhanced overall performance.

Conclusion

The findings demonstrate that the types of errors made by an AI system significantly affect human trust and performance, emphasizing the need to model the complicated human-AI interaction in real life.

Application

These insights can guide the development of AI systems that classify the state of the world into multiple classes, enabling the operators to make more informed and accurate decisions based on feedback.
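
The abstract says that combining the participant's initial answer with the AI aid's answer yields five patterns, but it does not enumerate them. The sketch below shows one plausible partition for a multi-class task. It is an assumption for illustration only, not the authors' coding scheme: when both the human and the AI are wrong, the AI may either repeat the human's wrong answer (which could give false reassurance) or give a different wrong answer (which could prompt a re-check). The names `Pattern` and `classify_trial` are hypothetical.

```python
from enum import Enum


class Pattern(Enum):
    """Hypothetical labels; the abstract does not name the five patterns."""
    BOTH_CORRECT = "human correct, AI correct"
    HUMAN_CORRECT_AI_WRONG = "human correct, AI wrong"
    HUMAN_WRONG_AI_CORRECT = "human wrong, AI correct"
    BOTH_WRONG_SAME = "both wrong, same answer"
    BOTH_WRONG_DIFFERENT = "both wrong, different answers"


def classify_trial(truth, human_initial, ai_answer):
    """Label one trial from the true class, the human's initial answer,
    and the AI's answer (assumed partition, see the note above)."""
    human_ok = human_initial == truth
    ai_ok = ai_answer == truth
    if human_ok and ai_ok:
        return Pattern.BOTH_CORRECT
    if human_ok:
        return Pattern.HUMAN_CORRECT_AI_WRONG
    if ai_ok:
        return Pattern.HUMAN_WRONG_AI_CORRECT
    # Both are wrong: with more than two classes they can still agree or differ.
    if human_initial == ai_answer:
        return Pattern.BOTH_WRONG_SAME
    return Pattern.BOTH_WRONG_DIFFERENT
```

With only two classes the last pattern cannot occur, since two wrong answers must coincide, which is consistent with the abstract's point that a binary framing hides some error patterns.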


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/fdc06ed5094d/10.1177_00187208251326795-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/6faea2ce75fc/10.1177_00187208251326795-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/097681184d55/10.1177_00187208251326795-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/8a46a9b541b9/10.1177_00187208251326795-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/a924866bac75/10.1177_00187208251326795-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/88ddea876315/10.1177_00187208251326795-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/d338491e2c7b/10.1177_00187208251326795-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/d2cfd7502ff2/10.1177_00187208251326795-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/cc8cb3514ba5/10.1177_00187208251326795-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/4a8daf022e69/10.1177_00187208251326795-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/752a0e02b6dc/10.1177_00187208251326795-fig11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/b9a561d107f8/10.1177_00187208251326795-fig12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/6c7032d809ed/10.1177_00187208251326795-fig13.jpg

Similar articles

1
Beyond Binary Decisions: Evaluating the Effects of AI Error Type on Trust and Performance in AI-Assisted Tasks.
Hum Factors. 2025 Mar 19:187208251326795. doi: 10.1177/00187208251326795.
2
Accreditation through the eyes of nurse managers: an infinite staircase or a phenomenon that evaporates like water.
J Health Organ Manag. 2025 Jun 30. doi: 10.1108/JHOM-01-2025-0029.
3
Sexual Harassment and Prevention Training
4
Trust, Trustworthiness, and the Future of Medical AI: Outcomes of an Interdisciplinary Expert Workshop.
J Med Internet Res. 2025 Jun 2;27:e71236. doi: 10.2196/71236.
5
Perspectives of Health Care Professionals on the Use of AI to Support Clinical Decision-Making in the Management of Multiple Long-Term Conditions: Interview Study.
J Med Internet Res. 2025 Jul 4;27:e71980. doi: 10.2196/71980.
6
Community views on mass drug administration for soil-transmitted helminths: a qualitative evidence synthesis.
Cochrane Database Syst Rev. 2025 Jun 20;6:CD015794. doi: 10.1002/14651858.CD015794.pub2.
7
What makes a 'good' decision with artificial intelligence? A grounded theory study in paediatric care.
BMJ Evid Based Med. 2025 May 20;30(3):183-193. doi: 10.1136/bmjebm-2024-112919.
8
Systemic treatments for metastatic cutaneous melanoma.
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
9
Improving reliability of movement assessment in Parkinson's disease using computer vision-based automated severity estimation.
J Parkinsons Dis. 2025 Mar;15(2):349-360. doi: 10.1177/1877718X241312605. Epub 2025 Feb 13.
10
Short-Term Memory Impairment

Cited by

1
Comparative Analysis of Generative Artificial Intelligence Systems in Solving Clinical Pharmacy Problems: Mixed Methods Study.
JMIR Med Inform. 2025 Jul 24;13:e76128. doi: 10.2196/76128.

References

1
The Effects of Presenting AI Uncertainty Information on Pharmacists' Trust in Automated Pill Recognition Technology: Exploratory Mixed Subjects Study.
JMIR Hum Factors. 2025 Feb 11;12:e60273. doi: 10.2196/60273.
2
Effect of Artificial Intelligence Helpfulness and Uncertainty on Cognitive Interactions with Pharmacists: Randomized Controlled Trial.
J Med Internet Res. 2025 Jan 31;27:e59946. doi: 10.2196/59946.
3
Designing Human-Centered AI to Prevent Medication Dispensing Errors: Focus Group Study With Pharmacists.
JMIR Form Res. 2023 Dec 25;7:e51921. doi: 10.2196/51921.
4
Trust in Robots: Challenges and Opportunities.
Curr Robot Rep. 2020;1(4):297-309. doi: 10.1007/s43154-020-00029-y. Epub 2020 Sep 3.
5
Toward Quantifying Trust Dynamics: How People Adjust Their Trust After Moment-to-Moment Interaction With Automation.
Hum Factors. 2023 Aug;65(5):862-878. doi: 10.1177/00187208211034716. Epub 2021 Aug 29.
6
Performance evaluation of a prescription medication image classification model: an observational cohort.
NPJ Digit Med. 2021 Jul 27;4(1):118. doi: 10.1038/s41746-021-00483-8.
7
Trust in Artificial Intelligence: Meta-Analytic Findings.
Hum Factors. 2023 Mar;65(2):337-359. doi: 10.1177/00187208211013988. Epub 2021 May 28.
8
Not All Information Is Equal: Effects of Disclosing Different Types of Likelihood Information on Trust, Compliance and Reliance, and Task Performance in Human-Automation Teaming.
Hum Factors. 2020 Sep;62(6):987-1001. doi: 10.1177/0018720819862916. Epub 2019 Jul 26.
9
A systematic review of the nature of dispensing errors in hospital pharmacies.
Integr Pharm Res Pract. 2016 Jan 12;5:1-10. doi: 10.2147/IPRP.S95733. eCollection 2016.
10
Trust and the Compliance-Reliance Paradigm: The Effects of Risk, Error Bias, and Reliability on Trust and Dependence.
Hum Factors. 2017 May;59(3):333-345. doi: 10.1177/0018720816682648. Epub 2016 Dec 19.