Suppr超能文献

超越二元决策:评估人工智能错误类型对人工智能辅助任务中的信任和性能的影响。

Beyond Binary Decisions: Evaluating the Effects of AI Error Type on Trust and Performance in AI-Assisted Tasks.

作者信息

Kim Jin Yong, Lester Corey, Yang X Jessie

机构信息

University of Michigan, USA.

出版信息

Hum Factors. 2025 Mar 19:187208251326795. doi: 10.1177/00187208251326795.

Abstract

ObjectiveWe investigated how various error patterns from an AI aid in the nonbinary decision scenario influence human operators' trust in the AI system and their task performance.BackgroundExisting research on trust in automation/autonomy predominantly uses the signal detection theory (SDT) to model autonomy performance. The SDT classifies the world into binary states and hence oversimplifies the interaction observed in real-world scenarios. Allowing multi-class classification of the world reveals intriguing error patterns previously unexplored in prior literature.MethodThirty-five participants completed 60 trials of a simulated mental rotation task assisted by an AI with 70-80% reliability. Participants' trust in and dependence on the AI system and their performance were measured. By combining participants' initial performance and the AI aid's performance, five distinct patterns emerged. Mixed-effects models were built to examine the effects of different patterns on trust adjustment, performance, and reaction time.ResultsVarying error patterns from AI impacted performance, reaction times, and trust. Some AI errors provided false reassurance, misleading operators into believing their incorrect decisions were correct, worsening performance and trust. Paradoxically, some AI errors prompted safety checks and verifications, which, despite causing a moderate decrease in trust, ultimately enhanced overall performance.ConclusionThe findings demonstrate that the types of errors made by an AI system significantly affect human trust and performance, emphasizing the need to model the complicated human-AI interaction in real life.ApplicationThese insights can guide the development of AI systems that classify the state of the world into multiple classes, enabling the operators to make more informed and accurate decisions based on feedback.

摘要

目的

我们研究了人工智能辅助在非二元决策场景中的各种错误模式如何影响人类操作员对人工智能系统的信任及其任务表现。

背景

现有关于对自动化/自主性信任的研究主要使用信号检测理论(SDT)来模拟自主性表现。信号检测理论将世界分为二元状态,因此过度简化了现实世界场景中观察到的交互。允许对世界进行多类别分类揭示了先前文献中未探索的有趣错误模式。

方法

35名参与者完成了60次由可靠性为70%-80%的人工智能辅助的模拟心理旋转任务试验。测量了参与者对人工智能系统的信任和依赖以及他们的表现。通过结合参与者的初始表现和人工智能辅助的表现,出现了五种不同的模式。建立混合效应模型以检查不同模式对信任调整、表现和反应时间的影响。

结果

人工智能的不同错误模式影响了表现、反应时间和信任。一些人工智能错误提供了虚假的安心感,误导操作员相信他们的错误决策是正确的,从而使表现和信任恶化。矛盾的是,一些人工智能错误促使进行安全检查和验证,尽管这导致信任适度下降,但最终提高了整体表现。

结论

研究结果表明,人工智能系统所犯错误的类型会显著影响人类的信任和表现,强调了在现实生活中对复杂的人机交互进行建模的必要性。

应用

这些见解可以指导将世界状态分类为多个类别的人工智能系统的开发,使操作员能够根据反馈做出更明智和准确的决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ba1/12420941/fdc06ed5094d/10.1177_00187208251326795-fig1.jpg

相似文献

1
6
Community views on mass drug administration for soil-transmitted helminths: a qualitative evidence synthesis.
Cochrane Database Syst Rev. 2025 Jun 20;6:CD015794. doi: 10.1002/14651858.CD015794.pub2.
7
What makes a 'good' decision with artificial intelligence? A grounded theory study in paediatric care.
BMJ Evid Based Med. 2025 May 20;30(3):183-193. doi: 10.1136/bmjebm-2024-112919.
8
Systemic treatments for metastatic cutaneous melanoma.
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
9
Improving reliability of movement assessment in Parkinson's disease using computer vision-based automated severity estimation.
J Parkinsons Dis. 2025 Mar;15(2):349-360. doi: 10.1177/1877718X241312605. Epub 2025 Feb 13.

本文引用的文献

4
Trust in Robots: Challenges and Opportunities.
Curr Robot Rep. 2020;1(4):297-309. doi: 10.1007/s43154-020-00029-y. Epub 2020 Sep 3.
5
Toward Quantifying Trust Dynamics: How People Adjust Their Trust After Moment-to-Moment Interaction With Automation.
Hum Factors. 2023 Aug;65(5):862-878. doi: 10.1177/00187208211034716. Epub 2021 Aug 29.
6
7
Trust in Artificial Intelligence: Meta-Analytic Findings.
Hum Factors. 2023 Mar;65(2):337-359. doi: 10.1177/00187208211013988. Epub 2021 May 28.
9
A systematic review of the nature of dispensing errors in hospital pharmacies.
Integr Pharm Res Pract. 2016 Jan 12;5:1-10. doi: 10.2147/IPRP.S95733. eCollection 2016.
10
Trust and the Compliance-Reliance Paradigm: The Effects of Risk, Error Bias, and Reliability on Trust and Dependence.
Hum Factors. 2017 May;59(3):333-345. doi: 10.1177/0018720816682648. Epub 2016 Dec 19.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验