• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

可解释人工智能作为公平决策的证据。

Explainable AI as evidence of fair decisions.

作者信息

Leben Derek

机构信息

Tepper School of Business, Carnegie Mellon University, Pittsburgh, PA, United States.

出版信息

Front Psychol. 2023 Feb 14;14:1069426. doi: 10.3389/fpsyg.2023.1069426. eCollection 2023.

DOI:10.3389/fpsyg.2023.1069426
PMID:36865358
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9971226/
Abstract

This paper will propose that explanations are valuable to those impacted by a model's decisions (model patients) to the extent that they provide evidence that a past adverse decision was unfair. Under this proposal, we should favor models and explainability methods which generate counterfactuals of two types. The first type of counterfactual is evidence of fairness: a set of states under the control of the patient which (if changed) would have led to a beneficial decision. The second type of counterfactual is evidence of fairness: a set of irrelevant group or behavioral attributes which (if changed) would have led to a beneficial decision. Each of these counterfactual statements is related to fairness, under the Liberal Egalitarian idea that treating one person differently than another is justified only on the basis of features which were plausibly under each person's control. Other aspects of an explanation, such as feature importance and actionable recourse, are essential under this view, and need not be a goal of explainable AI.

摘要

本文将提出,解释对于受模型决策影响的人(模型患者)具有价值,其程度取决于这些解释能提供证据表明过去的不利决策是不公平的。根据这一提议,我们应该青睐能生成两种反事实情况的模型和可解释性方法。第一种反事实情况是公平性的证据:一组在患者控制之下的状态(如果改变这些状态)会导致有利的决策。第二种反事实情况也是公平性的证据:一组不相关的群体或行为属性(如果改变这些属性)会导致有利的决策。根据自由平等主义的观点,只有基于每个人可能控制的特征来区别对待一个人和另一个人才是合理的,上述每一个反事实陈述都与公平性相关。在这种观点下,解释的其他方面,如特征重要性和可采取的行动,是至关重要的,而且不一定是可解释人工智能的目标。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e3/9971226/d6eace975a2f/fpsyg-14-1069426-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e3/9971226/d6eace975a2f/fpsyg-14-1069426-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68e3/9971226/d6eace975a2f/fpsyg-14-1069426-g0001.jpg

相似文献

1
Explainable AI as evidence of fair decisions.可解释人工智能作为公平决策的证据。
Front Psychol. 2023 Feb 14;14:1069426. doi: 10.3389/fpsyg.2023.1069426. eCollection 2023.
2
: counterfactual explanations for fairness.公平性的反事实解释
Mach Learn. 2023 Mar 28:1-32. doi: 10.1007/s10994-023-06319-8.
3
From local counterfactuals to global feature importance: efficient, robust, and model-agnostic explanations for brain connectivity networks.从局部反事实推断到全局特征重要性:脑连接网络的高效、稳健且与模型无关的解释。
Comput Methods Programs Biomed. 2023 Jun;236:107550. doi: 10.1016/j.cmpb.2023.107550. Epub 2023 Apr 16.
4
DECE: Decision Explorer with Counterfactual Explanations for Machine Learning Models.决策探索器:用于机器学习模型的反事实解释的决策探索器。
IEEE Trans Vis Comput Graph. 2021 Feb;27(2):1438-1447. doi: 10.1109/TVCG.2020.3030342. Epub 2021 Jan 28.
5
Explaining the black-box smoothly-A counterfactual approach.黑盒解释的平滑化——反事实方法。
Med Image Anal. 2023 Feb;84:102721. doi: 10.1016/j.media.2022.102721. Epub 2022 Dec 13.
6
Can counterfactual explanations of AI systems' predictions skew lay users' causal intuitions about the world? If so, can we correct for that?人工智能系统预测的反事实解释会扭曲普通用户对世界的因果直觉吗?如果是这样,我们能对此加以纠正吗?
Patterns (N Y). 2022 Dec 9;3(12):100635. doi: 10.1016/j.patter.2022.100635.
7
How I Would have been Differently Treated. Discrimination Through the Lens of Counterfactual Fairness.我可能会受到怎样不同的对待。基于反事实公平视角的歧视
Res Publica. 2023;29(2):185-211. doi: 10.1007/s11158-023-09586-3. Epub 2023 Mar 20.
8
Explanatory pragmatism: a context-sensitive framework for explainable medical AI.解释性实用主义:一个用于可解释医学人工智能的上下文敏感框架。
Ethics Inf Technol. 2022;24(1):13. doi: 10.1007/s10676-022-09632-3. Epub 2022 Feb 28.
9
Training calibration-based counterfactual explainers for deep learning models in medical image analysis.基于训练校准的深度学习模型反事实解释器在医学图像分析中的应用。
Sci Rep. 2022 Jan 12;12(1):597. doi: 10.1038/s41598-021-04529-5.
10
How people reason with counterfactual and causal explanations for Artificial Intelligence decisions in familiar and unfamiliar domains.人们如何在熟悉和不熟悉的领域中,用反事实和因果解释来推理人工智能决策。
Mem Cognit. 2023 Oct;51(7):1481-1496. doi: 10.3758/s13421-023-01407-5. Epub 2023 Mar 24.

本文引用的文献

1
Artificial Intelligence Can't Be Charmed: The Effects of Impartiality on Laypeople's Algorithmic Preferences.人工智能无法被“迷惑”:公正性对普通大众算法偏好的影响。
Front Psychol. 2022 Jun 29;13:898027. doi: 10.3389/fpsyg.2022.898027. eCollection 2022.
2
Artificial intelligence explainability: the technical and ethical dimensions.人工智能可解释性:技术和伦理维度。
Philos Trans A Math Phys Eng Sci. 2021 Oct 4;379(2207):20200363. doi: 10.1098/rsta.2020.0363. Epub 2021 Aug 16.
3
Principles and Practice of Explainable Machine Learning.
可解释机器学习原理与实践
Front Big Data. 2021 Jul 1;4:688969. doi: 10.3389/fdata.2021.688969. eCollection 2021.
4
Unequal chances: ex ante fairness and individual control.不平等的机会:事前公平与个人控制。
Sci Rep. 2020 Dec 14;10(1):21862. doi: 10.1038/s41598-020-78335-w.
5
Are Individuals Luck Egalitarians? - An Experiment on the Influence of Brute and Option Luck on Social Preferences.个体是运气平等主义者吗?——关于原生运气和选项运气对社会偏好影响的一项实验
Front Psychol. 2017 Mar 29;8:460. doi: 10.3389/fpsyg.2017.00460. eCollection 2017.
6
Two paths to blame: Intentionality directs moral information processing along two distinct tracks.两种归咎路径:意向性将道德信息处理引导至两条截然不同的路径。
J Exp Psychol Gen. 2017 Jan;146(1):123-133. doi: 10.1037/xge0000234.
7
Four-factor justice and daily job satisfaction: a multilevel investigation.四因素公平与日常工作满意度:一项多层次调查。
J Appl Psychol. 2009 May;94(3):770-81. doi: 10.1037/a0015714.
8
Crime and punishment: distinguishing the roles of causal and intentional analyses in moral judgment.犯罪与惩罚:区分因果分析和意图分析在道德判断中的作用
Cognition. 2008 Aug;108(2):353-80. doi: 10.1016/j.cognition.2008.03.006. Epub 2008 Apr 24.
9
Fairness versus reason in the ultimatum game.最后通牒博弈中的公平与理性
Science. 2000 Sep 8;289(5485):1773-5. doi: 10.1126/science.289.5485.1773.
10
Philosophical conceptions of the self: implications for cognitive science.自我的哲学概念:对认知科学的启示
Trends Cogn Sci. 2000 Jan;4(1):14-21. doi: 10.1016/s1364-6613(99)01417-5.