Institute of Biomedical Ethics and History of Medicine, University of Zurich, Zurich, Switzerland.
ETH Zurich, Zurich, Switzerland.
Sci Eng Ethics. 2024 Nov 21;30(6):55. doi: 10.1007/s11948-024-00522-z.
We address an open problem in the philosophy of artificial intelligence (AI): how to justify the epistemic attitudes we have towards the trustworthiness of AI systems. The problem is important, as providing reasons to believe that AI systems are worthy of trust is key to relying appropriately on these systems in human-AI interactions. In our approach, we consider the trustworthiness of an AI as a time-relative, composite property of the system with two distinct facets. One is the actual trustworthiness of the AI; the other is the perceived trustworthiness of the system as assessed by its users while interacting with it. We show that credences, namely, beliefs held with a degree of confidence, are the appropriate attitude for capturing the facets of the trustworthiness of an AI over time. Then, we introduce a reliabilist account providing justification for credences in the trustworthiness of AI, which we derive from Tang's probabilistic theory of justified credence. Our account stipulates that a credence in the trustworthiness of an AI system is justified if and only if it is caused by an assessment process that tends to result in a high proportion of credences for which the actual and perceived trustworthiness of the AI are calibrated. This approach informs research on the ethics of AI and human-AI interactions by offering actionable recommendations for measuring the reliability of the process through which users assess the trustworthiness of the system, for investigating how well perceived trustworthiness is calibrated to the actual trustworthiness of the AI, and for examining users' appropriate reliance on the system.
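The abstract's justification criterion — that a credence is justified when produced by an assessment process yielding a high proportion of calibrated perceived/actual trustworthiness pairs — can be given a minimal operational sketch. Everything below is an illustrative assumption, not the authors' formal account: the trustworthiness scale ([0, 1]), the calibration tolerance, and the function names are all hypothetical choices made for the example.

```python
# Illustrative sketch only: scales, tolerance, and names are assumptions,
# not the formal account from the paper.

def calibration_gap(perceived, actual):
    """Mean absolute difference between perceived and actual trustworthiness,
    each a sequence of values in [0, 1] over the same time points."""
    if len(perceived) != len(actual):
        raise ValueError("series must align over the same time points")
    return sum(abs(p - a) for p, a in zip(perceived, actual)) / len(perceived)

def process_reliability(episodes, tolerance=0.1):
    """Proportion of assessment episodes whose perceived trustworthiness stays
    within `tolerance` of the actual trustworthiness, i.e. the rate at which
    the assessment process produces calibrated credences."""
    calibrated = sum(1 for p, a in episodes if calibration_gap(p, a) <= tolerance)
    return calibrated / len(episodes)
```

On this sketch, a user's credence would count as justified (in the paper's reliabilist sense) when `process_reliability` for the assessment process that produced it is high; the tolerance threshold stands in for whatever calibration criterion a full account would specify.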