IMT Atlantique, Lab-STICC, UMR CNRS 6285, F-29238 Brest, France.
U2IS Dept., ENSTA, Institut Polytechnique de Paris, Inria Flowers Team, 828 Boulevard des Maréchaux, 91762 Palaiseau Cedex, France; Segula Technologies, Parc d'activité de Pissaloup, Trappes, France; Institut des Systèmes Intelligents et de Robotique, Sorbonne Université, Paris, France.
Neural Netw. 2022 Nov;155:95-118. doi: 10.1016/j.neunet.2022.08.002. Epub 2022 Aug 6.
During the learning process, a child develops a mental representation of the task he or she is learning. A machine learning algorithm likewise develops a latent representation of the task it learns. We investigate how an artificial agent constructs knowledge by analyzing its behavior, i.e., its sequences of moves while learning to perform the Tower of Hanoï (TOH) task. The TOH is a well-known task used in experimental contexts to study problem solving, one of the fundamental processes by which children construct knowledge about their world. We position our work in the field of explainable reinforcement learning for developmental robotics, at the crossroads of cognitive modeling and explainable AI. Our main contribution is a three-step methodology, named Implicit Knowledge Extraction with eXplainable Artificial Intelligence (IKE-XAI), that extracts the implicit knowledge, in the form of an automaton, encoded by an artificial agent during its learning. We showcase this technique for solving and explaining the TOH task when researchers have access only to the moves, which represent observational behavior as in human-machine interaction. To extract the knowledge acquired by the agent at different stages of its training, our approach combines: first, a Q-learning agent that learns to perform the TOH task; second, a trained recurrent neural network that encodes an implicit representation of the TOH task; and third, an XAI process using a post-hoc implicit rule extraction algorithm to extract finite state automata. We propose graph representations as visual and explicit explanations of the Q-learning agent's behavior. Our experiments show that the IKE-XAI approach helps in understanding how the Q-learning agent's behavior develops by providing a global explanation of its knowledge evolution during learning. IKE-XAI also allows researchers to identify the agent's Aha! moment by determining from what moment the knowledge representation stabilizes and the agent no longer learns.
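To make the first step of this pipeline concrete, the sketch below shows a tabular Q-learning agent learning the 3-disk TOH and emitting the move sequences that, in the abstract's pipeline, would later serve as observational behavior for training the recurrent network. The state encoding, reward scheme, and hyperparameters here are illustrative assumptions, not the authors' implementation.

import random
from collections import defaultdict

PEGS, DISKS = 3, 3
START = (tuple(range(DISKS, 0, -1)), (), ())   # all disks on peg 0, largest at bottom
GOAL = ((), (), tuple(range(DISKS, 0, -1)))    # all disks moved to peg 2

def legal_moves(state):
    # Return (src, dst) pairs whose top-disk transfer respects size order.
    moves = []
    for s in range(PEGS):
        if not state[s]:
            continue
        for d in range(PEGS):
            if d != s and (not state[d] or state[d][-1] > state[s][-1]):
                moves.append((s, d))
    return moves

def apply_move(state, move):
    s, d = move
    pegs = [list(p) for p in state]
    pegs[d].append(pegs[s].pop())
    return tuple(tuple(p) for p in pegs)

Q = defaultdict(float)            # Q[(state, move)] -> estimated value
alpha, gamma, eps = 0.5, 0.95, 0.1

def episode(max_steps=200):
    # Run one epsilon-greedy episode; the returned move sequence is the
    # observational behavior that step 2 of the pipeline would consume.
    state, trace = START, []
    for _ in range(max_steps):
        moves = legal_moves(state)
        if random.random() < eps:
            move = random.choice(moves)
        else:
            move = max(moves, key=lambda m: Q[(state, m)])
        nxt = apply_move(state, move)
        reward = 1.0 if nxt == GOAL else -0.01   # small step cost, goal bonus
        best_next = max((Q[(nxt, m)] for m in legal_moves(nxt)), default=0.0)
        Q[(state, move)] += alpha * (reward + gamma * best_next - Q[(state, move)])
        trace.append(move)
        state = nxt
        if state == GOAL:
            break
    return trace

for _ in range(2000):
    episode()
print("moves after training:", len(episode()))  # approaches the optimal 7

Recording the trace at successive checkpoints during training is what allows the later stages to compare the extracted automata over time and locate the point where the representation stabilizes.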