• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过在AlphaZero中进行概念发现与转移来弥合人类与人工智能之间的知识差距。

Bridging the human-AI knowledge gap through concept discovery and transfer in AlphaZero.

作者信息

Schut Lisa, Tomašev Nenad, McGrath Thomas, Hassabis Demis, Paquet Ulrich, Kim Been

机构信息

Oxford Applied and Theoretical Machine Learning Group, Department of Computer Science, University of Oxford, Oxford OX1 3QG, United Kingdom.

Google DeepMind, London N1C 4AG, United Kingdom.

出版信息

Proc Natl Acad Sci U S A. 2025 Apr;122(13):e2406675122. doi: 10.1073/pnas.2406675122. Epub 2025 Mar 26.

DOI:10.1073/pnas.2406675122
PMID:40138346
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12002201/
Abstract

AI systems have attained superhuman performance across various domains. If the hidden knowledge encoded in these highly capable systems can be leveraged, human knowledge and performance can be advanced. Yet, this internal knowledge is difficult to extract. Due to the vast space of possible internal representations, searching for meaningful new conceptual knowledge can be like finding a needle in a haystack. Here, we introduce a method that extracts new chess concepts from AlphaZero, an AI system that mastered chess via self-play without human supervision. Our method excavates vectors that represent concepts from AlphaZero's internal representations using convex optimization, and filters the concepts based on teachability (whether the concept is transferable to another AI agent) and novelty (whether the concept contains information not present in human chess games). These steps ensure that the discovered concepts are useful and meaningful. For the resulting set of concepts, prototypes (chess puzzle-solution pairs) are presented to experts for final validation. In a preliminary human study, four top chess grandmasters (all former or current world chess champions) were evaluated on their ability to solve concept prototype positions. All grandmasters showed improvement after the learning phase, suggesting that the concepts are at the frontier of human understanding. Despite the small scale, our result is a proof of concept demonstrating the possibility of leveraging knowledge from a highly capable AI system to advance the frontier of human knowledge; a development that could bear profound implications and shape how we interact with AI systems across many applications.

摘要

人工智能系统在各个领域都取得了超越人类的表现。如果能够利用这些高性能系统中编码的隐藏知识,人类的知识和表现就能得到提升。然而,这种内部知识很难提取。由于可能的内部表示空间巨大,寻找有意义的新概念知识就如同大海捞针。在此,我们介绍一种从AlphaZero中提取新国际象棋概念的方法,AlphaZero是一个通过自我对弈在无人类监督的情况下掌握国际象棋的人工智能系统。我们的方法使用凸优化从AlphaZero的内部表示中挖掘代表概念的向量,并基于可教性(该概念是否可转移到另一个智能体)和新颖性(该概念是否包含人类国际象棋游戏中不存在的信息)对概念进行筛选。这些步骤确保所发现的概念是有用且有意义的。对于所得的概念集,向专家展示原型(国际象棋谜题 - 解决方案对)以进行最终验证。在一项初步的人类研究中,对四位顶尖国际象棋特级大师(均为前世界冠军或现任世界冠军)解决概念原型棋局的能力进行了评估。所有特级大师在学习阶段后都有进步,这表明这些概念处于人类理解的前沿。尽管规模较小,但我们的结果是一个概念验证,证明了利用高性能人工智能系统的知识来推进人类知识前沿的可能性;这一发展可能会产生深远影响,并塑造我们在许多应用中与人工智能系统交互的方式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/eeaf1d0dfb00/pnas.2406675122fig07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/476c62a0c0b2/pnas.2406675122fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/e5c6dedb3ec5/pnas.2406675122fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/57c25624dfaa/pnas.2406675122fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/6aac868d30bf/pnas.2406675122fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/498884c86eb9/pnas.2406675122fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/5c1126765419/pnas.2406675122fig06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/eeaf1d0dfb00/pnas.2406675122fig07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/476c62a0c0b2/pnas.2406675122fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/e5c6dedb3ec5/pnas.2406675122fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/57c25624dfaa/pnas.2406675122fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/6aac868d30bf/pnas.2406675122fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/498884c86eb9/pnas.2406675122fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/5c1126765419/pnas.2406675122fig06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995a/12002201/eeaf1d0dfb00/pnas.2406675122fig07.jpg

相似文献

1
Bridging the human-AI knowledge gap through concept discovery and transfer in AlphaZero.通过在AlphaZero中进行概念发现与转移来弥合人类与人工智能之间的知识差距。
Proc Natl Acad Sci U S A. 2025 Apr;122(13):e2406675122. doi: 10.1073/pnas.2406675122. Epub 2025 Mar 26.
2
Acquisition of chess knowledge in AlphaZero.阿尔法零(AlphaZero)获取国际象棋知识。
Proc Natl Acad Sci U S A. 2022 Nov 22;119(47):e2206625119. doi: 10.1073/pnas.2206625119. Epub 2022 Nov 14.
3
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play.一种通过自我对弈掌握国际象棋、将棋和围棋的通用强化学习算法。
Science. 2018 Dec 7;362(6419):1140-1144. doi: 10.1126/science.aar6404.
4
Chess AI: Competing Paradigms for Machine Intelligence.国际象棋人工智能:机器智能的竞争范式
Entropy (Basel). 2022 Apr 14;24(4):550. doi: 10.3390/e24040550.
5
AlphaDDA: strategies for adjusting the playing strength of a fully trained AlphaZero system to a suitable human training partner.AlphaDDA:将完全训练好的AlphaZero系统的游戏强度调整到适合人类训练伙伴的策略。
PeerJ Comput Sci. 2022 Oct 4;8:e1123. doi: 10.7717/peerj-cs.1123. eCollection 2022.
6
Information based explanation methods for deep learning agents-with applications on large open-source chess models.用于深度学习智能体的基于信息的解释方法——在大型开源国际象棋模型上的应用
Sci Rep. 2024 Aug 30;14(1):20174. doi: 10.1038/s41598-024-70701-2.
7
Artificial intelligence for breast cancer detection and its health technology assessment: A scoping review.用于乳腺癌检测的人工智能及其健康技术评估:一项范围综述。
Comput Biol Med. 2025 Jan;184:109391. doi: 10.1016/j.compbiomed.2024.109391. Epub 2024 Nov 22.
8
Mastering the game of Stratego with model-free multiagent reinforcement learning.运用无模型多智能体强化学习掌握 Stratego 游戏。
Science. 2022 Dec 2;378(6623):990-996. doi: 10.1126/science.add4679. Epub 2022 Dec 1.
9
[The Triumph of "Stupidity" : Deep Blue`s Victory over Garri Kasparov. The Controversy about its Impact on Artficial Intelligence Research].["“愚蠢”的胜利:深蓝击败加里·卡斯帕罗夫。关于其对人工智能研究影响的争议"]
NTM. 2017 Mar;25(1):1-33. doi: 10.1007/s00048-017-0167-6.
10
Leveraging code-free deep learning for pill recognition in clinical settings: A multicenter, real-world study of performance across multiple platforms.利用无代码深度学习在临床环境中进行药丸识别:在多个平台上进行的多中心真实世界性能研究。
Artif Intell Med. 2024 Apr;150:102844. doi: 10.1016/j.artmed.2024.102844. Epub 2024 Mar 13.

本文引用的文献

1
Impossibility theorems for feature attribution.特征归因的不可能定理。
Proc Natl Acad Sci U S A. 2024 Jan 9;121(2):e2304406120. doi: 10.1073/pnas.2304406120. Epub 2024 Jan 5.
2
Acquisition of chess knowledge in AlphaZero.阿尔法零(AlphaZero)获取国际象棋知识。
Proc Natl Acad Sci U S A. 2022 Nov 22;119(47):e2206625119. doi: 10.1073/pnas.2206625119. Epub 2022 Nov 14.
3
MoËT: Mixture of Expert Trees and its application to verifiable reinforcement learning.Moët:混合专家树及其在可验证强化学习中的应用。
Neural Netw. 2022 Jul;151:34-47. doi: 10.1016/j.neunet.2022.03.022. Epub 2022 Mar 23.
4
Mastering Atari, Go, chess and shogi by planning with a learned model.通过使用学习模型进行规划,掌握 Atari、围棋、国际象棋和将棋。
Nature. 2020 Dec;588(7839):604-609. doi: 10.1038/s41586-020-03051-4. Epub 2020 Dec 23.
5
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play.一种通过自我对弈掌握国际象棋、将棋和围棋的通用强化学习算法。
Science. 2018 Dec 7;362(6419):1140-1144. doi: 10.1126/science.aar6404.
6
Convex and semi-nonnegative matrix factorizations.凸和半非负矩阵分解。
IEEE Trans Pattern Anal Mach Intell. 2010 Jan;32(1):45-55. doi: 10.1109/TPAMI.2008.277.