超级人工智能在单挑无限注德州扑克中击败顶级职业选手：Libratus 胜出。

Superhuman AI for heads-up no-limit poker: Libratus beats top professionals.

机构信息

Computer Science Department, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA.

出版信息

Science. 2018 Jan 26;359(6374):418-424. doi: 10.1126/science.aao1733. Epub 2017 Dec 17.

Abstract

No-limit Texas hold'em is the most popular form of poker. Despite artificial intelligence (AI) successes in perfect-information games, the private information and massive game tree have made no-limit poker difficult to tackle. We present Libratus, an AI that, in a 120,000-hand competition, defeated four top human specialist professionals in heads-up no-limit Texas hold'em, the leading benchmark and long-standing challenge problem in imperfect-information game solving. Our game-theoretic approach features application-independent techniques: an algorithm for computing a blueprint for the overall strategy, an algorithm that fleshes out the details of the strategy for subgames that are reached during play, and a self-improver algorithm that fixes potential weaknesses that opponents have identified in the blueprint strategy.

摘要

无限注德州扑克是最受欢迎的扑克游戏形式。尽管人工智能（AI）在完美信息游戏中取得了成功，但由于私人信息和庞大的游戏树，无限注德州扑克仍然难以解决。我们介绍 Libratus，这是一种人工智能，在 120,000 手牌的比赛中，它击败了四名顶尖的人类专业高手，在单挑无限注德州扑克中，这是一个领先的基准和长期存在的信息不完美游戏解决挑战问题。我们的博弈论方法具有独立于应用的技术：一种用于计算总体策略蓝图的算法，一种用于充实在游戏过程中达到的子游戏策略细节的算法，以及一种自我改进算法，可以修复对手在蓝图策略中发现的潜在弱点。

相似文献

Superhuman AI for heads-up no-limit poker: Libratus beats top professionals.超级人工智能在单挑无限注德州扑克中击败顶级职业选手：Libratus 胜出。

Science. 2018 Jan 26;359(6374):418-424. doi: 10.1126/science.aao1733. Epub 2017 Dec 17.

Superhuman AI for multiplayer poker.用于多人扑克的超级人工智能。

Science. 2019 Aug 30;365(6456):885-890. doi: 10.1126/science.aay2400. Epub 2019 Jul 11.

DeepStack: Expert-level artificial intelligence in heads-up no-limit poker.深筹码：单人无限注德州扑克中的专家级人工智能。

Science. 2017 May 5;356(6337):508-513. doi: 10.1126/science.aam6960. Epub 2017 Mar 2.

Computer science. Heads-up limit hold'em poker is solved.计算机科学。顶对限制加注德州扑克已被破解。

Science. 2015 Jan 9;347(6218):145-9. doi: 10.1126/science.1259433.

Student of Games: A unified learning algorithm for both perfect and imperfect information games.博弈学习者：一种适用于完全信息博弈和不完全信息博弈的统一学习算法。

Sci Adv. 2023 Nov 17;9(46):eadg3256. doi: 10.1126/sciadv.adg3256. Epub 2023 Nov 15.

Commentary: Heads-up limit hold'em poker is solved.评论：单挑无限注德州扑克已被破解。

Front Psychol. 2018 Feb 21;9:210. doi: 10.3389/fpsyg.2018.00210. eCollection 2018.

OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research.OpenHoldem：大规模不完全信息博弈研究的一个基准

IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):14618-14632. doi: 10.1109/TNNLS.2023.3280186. Epub 2024 Oct 7.

No limit: AI poker bot is first to beat professionals at multiplayer game.无极限：人工智能扑克机器人首次在多人游戏中击败专业选手。

Nature. 2019 Jul;571(7765):307-308. doi: 10.1038/d41586-019-02156-9.

Synergistic Information Processing Encrypts Strategic Reasoning in Poker.协同信息处理对扑克中的策略推理进行加密。

Cogn Sci. 2018 Jun 14. doi: 10.1111/cogs.12632.

Is poker a game of skill or chance? A quasi-experimental study.扑克是技巧游戏还是运气游戏？一项准实验研究。

J Gambl Stud. 2013 Sep;29(3):535-50. doi: 10.1007/s10899-012-9327-8.

引用本文的文献

Gambling as Work: A Study of German Poker Players.赌博与工作：一项德国扑克玩家的研究。

J Gambl Stud. 2024 Sep;40(3):1653-1678. doi: 10.1007/s10899-023-10277-0. Epub 2023 Dec 22.

Student of Games: A unified learning algorithm for both perfect and imperfect information games.博弈学习者：一种适用于完全信息博弈和不完全信息博弈的统一学习算法。

Sci Adv. 2023 Nov 17;9(46):eadg3256. doi: 10.1126/sciadv.adg3256. Epub 2023 Nov 15.

Breaking the traditional: a survey of algorithmic mechanism design applied to economic and complex environments.打破传统：对应用于经济和复杂环境的算法机制设计的一项调查。

Neural Comput Appl. 2023 May 20:1-30. doi: 10.1007/s00521-023-08647-1.

Competitive and cooperative games for probing the neural basis of social decision-making in animals.用于探测动物社会决策神经基础的竞争与合作游戏。

Neurosci Biobehav Rev. 2023 Jun;149:105158. doi: 10.1016/j.neubiorev.2023.105158. Epub 2023 Apr 4.

Intelligent Control of Groundwater in Slopes with Deep Reinforcement Learning.基于深度强化学习的边坡地下水智能控制。

Sensors (Basel). 2022 Nov 4;22(21):8503. doi: 10.3390/s22218503.

Application of Reinforcement Learning in Multiagent Intelligent Decision-Making.强化学习在多智能体智能决策中的应用。

Comput Intell Neurosci. 2022 Sep 16;2022:8683616. doi: 10.1155/2022/8683616. eCollection 2022.

: Stacked Generalization with for Highly Accurate Predictions of Polymer Bandgap.用于聚合物带隙高精度预测的堆叠泛化方法。（你提供的原文中“with”后面似乎缺少具体内容，我按照常见理解进行了补充翻译，若有偏差请指出。）

ACS Omega. 2022 Aug 15;7(34):29787-29793. doi: 10.1021/acsomega.2c02554. eCollection 2022 Aug 30.

Diving into the Deep End: Machine Learning for the Chemist.深入探讨：面向化学家的机器学习

ACS Omega. 2022 Jul 20;7(30):25906-25908. doi: 10.1021/acsomega.2c04373. eCollection 2022 Aug 2.

Adoption of AI-Enabled Tools in Social Development Organizations in India: An Extension of UTAUT Model.人工智能支持的工具在印度社会发展组织中的应用：技术接受与使用整合理论模型的扩展

Front Psychol. 2022 Jun 20;13:893691. doi: 10.3389/fpsyg.2022.893691. eCollection 2022.

Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.通过演员-评论家强化学习实现多人扑克的最优策略

Entropy (Basel). 2022 May 30;24(6):774. doi: 10.3390/e24060774.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

超级人工智能在单挑无限注德州扑克中击败顶级职业选手：Libratus 胜出。

Superhuman AI for heads-up no-limit poker: Libratus beats top professionals.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献