Computer Science Department, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA.
Science. 2018 Jan 26;359(6374):418-424. doi: 10.1126/science.aao1733. Epub 2017 Dec 17.
No-limit Texas hold'em is the most popular form of poker. Despite artificial intelligence (AI) successes in perfect-information games, the private information and massive game tree have made no-limit poker difficult to tackle. We present Libratus, an AI that, in a 120,000-hand competition, defeated four top human specialist professionals in heads-up no-limit Texas hold'em, the leading benchmark and long-standing challenge problem in imperfect-information game solving. Our game-theoretic approach features application-independent techniques: an algorithm for computing a blueprint for the overall strategy, an algorithm that fleshes out the details of the strategy for subgames that are reached during play, and a self-improver algorithm that fixes potential weaknesses that opponents have identified in the blueprint strategy.