Zheng Stephan, Trott Alexander, Srinivasa Sunil, Parkes David C, Socher Richard
Salesforce Research, Palo Alto, CA, USA.
Harvard University, Cambridge, MA, USA.
Sci Adv. 2022 May 6;8(18):eabk2607. doi: 10.1126/sciadv.abk2607. Epub 2022 May 4.
Artificial intelligence (AI) and reinforcement learning (RL) have improved many areas but are not yet widely adopted in economic policy design, mechanism design, or economics at large. The AI Economist is a two-level, deep RL framework for policy design in which agents and a social planner coadapt. In particular, the AI Economist uses structured curriculum learning to stabilize the challenging two-level, coadaptive learning problem. We validate this framework in the domain of taxation. In one-step economies, the AI Economist recovers the optimal tax policy of economic theory. In spatiotemporal economies, the AI Economist substantially improves both utilitarian social welfare and the trade-off between equality and productivity over baselines. It does so despite emergent tax-gaming strategies while accounting for emergent labor specialization, agent interactions, and behavioral change. These results demonstrate that two-level, deep RL complements economic theory and unlocks an AI-based approach to designing and understanding economic policy.
人工智能(AI)和强化学习(RL)已在许多领域取得进展,但在经济政策设计、机制设计或整个经济学领域尚未得到广泛应用。“人工智能经济学家”是一个用于政策设计的两级深度强化学习框架,其中智能体和社会规划者共同适应。具体而言,“人工智能经济学家”使用结构化课程学习来稳定具有挑战性的两级共同适应学习问题。我们在税收领域验证了这一框架。在单步经济中,“人工智能经济学家”恢复了经济理论的最优税收政策。在时空经济中,“人工智能经济学家”在基线之上大幅提高了功利主义社会福利以及平等与生产率之间的权衡。尽管出现了税收博弈策略,但它在考虑到新兴劳动专业化、智能体交互和行为变化的情况下仍能做到这一点。这些结果表明,两级深度强化学习补充了经济理论,并开启了一种基于人工智能的经济政策设计和理解方法。