《人工智能经济学家：通过两级深度多智能体强化学习进行税收政策设计》

The AI Economist: Taxation policy design via two-level deep multiagent reinforcement learning.

作者信息

Zheng Stephan, Trott Alexander, Srinivasa Sunil, Parkes David C, Socher Richard

机构信息

Salesforce Research, Palo Alto, CA, USA.

Harvard University, Cambridge, MA, USA.

出版信息

Sci Adv. 2022 May 6;8(18):eabk2607. doi: 10.1126/sciadv.abk2607. Epub 2022 May 4.

DOI:10.1126/sciadv.abk2607

PMID:35507657

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9067926/

Abstract

Artificial intelligence (AI) and reinforcement learning (RL) have improved many areas but are not yet widely adopted in economic policy design, mechanism design, or economics at large. The AI Economist is a two-level, deep RL framework for policy design in which agents and a social planner coadapt. In particular, the AI Economist uses structured curriculum learning to stabilize the challenging two-level, coadaptive learning problem. We validate this framework in the domain of taxation. In one-step economies, the AI Economist recovers the optimal tax policy of economic theory. In spatiotemporal economies, the AI Economist substantially improves both utilitarian social welfare and the trade-off between equality and productivity over baselines. It does so despite emergent tax-gaming strategies while accounting for emergent labor specialization, agent interactions, and behavioral change. These results demonstrate that two-level, deep RL complements economic theory and unlocks an AI-based approach to designing and understanding economic policy.

摘要

人工智能（AI）和强化学习（RL）已在许多领域取得进展，但在经济政策设计、机制设计或整个经济学领域尚未得到广泛应用。“人工智能经济学家”是一个用于政策设计的两级深度强化学习框架，其中智能体和社会规划者共同适应。具体而言，“人工智能经济学家”使用结构化课程学习来稳定具有挑战性的两级共同适应学习问题。我们在税收领域验证了这一框架。在单步经济中，“人工智能经济学家”恢复了经济理论的最优税收政策。在时空经济中，“人工智能经济学家”在基线之上大幅提高了功利主义社会福利以及平等与生产率之间的权衡。尽管出现了税收博弈策略，但它在考虑到新兴劳动专业化、智能体交互和行为变化的情况下仍能做到这一点。这些结果表明，两级深度强化学习补充了经济理论，并开启了一种基于人工智能的经济政策设计和理解方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4a7/9067926/3d827238d8ef/sciadv.abk2607-f1.jpg

相似文献

The AI Economist: Taxation policy design via two-level deep multiagent reinforcement learning.《人工智能经济学家：通过两级深度多智能体强化学习进行税收政策设计》

Sci Adv. 2022 May 6;8(18):eabk2607. doi: 10.1126/sciadv.abk2607. Epub 2022 May 4.

Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain.深度强化学习探索：从单智能体到多智能体领域

IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):8762-8782. doi: 10.1109/TNNLS.2023.3236361. Epub 2024 Jul 8.

Human locomotion with reinforcement learning using bioinspired reward reshaping strategies.基于生物启发式奖励重塑策略的强化学习的人类运动。

Med Biol Eng Comput. 2021 Jan;59(1):243-256. doi: 10.1007/s11517-020-02309-3. Epub 2021 Jan 8.

Learning multi-agent cooperation.学习多智能体协作。

Front Neurorobot. 2022 Oct 14;16:932671. doi: 10.3389/fnbot.2022.932671. eCollection 2022.

Off-Policy Reinforcement Learning for Synchronization in Multiagent Graphical Games.多智能体图博弈中的非策略强化学习同步。

IEEE Trans Neural Netw Learn Syst. 2017 Oct;28(10):2434-2445. doi: 10.1109/TNNLS.2016.2609500. Epub 2017 Apr 17.

Application of Reinforcement Learning in Multiagent Intelligent Decision-Making.强化学习在多智能体智能决策中的应用。

Comput Intell Neurosci. 2022 Sep 16;2022:8683616. doi: 10.1155/2022/8683616. eCollection 2022.

Deep reinforcement learning to study spatial navigation, learning and memory in artificial and biological agents.深度强化学习用于研究人工和生物智能体中的空间导航、学习与记忆。

Biol Cybern. 2021 Apr;115(2):131-134. doi: 10.1007/s00422-021-00862-0. Epub 2021 Feb 9.

Symmetry reduction for deep reinforcement learning active control of chaotic spatiotemporal dynamics.用于混沌时空动力学深度强化学习主动控制的对称性约化

Phys Rev E. 2021 Jul;104(1-1):014210. doi: 10.1103/PhysRevE.104.014210.

A trade based view on casino taxation: market conditions.基于贸易视角的赌场税收：市场状况

J Gambl Stud. 2015 Jun;31(2):585-606. doi: 10.1007/s10899-013-9407-4.

Harnessing artificial intelligence (AI) to increase wellbeing for all: The case for a new technology diplomacy.利用人工智能提升全民福祉：新技术外交的必要性

Telecomm Policy. 2020 Jul;44(6):101988. doi: 10.1016/j.telpol.2020.101988. Epub 2020 May 6.

引用本文的文献

Tabula rasa agents display emergent in-group behavior.白板智能体表现出新兴的群体行为。

Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2319947121. doi: 10.1073/pnas.2319947121. Epub 2025 Jun 16.

Deep mechanism design: Learning social and economic policies for human benefit.深度机制设计：学习造福人类的社会和经济政策。

Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2319949121. doi: 10.1073/pnas.2319949121. Epub 2025 Jun 16.

Artificial intelligence orchestration for text-based ultrasonic simulation via self-review by multi-large language model agents.通过多大型语言模型代理进行自我审查，实现基于文本的超声模拟的人工智能编排。

Sci Rep. 2025 Apr 11;15(1):12474. doi: 10.1038/s41598-025-97498-y.

Quantifying the use and potential benefits of artificial intelligence in scientific research.量化人工智能在科学研究中的应用及潜在益处。

Nat Hum Behav. 2024 Dec;8(12):2281-2292. doi: 10.1038/s41562-024-02020-5. Epub 2024 Oct 11.

The impact of generative artificial intelligence on socioeconomic inequalities and policy making.生成式人工智能对社会经济不平等和政策制定的影响。

PNAS Nexus. 2024 Jun 11;3(6):pgae191. doi: 10.1093/pnasnexus/pgae191. eCollection 2024 Jun.

Investigation toward the economic feasibility of personalized medicine for healthcare service providers: the case of bladder cancer.医疗服务提供者个性化医疗的经济可行性调查：以膀胱癌为例。

Front Med (Lausanne). 2024 May 14;11:1388685. doi: 10.3389/fmed.2024.1388685. eCollection 2024.

Framework-based qualitative analysis of free responses of Large Language Models: Algorithmic fidelity.基于框架的大语言模型自由回答定性分析：算法保真度

PLoS One. 2024 Mar 12;19(3):e0300024. doi: 10.1371/journal.pone.0300024. eCollection 2024.

Breaking the traditional: a survey of algorithmic mechanism design applied to economic and complex environments.打破传统：对应用于经济和复杂环境的算法机制设计的一项调查。

Neural Comput Appl. 2023 May 20:1-30. doi: 10.1007/s00521-023-08647-1.

Bridging adaptive management and reinforcement learning for more robust decisions.通过自适应管理和强化学习实现更稳健的决策。

Philos Trans R Soc Lond B Biol Sci. 2023 Jul 17;378(1881):20220195. doi: 10.1098/rstb.2022.0195. Epub 2023 May 29.

Machine learning for a sustainable energy future.面向可持续能源未来的机器学习。

Nat Rev Mater. 2023;8(3):202-215. doi: 10.1038/s41578-022-00490-5. Epub 2022 Oct 18.

本文引用的文献

Grandmaster level in StarCraft II using multi-agent reinforcement learning.星际争霸 II 中的大师级水平使用多智能体强化学习。

Nature. 2019 Nov;575(7782):350-354. doi: 10.1038/s41586-019-1724-z. Epub 2019 Oct 30.

Mastering the game of Go without human knowledge.无需人类知识即可掌握围棋游戏。

Nature. 2017 Oct 18;550(7676):354-359. doi: 10.1038/nature24270.

Income inequality and health: what have we learned so far?收入不平等与健康：我们目前了解到了什么？

Epidemiol Rev. 2004;26:78-91. doi: 10.1093/epirev/mxh003.

Agent-based modeling: methods and techniques for simulating human systems.基于主体的建模：模拟人类系统的方法与技术。

Proc Natl Acad Sci U S A. 2002 May 14;99 Suppl 3(Suppl 3):7280-7. doi: 10.1073/pnas.082080899.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

《人工智能经济学家：通过两级深度多智能体强化学习进行税收政策设计》

The AI Economist: Taxation policy design via two-level deep multiagent reinforcement learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献