Suppr超能文献

《人工智能经济学家:通过两级深度多智能体强化学习进行税收政策设计》

The AI Economist: Taxation policy design via two-level deep multiagent reinforcement learning.

作者信息

Zheng Stephan, Trott Alexander, Srinivasa Sunil, Parkes David C, Socher Richard

机构信息

Salesforce Research, Palo Alto, CA, USA.

Harvard University, Cambridge, MA, USA.

出版信息

Sci Adv. 2022 May 6;8(18):eabk2607. doi: 10.1126/sciadv.abk2607. Epub 2022 May 4.

Abstract

Artificial intelligence (AI) and reinforcement learning (RL) have improved many areas but are not yet widely adopted in economic policy design, mechanism design, or economics at large. The AI Economist is a two-level, deep RL framework for policy design in which agents and a social planner coadapt. In particular, the AI Economist uses structured curriculum learning to stabilize the challenging two-level, coadaptive learning problem. We validate this framework in the domain of taxation. In one-step economies, the AI Economist recovers the optimal tax policy of economic theory. In spatiotemporal economies, the AI Economist substantially improves both utilitarian social welfare and the trade-off between equality and productivity over baselines. It does so despite emergent tax-gaming strategies while accounting for emergent labor specialization, agent interactions, and behavioral change. These results demonstrate that two-level, deep RL complements economic theory and unlocks an AI-based approach to designing and understanding economic policy.

摘要

人工智能(AI)和强化学习(RL)已在许多领域取得进展,但在经济政策设计、机制设计或整个经济学领域尚未得到广泛应用。“人工智能经济学家”是一个用于政策设计的两级深度强化学习框架,其中智能体和社会规划者共同适应。具体而言,“人工智能经济学家”使用结构化课程学习来稳定具有挑战性的两级共同适应学习问题。我们在税收领域验证了这一框架。在单步经济中,“人工智能经济学家”恢复了经济理论的最优税收政策。在时空经济中,“人工智能经济学家”在基线之上大幅提高了功利主义社会福利以及平等与生产率之间的权衡。尽管出现了税收博弈策略,但它在考虑到新兴劳动专业化、智能体交互和行为变化的情况下仍能做到这一点。这些结果表明,两级深度强化学习补充了经济理论,并开启了一种基于人工智能的经济政策设计和理解方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4a7/9067926/3d827238d8ef/sciadv.abk2607-f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验