Wang Linghao, Jiang Zheyuan, Hu Chenke, Zhao Jun, Zhu Zheng, Chen Xiqun, Wang Ziyi, Liu Tianming, He Guibing, Yin Yafeng, Lee Der-Horng
Institute of Intelligent Transportation Systems, College of Civil Engineering and Architecture, Zhejiang University, Hangzhou, China.
Institute of Intelligent Transportation Systems & Polytechnic Institute, College of Civil Engineering and Architecture, Zhejiang University, Hangzhou, China.
iScience. 2025 May 21;28(6):112711. doi: 10.1016/j.isci.2025.112711. eCollection 2025 Jun 20.
Artificial intelligence (AI) is trying to catch up with human beings in many aspects. In this track, the potential for replacing human decision-making with AI models, such as large language models (LLMs), has become a topic of considerable debate. To test the performance of AI in daily decision-making, we compared humans, LLMs, and reinforcement learning (RL) in a multi-day commute decision-making game. It denotes a collaborative decision-making process where individual and collective outcomes are interdependent. We examined various performance metrics, including overall system results, system convergence progress, individual decision dynamics, and individual decision mechanisms. We find that LLMs exhibit human-like abilities to learn from historical experience and achieve convergence when making daily commute decisions. However, in the context of multi-person collaboration, LLMs still face challenges, such as weak perception of others' choices, poor group decision-making mechanisms, and a lack of physical knowledge.
人工智能(AI)正试图在许多方面追赶人类。在这条道路上,用诸如大语言模型(LLMs)等AI模型取代人类决策的可能性已成为一个备受争议的话题。为了测试AI在日常决策中的表现,我们在一个多日通勤决策游戏中对人类、大语言模型和强化学习(RL)进行了比较。它表示一个协作决策过程,其中个人和集体结果相互依存。我们研究了各种性能指标,包括整体系统结果、系统收敛进度、个体决策动态和个体决策机制。我们发现,大语言模型在做出日常通勤决策时表现出类似人类的从历史经验中学习并实现收敛的能力。然而,在多人协作的背景下,大语言模型仍然面临挑战,比如对他人选择的感知较弱、群体决策机制不佳以及缺乏物理知识。