Ministry of Education Key Laboratory for Biodiversity and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China.
Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, People's Republic of China.
Sci Rep. 2021 Dec 8;11(1):23593. doi: 10.1038/s41598-021-02979-5.
A principle of choice in animal decision-making named probability matching (PM) has long been detected in animals, and can arise from different decision-making strategies. Little is known about how environmental stochasticity may influence the switching time of these different decision-making strategies. Here we address this problem using a combination of behavioral and theoretical approaches, and show, that although a simple Win-Stay-Loss-Shift (WSLS) strategy can generate PM in binary-choice tasks theoretically, budgerigars (Melopsittacus undulates) actually apply a range of sub-tactics more often when they are expected to make more accurate decisions. Surprisingly, budgerigars did not get more rewards than would be predicted when adopting a WSLS strategy, and their decisions also exhibited PM. Instead, budgerigars followed a learning strategy based on reward history, which potentially benefits individuals indirectly from paying lower switching costs. Furthermore, our data suggest that more stochastic environments may promote reward learning through significantly less switching. We suggest that switching costs driven by the stochasticity of an environmental niche can potentially represent an important selection pressure associated with decision-making that may play a key role in driving the evolution of complex cognition in animals.
在动物决策中,有一种名为概率匹配(PM)的选择原则,长期以来一直被发现存在于动物中,并且可能源于不同的决策策略。对于环境随机性如何影响这些不同决策策略的切换时间,我们知之甚少。在这里,我们使用行为和理论相结合的方法来解决这个问题,结果表明,尽管简单的“赢留输换”(WSLS)策略在理论上可以在二元选择任务中产生 PM,但虎皮鹦鹉(Melopsittacus undulates)实际上在需要做出更准确决策时更经常地采用一系列策略。令人惊讶的是,当虎皮鹦鹉采用 WSLS 策略时,它们并没有获得比预期更多的奖励,而且它们的决策也表现出 PM。相反,虎皮鹦鹉遵循一种基于奖励历史的学习策略,这可能会间接地从降低切换成本中受益。此外,我们的数据表明,更随机的环境可能会通过显著减少切换来促进奖励学习。我们认为,由环境小生境的随机性驱动的切换成本可能代表与决策相关的一个重要选择压力,这可能在推动动物复杂认知的进化中发挥关键作用。