Department of Physiology, Anatomy & Genetics, University of Oxford, Oxford, UK.
Institute of Behavioral Neuroscience, University College London, London, UK.
Nat Commun. 2024 Aug 17;15(1):7093. doi: 10.1038/s41467-024-51393-8.
Perceptual decisions should depend on sensory evidence. However, such decisions are also influenced by past choices and outcomes. These choice history biases may reflect advantageous strategies to exploit temporal regularities of natural environments. However, it is unclear whether and how observers can adapt their choice history biases to different temporal regularities, to exploit the multitude of temporal correlations that exist in nature. Here, we show that male mice adapt their perceptual choice history biases to different temporal regularities of visual stimuli. This adaptation was slow, evolving over hundreds of trials across several days. It occurred alongside a fast non-adaptive choice history bias, limited to a few trials. Both fast and slow trial history effects are well captured by a normative reinforcement learning algorithm with multi-trial belief states, comprising both current trial sensory and previous trial memory states. We demonstrate that dorsal striatal dopamine tracks predictions of the model and behavior, suggesting that striatal dopamine reports reward predictions associated with adaptive choice history biases. Our results reveal the adaptive nature of perceptual choice history biases and shed light on their underlying computational principles and neural correlates.
感知决策应该取决于感官证据。然而,这些决策也受到过去选择和结果的影响。这些选择历史偏差可能反映了利用自然环境时间规律的有利策略。然而,目前尚不清楚观察者是否以及如何能够适应不同的时间规律,以利用自然界中存在的大量时间相关性。在这里,我们表明雄性小鼠会适应视觉刺激的不同时间规律的感知选择历史偏差。这种适应过程缓慢,需要数百次试验才能在数天内完成。它伴随着一个快速的非自适应选择历史偏差,仅限于几次试验。快速和慢速试验历史效应都被一个具有多试验信念状态的规范性强化学习算法很好地捕捉到,该算法包含当前试验的感觉和前一个试验的记忆状态。我们证明背侧纹状体多巴胺跟踪模型和行为的预测,表明纹状体多巴胺报告与自适应选择历史偏差相关的奖励预测。我们的研究结果揭示了感知选择历史偏差的适应性,并为其潜在的计算原理和神经关联提供了线索。