School of Psychological Sciences, Tel Aviv University, Tel Aviv, Israel.
Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel.
Sci Adv. 2023 Oct 20;9(42):eadi2704. doi: 10.1126/sciadv.adi2704.
Current studies suggest that individuals estimate the value of their choices based on observed feedback. Here, we ask whether individuals also update the value of their unchosen actions, even when the associated feedback remains unknown. One hundred seventy-eight individuals completed a multi-armed bandit task, making choices to gain rewards. We found robust evidence suggesting latent value updating of unchosen actions based on the chosen action's outcome. Computational modeling results suggested that this effect is mainly explained by a value updating mechanism whereby individuals integrate the outcome history for choosing an option with that of rejecting the alternative. Properties of the deliberation (i.e., duration/difficulty) did not moderate the latent value updating of unchosen actions, suggesting that memory traces generated during deliberation might play a smaller role in this specific phenomenon than previously thought. We discuss the mechanisms facilitating credit assignment to unchosen actions and their implications for human decision-making.
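The latent updating described in the abstract can be illustrated with a minimal sketch. It is not the authors' fitted model. The function name `update_values`, the two learning rates, the reward coding in [0, 1], and the assumption that the unchosen option moves in the opposite direction to the chosen outcome are all hypothetical choices made for illustration; the abstract itself does not specify them.

```python
def update_values(q, chosen, unchosen, reward,
                  alpha_chosen=0.3, alpha_unchosen=0.1):
    """Return a new value table after one trial.

    q        : dict mapping option name -> current value estimate
    chosen   : the option that was selected (its outcome is observed)
    unchosen : the offered option that was rejected (no feedback shown)
    reward   : observed outcome of the chosen option, coded in [0, 1]
    """
    new_q = dict(q)

    # Standard delta-rule update from the observed feedback.
    new_q[chosen] = q[chosen] + alpha_chosen * (reward - q[chosen])

    # Latent update of the rejected option. ASSUMPTION: the unchosen value
    # moves toward the inverse of the chosen outcome (1 - reward).
    # The direction and the separate learning rate are illustrative only.
    new_q[unchosen] = q[unchosen] + alpha_unchosen * ((1.0 - reward) - q[unchosen])

    return new_q


# Example trial: options A and B are offered, A is chosen and rewarded.
values = {"A": 0.5, "B": 0.5}
values = update_values(values, chosen="A", unchosen="B", reward=1.0,
                       alpha_chosen=0.5, alpha_unchosen=0.2)
```

Setting `alpha_unchosen` to 0 recovers a learner that updates only from observed feedback, which gives a natural baseline for comparison under these assumptions.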