Institut national de la santé et de la recherche médicale (INSERM) & École normale supérieure (ENS), Paris, France.
Commun Biol. 2023 Feb 8;6(1):158. doi: 10.1038/s42003-023-04544-4.
Behavioral results suggest that learning by trial-and-error (i.e., reinforcement learning) relies on a teaching signal, the prediction error, which quantifies the difference between the obtained and the expected reward. Evidence suggests that distinct cortico-striatal circuits are recruited to encode better-than-expected (positive prediction error) and worse-than-expected (negative prediction error) outcomes. A recent study by Villano et al. uses real-life behavioral data to provide evidence that different networks underlie learning from positive and negative prediction errors in humans. More specifically, they found that university students are more likely to update their beliefs about expected grades following positive rather than negative prediction errors.
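As a rough illustration of the quantities named above, the prediction error and an update that weights positive and negative errors differently can be sketched as follows. This is a minimal sketch under stated assumptions, not the model used by Villano et al.: the function name `update_expectation`, the learning rates `alpha_pos` and `alpha_neg`, and the grade values are all hypothetical and chosen only for illustration.

```python
def update_expectation(expected, obtained, alpha_pos=0.4, alpha_neg=0.2):
    """Return (new_expectation, prediction_error) after one outcome.

    prediction_error = obtained - expected, as defined in the text.
    alpha_pos and alpha_neg are hypothetical learning rates; setting
    alpha_pos > alpha_neg mimics stronger updating after
    better-than-expected outcomes.
    """
    prediction_error = obtained - expected
    # Pick the learning rate according to the sign of the prediction error.
    alpha = alpha_pos if prediction_error >= 0 else alpha_neg
    new_expectation = expected + alpha * prediction_error
    return new_expectation, prediction_error


# Illustrative values only: a student expecting a grade of 70.
better, pe_pos = update_expectation(expected=70.0, obtained=80.0)
worse, pe_neg = update_expectation(expected=70.0, obtained=60.0)
print(pe_pos, better)  # positive prediction error, larger shift in expectation
print(pe_neg, worse)   # negative prediction error, smaller shift in expectation
```

With equal-sized surprises in both directions, the expectation moves further after the positive prediction error than after the negative one, which is the kind of asymmetry the abstract describes.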