Department of Psychology, Biological Psychology, University of Cologne, Germany.
Helen Wills Neuroscience Institute, University of California at Berkeley, Berkeley, California, United States of America.
PLoS Comput Biol. 2020 Apr 20;16(4):e1007615. doi: 10.1371/journal.pcbi.1007615. eCollection 2020 Apr.
Sequential sampling models such as the drift diffusion model (DDM) have a long tradition in research on perceptual decision-making, but mounting evidence suggests that these models can account for response time (RT) distributions that arise during reinforcement learning and value-based decision-making. Building on this previous work, we implemented the DDM as the choice rule in inter-temporal choice (temporal discounting) and risky choice (probability discounting) using hierarchical Bayesian parameter estimation. We validated our approach in data from nine patients with focal lesions to the ventromedial prefrontal cortex / medial orbitofrontal cortex (vmPFC/mOFC) and nineteen age- and education-matched controls. Model comparison revealed that, for both tasks, the data were best accounted for by a variant of the drift diffusion model including a non-linear mapping from value-differences to trial-wise drift rates. Posterior predictive checks confirmed that this model provided a superior account of the relationship between value and RT. We then applied this modeling framework and 1) reproduced our previous results regarding temporal discounting in vmPFC/mOFC patients and 2) showed in a previously unpublished data set on risky choice that vmPFC/mOFC patients exhibit increased risk-taking relative to controls. Analyses of DDM parameters revealed that patients showed substantially increased non-decision times and reduced response caution during risky choice. In contrast, vmPFC/mOFC damage abolished neither scaling nor asymptote of the drift rate. Relatively intact value processing was also confirmed using DDM mixture models, which revealed that in both groups >98% of trials were better accounted for by a DDM with value modulation than by a null model without value modulation. Our results highlight that novel insights can be gained from applying sequential sampling models in studies of inter-temporal and risky decision-making in cognitive neuroscience.
序贯抽样模型,如漂移扩散模型(DDM),在研究知觉决策方面有着悠久的传统,但越来越多的证据表明,这些模型可以解释强化学习和基于价值的决策中出现的反应时间(RT)分布。在此之前的工作基础上,我们使用分层贝叶斯参数估计,将 DDM 作为跨时选择(时间折扣)和风险选择(概率折扣)的选择规则实现。我们在来自九名腹内侧前额叶皮层/内侧眶额皮层(vmPFC/mOFC)局灶性损伤患者和十九名年龄和教育匹配的对照组的数据中验证了我们的方法。模型比较表明,对于这两个任务,数据最好由一个包含从价值差异到逐次漂移率的非线性映射的漂移扩散模型的变体来解释。后验预测检查证实,该模型更好地解释了价值和 RT 之间的关系。然后,我们应用了这个建模框架,并 1)重现了我们之前关于 vmPFC/mOFC 患者的时间折扣的研究结果,2)在一个关于风险选择的未发表的数据集上表明,vmPFC/mOFC 患者的风险承担相对于对照组增加。DDM 参数的分析表明,患者在风险选择中表现出明显增加的非决策时间和降低的反应谨慎。相比之下,vmPFC/mOFC 损伤既没有取消漂移率的缩放,也没有取消其渐近线。DDM 混合模型的分析也证实了相对完整的价值处理,结果表明,在两组中,>98%的试验都可以更好地用具有价值调制的 DDM 来解释,而不是用没有价值调制的空模型来解释。我们的研究结果强调,在认知神经科学中,通过在跨时和风险决策研究中应用序贯抽样模型,可以获得新的见解。