Bénon Juliette, Lee Douglas, Hopper William, Verdeil Morgan, Pessiglione Mathias, Vinckier Fabien, Bouret Sebastien, Rouault Marion, Lebouc Raphael, Pezzulo Giovanni, Schreiweis Christiane, Burguière Eric, Daunizeau Jean
Paris Brain Institute, Paris, France.
School of Electrical and Electronic Engineering, University College Dublin, Dublin, Ireland.
Commun Psychol. 2024 Mar 27;2(1):23. doi: 10.1038/s44271-024-00071-y.
Difficult decisions typically involve mental effort, which scales with the deployment of cognitive (e.g., mnesic, attentional) resources engaged in processing decision-relevant information. But how does the brain regulate mental effort? A possibility is that the brain optimizes a resource allocation problem, whereby the amount of invested resources balances its expected cost (i.e. effort) and benefit. Our working assumption is that subjective decision confidence serves as the benefit term of the resource allocation problem, hence the "metacognitive" nature of decision control. Here, we present a computational model for the online metacognitive control of decisions or oMCD. Formally, oMCD is a Markov Decision Process that optimally solves the ensuing resource allocation problem under agnostic assumptions about the inner workings of the underlying decision system. We demonstrate how this makes oMCD a quasi-optimal control policy for a broad class of decision processes, including -but not limited to- progressive attribute integration. We disclose oMCD's main properties (in terms of choice, confidence and response time), and show that they reproduce most established empirical results in the field of value-based decision making. Finally, we discuss the possible connections between oMCD and most prominent neurocognitive theories about decision control and mental effort regulation.
困难的决策通常需要脑力付出,这种付出随着处理与决策相关信息时所投入的认知(如记忆、注意力)资源的多少而变化。但是大脑是如何调节脑力付出的呢?一种可能性是大脑优化资源分配问题,即投入的资源量要平衡其预期成本(即付出)和收益。我们的工作假设是主观决策信心充当资源分配问题的收益项,因此决策控制具有“元认知”性质。在此,我们提出一个用于决策在线元认知控制(oMCD)的计算模型。形式上,oMCD是一个马尔可夫决策过程,它在对基础决策系统内部运作情况未知的假设下,最优地解决随之而来的资源分配问题。我们展示了这如何使oMCD成为一类广泛决策过程的准最优控制策略,包括但不限于渐进属性整合。我们揭示了oMCD的主要特性(在选择、信心和反应时间方面),并表明它们重现了基于价值决策领域中大多数已确立的实证结果。最后,我们讨论了oMCD与关于决策控制和脑力付出调节的最著名神经认知理论之间可能的联系。