一种用于理解拖延行为的强化学习方法：价值近似不准确会导致任务的非理性延迟吗？

A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task?

作者信息

Feng Zheyu, Nagase Asako Mitsuto, Morita Kenji

机构信息

Physical and Health Education, Graduate School of Education, The University of Tokyo, Tokyo, Japan.

Division of Neurology, Department of Brain and Neurosciences, Faculty of Medicine, Tottori University, Yonago, Japan.

出版信息

Front Neurosci. 2021 Sep 16;15:660595. doi: 10.3389/fnins.2021.660595. eCollection 2021.

DOI:10.3389/fnins.2021.660595

PMID:34602962

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8481628/

Abstract

Procrastination is the voluntary but irrational postponing of a task despite being aware that the delay can lead to worse consequences. It has been extensively studied in psychological field, from contributing factors, to theoretical models. From value-based decision making and reinforcement learning (RL) perspective, procrastination has been suggested to be caused by non-optimal choice resulting from cognitive limitations. Exactly what sort of cognitive limitations are involved, however, remains elusive. In the current study, we examined if a particular type of cognitive limitation, namely, inaccurate valuation resulting from inadequate state representation, would cause procrastination. Recent work has suggested that humans may adopt a particular type of state representation called the successor representation (SR) and that humans can learn to represent states by relatively low-dimensional features. Combining these suggestions, we assumed a dimension-reduced version of SR. We modeled a series of behaviors of a "student" doing assignments during the school term, when putting off doing the assignments (i.e., procrastination) is not allowed, and during the vacation, when whether to procrastinate or not can be freely chosen. We assumed that the "student" had acquired a rigid reduced SR of each state, corresponding to each step in completing an assignment, under the policy without procrastination. The "student" learned the approximated value of each state which was computed as a linear function of features of the states in the rigid reduced SR, through temporal-difference (TD) learning. During the vacation, the "student" made decisions at each time-step whether to procrastinate based on these approximated values. Simulation results showed that the reduced SR-based RL model generated procrastination behavior, which worsened across episodes. According to the values approximated by the "student," to procrastinate was the better choice, whereas not to procrastinate was mostly better according to the true values. Thus, the current model generated procrastination behavior caused by inaccurate value approximation, which resulted from the adoption of the reduced SR as state representation. These findings indicate that the reduced SR, or more generally, the dimension reduction in state representation, can be a potential form of cognitive limitation that leads to procrastination.

摘要

拖延是指尽管意识到拖延可能会导致更糟糕的后果，但仍自愿且非理性地推迟任务。它在心理学领域得到了广泛研究，涵盖了从影响因素到理论模型等方面。从基于价值的决策和强化学习（RL）的角度来看，拖延被认为是由认知局限导致的非最优选择所引起的。然而，究竟涉及何种认知局限仍然难以捉摸。在当前的研究中，我们考察了一种特定类型的认知局限，即由于状态表征不足导致的估值不准确，是否会引发拖延。最近的研究表明，人类可能会采用一种称为后继表征（SR）的特定类型的状态表征，并且人类可以通过相对低维的特征来学习表征状态。结合这些观点，我们假设了一个维度缩减版的SR。我们对一名“学生”在学期期间做作业的一系列行为进行了建模，此时不允许推迟做作业（即拖延），以及在假期期间，此时是否拖延可以自由选择。我们假设“学生”在不拖延的策略下，已经获得了与完成作业的每个步骤相对应的每个状态的刚性缩减SR。“学生”通过时间差分（TD）学习，学习了每个状态的近似值，该近似值被计算为刚性缩减SR中状态特征的线性函数。在假期期间，“学生”在每个时间步根据这些近似值决定是否拖延。模拟结果表明，基于缩减SR的RL模型产生了拖延行为，并且这种行为在各轮中逐渐恶化。根据“学生”近似的值，拖延是更好的选择，而根据真实值，不拖延大多时候更好。因此，当前模型产生了由不准确的价值近似导致的拖延行为，这种不准确的价值近似是由于采用缩减SR作为状态表征而产生的。这些发现表明，缩减SR，或者更一般地说，状态表征的维度缩减，可能是导致拖延的一种潜在认知局限形式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f047/8481628/017381647552/fnins-15-660595-g001.jpg

相似文献

A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task?

Front Neurosci. 2021 Sep 16;15:660595. doi: 10.3389/fnins.2021.660595. eCollection 2021.

Basic Behavioral Processes Involved in Procrastination.

Front Psychol. 2021 Nov 23;12:769928. doi: 10.3389/fpsyg.2021.769928. eCollection 2021.

A neuro-computational account of procrastination behavior.

Nat Commun. 2022 Sep 26;13(1):5639. doi: 10.1038/s41467-022-33119-w.

To do it now or later: The cognitive mechanisms and neural substrates underlying procrastination.

Wiley Interdiscip Rev Cogn Sci. 2019 Jul;10(4):e1492. doi: 10.1002/wcs.1492. Epub 2019 Jan 14.

Procrastination in the pigeon: Can conditioned reinforcement increase the likelihood of human procrastination?

Psychon Bull Rev. 2018 Oct;25(5):1952-1957. doi: 10.3758/s13423-017-1409-2.

Outcome Value and Task Aversiveness Impact Task Procrastination through Separate Neural Pathways.

Cereb Cortex. 2021 Jul 5;31(8):3846-3855. doi: 10.1093/cercor/bhab053.

Stimulation of left dorsolateral prefrontal cortex enhances willingness for task completion by amplifying task outcome value.

J Exp Psychol Gen. 2023 Apr;152(4):1122-1133. doi: 10.1037/xge0001312. Epub 2022 Nov 28.

Procrastination and predictor variables among a group of dental students in Turkey.

Psychol Health Med. 2018 Jul;23(6):726-732. doi: 10.1080/13548506.2017.1418014. Epub 2017 Dec 21.

Student pacing in a master's level course: Procrastination, preference, and performance.

J Appl Behav Anal. 2021 Jun;54(3):1220-1234. doi: 10.1002/jaba.806. Epub 2020 Dec 30.

Psychometric evaluation of the Swedish version of the pure procrastination scale, the irrational procrastination scale, and the susceptibility to temptation scale in a clinical population.

BMC Psychol. 2014 Dec 11;2(1):54. doi: 10.1186/s40359-014-0054-z. eCollection 2014.

本文引用的文献

The neural basis of effort valuation: A meta-analysis of functional magnetic resonance imaging studies.

Neurosci Biobehav Rev. 2021 Dec;131:1275-1287. doi: 10.1016/j.neubiorev.2021.10.024. Epub 2021 Oct 25.

Rigid reduced successor representation as a potential mechanism for addiction.

Eur J Neurosci. 2021 Jun;53(11):3768-3790. doi: 10.1111/ejn.15227. Epub 2021 May 10.

Trait Procrastination and Mobile Phone Addiction Among Chinese College Students: A Moderated Mediation Model of Stress and Gender.

Front Psychol. 2020 Dec 1;11:614660. doi: 10.3389/fpsyg.2020.614660. eCollection 2020.

A Unified Framework for Dopamine Signals across Timescales.

Cell. 2020 Dec 10;183(6):1600-1616.e25. doi: 10.1016/j.cell.2020.11.013. Epub 2020 Nov 27.

A distributional code for value in dopamine-based reinforcement learning.

Nature. 2020 Jan;577(7792):671-675. doi: 10.1038/s41586-019-1924-6. Epub 2020 Jan 15.

Learning task-state representations.

Nat Neurosci. 2019 Oct;22(10):1544-1553. doi: 10.1038/s41593-019-0470-8. Epub 2019 Sep 24.

Cognitive prostheses for goal achievement.

Nat Hum Behav. 2019 Oct;3(10):1096-1106. doi: 10.1038/s41562-019-0672-9. Epub 2019 Aug 19.

The successor representation in human reinforcement learning.

Nat Hum Behav. 2017 Sep;1(9):680-692. doi: 10.1038/s41562-017-0180-8. Epub 2017 Aug 28.

Learning the payoffs and costs of actions.

PLoS Comput Biol. 2019 Feb 28;15(2):e1006285. doi: 10.1371/journal.pcbi.1006285. eCollection 2019 Feb.

Neuronal evidence for good-based economic decisions under variable action costs.

Nat Commun. 2019 Jan 23;10(1):393. doi: 10.1038/s41467-018-08209-3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于理解拖延行为的强化学习方法：价值近似不准确会导致任务的非理性延迟吗？

A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task?

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献