• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

瞳孔扩张和反应速度减缓可区分概率学习任务中的有意探索性选择。

Pupil dilation and response slowing distinguish deliberate explorative choices in the probabilistic learning task.

机构信息

Center for Neurocognitive Research (MEG-Center), Moscow State University of Psychology and Education, 29 Sretenka str, Moscow, 127051, Russia.

出版信息

Cogn Affect Behav Neurosci. 2022 Oct;22(5):1108-1129. doi: 10.3758/s13415-022-00996-z. Epub 2022 Apr 1.

DOI:10.3758/s13415-022-00996-z
PMID:35359274
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9458574/
Abstract

This study examined whether pupil size and response time would distinguish directed exploration from random exploration and exploitation. Eighty-nine participants performed the two-choice probabilistic learning task while their pupil size and response time were continuously recorded. Using LMM analysis, we estimated differences in the pupil size and response time between the advantageous and disadvantageous choices as a function of learning success, i.e., whether or not a participant has learned the probabilistic contingency between choices and their outcomes. We proposed that before a true value of each choice became known to a decision-maker, both advantageous and disadvantageous choices represented a random exploration of the two options with an equally uncertain outcome, whereas the same choices after learning manifested exploitation and direct exploration strategies, respectively. We found that disadvantageous choices were associated with increases both in response time and pupil size, but only after the participants had learned the choice-reward contingencies. For the pupil size, this effect was strongly amplified for those disadvantageous choices that immediately followed gains as compared to losses in the preceding choice. Pupil size modulations were evident during the behavioral choice rather than during the pretrial baseline. These findings suggest that occasional disadvantageous choices, which violate the acquired internal utility model, represent directed exploration. This exploratory strategy shifts choice priorities in favor of information seeking and its autonomic and behavioral concomitants are mainly driven by the conflict between the behavioral plan of the intended exploratory choice and its strong alternative, which has already proven to be more rewarding.

摘要

本研究旨在探究瞳孔大小和反应时是否能够区分有方向的探索与随机探索和开发。89 名参与者在执行二择一概率学习任务时,其瞳孔大小和反应时间被连续记录。通过 LMM 分析,我们估计了在学习成功的情况下(即参与者是否已经学习到选择及其结果之间的概率关系),瞳孔大小和反应时间在有利和不利选择之间的差异。我们提出,在决策者真正了解每个选择的价值之前,有利和不利的选择都代表着对两个选项的随机探索,结果同样不确定,而在学习之后,相同的选择表现出了开发和直接探索策略。我们发现,不利的选择与反应时间和瞳孔大小的增加有关,但仅在参与者学习了选择奖励关系之后。对于瞳孔大小,与在前一次选择中损失相比,那些紧随收益而来的不利选择的影响更强烈。瞳孔大小的调制在行为选择期间而不是在预试验基线期间明显。这些发现表明,偶尔的不利选择,违反了已获得的内部效用模型,代表有方向的探索。这种探索策略改变了选择的优先级,有利于信息搜索,其自主和行为伴随主要是由预期探索选择的行为计划与其强烈替代方案之间的冲突驱动的,后者已经被证明更有回报。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07b/9458574/63f2621be2bd/13415_2022_996_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07b/9458574/44444d16e139/13415_2022_996_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07b/9458574/42e66f58d19c/13415_2022_996_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07b/9458574/8f95a9e37f86/13415_2022_996_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07b/9458574/63f2621be2bd/13415_2022_996_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07b/9458574/44444d16e139/13415_2022_996_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07b/9458574/42e66f58d19c/13415_2022_996_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07b/9458574/8f95a9e37f86/13415_2022_996_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07b/9458574/63f2621be2bd/13415_2022_996_Fig5_HTML.jpg

相似文献

1
Pupil dilation and response slowing distinguish deliberate explorative choices in the probabilistic learning task.瞳孔扩张和反应速度减缓可区分概率学习任务中的有意探索性选择。
Cogn Affect Behav Neurosci. 2022 Oct;22(5):1108-1129. doi: 10.3758/s13415-022-00996-z. Epub 2022 Apr 1.
2
Losses resulting from deliberate exploration trigger beta oscillations in frontal cortex.因刻意探索导致的损失会引发前额叶皮质的β振荡。
Front Neurosci. 2023 May 11;17:1152926. doi: 10.3389/fnins.2023.1152926. eCollection 2023.
3
How pupil responses track value-based decision-making during and after reinforcement learning.瞳孔反应如何在强化学习期间和之后跟踪基于价值的决策。
PLoS Comput Biol. 2018 Nov 30;14(11):e1006632. doi: 10.1371/journal.pcbi.1006632. eCollection 2018 Nov.
4
Pupil diameter predicts changes in the exploration-exploitation trade-off: evidence for the adaptive gain theory.瞳孔直径可预测探索-开发权衡的变化:对适应增益理论的证据。
J Cogn Neurosci. 2011 Jul;23(7):1587-96. doi: 10.1162/jocn.2010.21548. Epub 2010 Jul 28.
5
Pupil Size Encodes Uncertainty during Exploration.瞳孔大小在探索过程中编码不确定性。
J Cogn Neurosci. 2023 Sep 1;35(9):1508-1520. doi: 10.1162/jocn_a_02025.
6
How uncertain are you? Disentangling expected and unexpected uncertainty in pupil-linked brain arousal during reversal learning.你有多不确定?在反转学习过程中,从与瞳孔相关的大脑唤醒中区分预期和意外不确定性。
Cogn Affect Behav Neurosci. 2023 Jun;23(3):578-599. doi: 10.3758/s13415-023-01072-w. Epub 2023 Feb 23.
7
Tonic noradrenergic activity modulates explorative behavior and attentional set shifting: Evidence from pupillometry and gaze pattern analysis.去甲肾上腺素能紧张度调节探索行为和注意定势转移:瞳孔测量和注视模式分析的证据。
Psychophysiology. 2017 Dec;54(12):1839-1854. doi: 10.1111/psyp.12964. Epub 2017 Jul 29.
8
Sex differences in learning from exploration.从探索中学习的性别差异。
Elife. 2021 Nov 19;10:e69748. doi: 10.7554/eLife.69748.
9
Losses as ecological guides: minor losses lead to maximization and not to avoidance.作为生态导向的损失:轻微损失导致最大化而非避免。
Cognition. 2015 Jun;139:10-7. doi: 10.1016/j.cognition.2015.03.001. Epub 2015 Mar 19.
10
Human place and response learning: navigation strategy selection, pupil size and gaze behavior.人类空间与反应学习:导航策略选择、瞳孔大小与注视行为。
Psychol Res. 2016 Jan;80(1):82-93. doi: 10.1007/s00426-014-0642-9. Epub 2014 Dec 24.

引用本文的文献

1
Losses resulting from deliberate exploration trigger beta oscillations in frontal cortex.因刻意探索导致的损失会引发前额叶皮质的β振荡。
Front Neurosci. 2023 May 11;17:1152926. doi: 10.3389/fnins.2023.1152926. eCollection 2023.
2
Value-driven modulation of visual perception by visual and auditory reward cues: The role of performance-contingent delivery of reward.视觉和听觉奖励线索对视觉感知的价值驱动调节:奖励的表现性依存性传递的作用。
Front Hum Neurosci. 2022 Dec 23;16:1062168. doi: 10.3389/fnhum.2022.1062168. eCollection 2022.

本文引用的文献

1
Balancing exploration and exploitation with information and randomization.通过信息与随机化实现探索与利用的平衡。
Curr Opin Behav Sci. 2021 Apr;38:49-56. doi: 10.1016/j.cobeha.2020.10.001. Epub 2020 Nov 6.
2
Locus coeruleus: a new look at the blue spot.蓝斑核:重新审视蓝斑。
Nat Rev Neurosci. 2020 Nov;21(11):644-659. doi: 10.1038/s41583-020-0360-9. Epub 2020 Sep 17.
3
Pupil Size as a Window on Neural Substrates of Cognition.瞳孔大小作为认知神经基质的窗口。
Trends Cogn Sci. 2020 Jun;24(6):466-480. doi: 10.1016/j.tics.2020.03.005. Epub 2020 Apr 21.
4
Eye pupil signals information gain.眼睛瞳孔传递信息增益。
Proc Biol Sci. 2019 Sep 25;286(1911):20191593. doi: 10.1098/rspb.2019.1593. Epub 2019 Sep 18.
5
Computational mechanisms of curiosity and goal-directed exploration.好奇心和目标导向探索的计算机制。
Elife. 2019 May 10;8:e41703. doi: 10.7554/eLife.41703.
6
Dissociation between reaction time and pupil dilation in the Stroop task.在斯特鲁普任务中反应时间和瞳孔扩张的分离。
J Exp Psychol Learn Mem Cogn. 2019 Oct;45(10):1899-1909. doi: 10.1037/xlm0000690. Epub 2019 Feb 28.
7
The algorithmic architecture of exploration in the human brain.人类大脑探索的算法架构。
Curr Opin Neurobiol. 2019 Apr;55:7-14. doi: 10.1016/j.conb.2018.11.003. Epub 2018 Dec 6.
8
How pupil responses track value-based decision-making during and after reinforcement learning.瞳孔反应如何在强化学习期间和之后跟踪基于价值的决策。
PLoS Comput Biol. 2018 Nov 30;14(11):e1006632. doi: 10.1371/journal.pcbi.1006632. eCollection 2018 Nov.
9
Comparing methods for the analysis of pupillary response.比较瞳孔反应分析方法。
Behav Res Methods. 2019 Feb;51(1):83-95. doi: 10.3758/s13428-018-1108-6.
10
Lose-Shift Responding in Humans Is Promoted by Increased Cognitive Load.认知负荷增加会促进人类的失时-转换反应。
Front Integr Neurosci. 2018 Mar 8;12:9. doi: 10.3389/fnint.2018.00009. eCollection 2018.