• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

条件作用中的偶然性、连续性和因果关系:将信息论和韦伯定律应用于信用分配问题。

Contingency, contiguity, and causality in conditioning: Applying information theory and Weber's Law to the assignment of credit problem.

机构信息

Department of Psychology.

出版信息

Psychol Rev. 2019 Oct;126(5):761-773. doi: 10.1037/rev0000163. Epub 2019 Aug 29.

DOI:10.1037/rev0000163
PMID:31464474
Abstract

Contingency is a critical concept for theories of associative learning and the assignment of credit problem in reinforcement learning. Measuring and manipulating it has, however, been problematic. The information-theoretic definition of contingency-normalized mutual information-makes it a readily computed property of the relation between reinforcing events, the stimuli that predict them and the responses that produce them. When necessary, the dynamic range of the required temporal representation divided by the Weber fraction gives a psychologically realistic plug-in estimates of the entropies. There is no measurable prospective contingency between a peck and reinforcement when pigeons peck on a variable interval schedule of reinforcement. There is, however, a perfect retrospective contingency between reinforcement and the immediately preceding peck. Degrading the retrospective contingency by gratis reinforcement reveals a critical value (.25), below which performance declines rapidly. Contingency is time scale invariant, whereas the perception of proximate causality depends-we assume-on there being a short, fixed psychologically negligible critical interval between cause and effect. Increasing the interval between a response and reinforcement that it triggers degrades the retrograde contingency, leading to a decline in performance that restores it to at or above its critical value. Thus, there is no critical interval in the retrospective effect of reinforcement. We conclude with a short review of the broad explanatory scope of information-theoretic contingencies when regarded as causal variables in conditioning. We suggest that the computation of contingencies may supplant the computation of the sum of all future rewards in models of reinforcement learning. (PsycINFO Database Record (c) 2019 APA, all rights reserved).

摘要

contingency 是联想学习理论和强化学习中的归因问题的一个关键概念。然而,对其进行衡量和操纵一直存在问题。信息论中 contingency 的定义——归一化互信息——使它成为强化事件之间、预测强化事件的刺激和产生强化事件的反应之间关系的一个易于计算的属性。当需要时,所需时间表示的动态范围除以韦伯分数,给出了一个心理上现实的熵插入估计值。当鸽子在可变间隔强化程序上啄食时,啄食和强化之间没有可测量的前瞻性关联。然而,强化和紧接着的啄食之间存在完美的回溯关联。通过免费强化来降低回溯关联,可以揭示一个关键值(.25),低于该值,表现会迅速下降。关联是时间尺度不变的,而对因果关系的感知则取决于——我们假设——在因果之间存在一个短的、固定的、心理上可忽略的关键间隔。增加引发强化的反应和强化之间的间隔会降低逆行关联,导致表现下降,直到恢复到或高于其关键值。因此,在强化的回溯效应中没有关键间隔。最后,我们简要回顾了信息论关联作为条件作用中的因果变量时的广泛解释范围。我们认为,在强化学习模型中,关联的计算可能会取代对所有未来奖励的总和的计算。(PsycINFO 数据库记录(c)2019 APA,保留所有权利)。

相似文献

1
Contingency, contiguity, and causality in conditioning: Applying information theory and Weber's Law to the assignment of credit problem.条件作用中的偶然性、连续性和因果关系:将信息论和韦伯定律应用于信用分配问题。
Psychol Rev. 2019 Oct;126(5):761-773. doi: 10.1037/rev0000163. Epub 2019 Aug 29.
2
Time-scale invariant contingency yields one-shot reinforcement learning despite extremely long delays to reinforcement.时间不变协变量尽管强化延迟非常长,但仍能产生单次强化学习。
Proc Natl Acad Sci U S A. 2024 Jul 23;121(30):e2405451121. doi: 10.1073/pnas.2405451121. Epub 2024 Jul 15.
3
Contextual determinants of temporal control: Behavioral contrast in a free-operant psychophysical procedure.时间控制的情境决定因素:自由操作心理物理学程序中的行为对比
Behav Processes. 2006 Feb 28;71(2-3):157-63. doi: 10.1016/j.beproc.2005.11.005. Epub 2005 Dec 20.
4
Less information results in better midsession reversal accuracy by pigeons.信息越少,鸽子在中间阶段的反转准确性越高。
J Exp Psychol Anim Learn Cogn. 2019 Oct;45(4):422-430. doi: 10.1037/xan0000215. Epub 2019 Jun 3.
5
Timing compound stimuli: Relative reinforcer probabilities divide stimulus control in the multiple peak procedure.复合刺激的定时:在多峰程序中,相对强化物概率划分刺激控制。
J Exp Psychol Anim Learn Cogn. 2020 Apr;46(2):124-138. doi: 10.1037/xan0000233. Epub 2019 Dec 5.
6
Timescale invariance and Weber's law in choice.选择中的时间尺度不变性与韦伯定律。
J Exp Psychol Anim Behav Process. 2006 Jul;32(3):229-38. doi: 10.1037/0097-7403.32.3.229.
7
Matching-to-sample performance is better analyzed in terms of a four-term contingency than in terms of a three-term contingency.匹配样本的表现用四项关联条件来分析比用三项关联条件要好。
J Exp Anal Behav. 2013 Jul;100(1):5-26. doi: 10.1002/jeab.32. Epub 2013 May 31.
8
Selective sensitivity of schedule-induced activity to an operant suppression contingency.程序诱导活动对操作性抑制偶发事件的选择性敏感性。
J Exp Anal Behav. 1992 Nov;58(3):471-83. doi: 10.1901/jeab.1992.58-471.
9
Time-scale-invariant information-theoretic contingencies in discrimination learning.辨别学习中的时间尺度不变信息论偶发事件
J Exp Psychol Anim Learn Cogn. 2019 Jul;45(3):280-289. doi: 10.1037/xan0000205. Epub 2019 Apr 25.
10
The effect of reinforcement probability on time discrimination in the midsession reversal task.强化概率对中场反转任务中时间辨别力的影响。
J Exp Anal Behav. 2019 May;111(3):371-386. doi: 10.1002/jeab.513. Epub 2019 Feb 25.

引用本文的文献

1
Explaining Performance on Interval and Ratio Schedules with a Molar View of Behavior.用行为的宏观视角解释间隔和比率强化程序下的表现。
Perspect Behav Sci. 2025 May 21;48(2):173-202. doi: 10.1007/s40614-025-00455-3. eCollection 2025 Jun.
2
Reconceptualized Associative Learning.重新概念化的联想学习
Perspect Behav Sci. 2025 Apr 2;48(2):203-239. doi: 10.1007/s40614-025-00442-8. eCollection 2025 Jun.
3
Extinction context is learned by pigeons, not given by the environment.消退情境是鸽子习得的,而非由环境赋予。
Commun Psychol. 2025 May 24;3(1):83. doi: 10.1038/s44271-025-00261-2.
4
Prospective contingency explains behavior and dopamine signals during associative learning.前瞻性偶然性解释了联想学习过程中的行为和多巴胺信号。
Nat Neurosci. 2025 Mar 18. doi: 10.1038/s41593-025-01915-4.
5
Learning depends on the information conveyed by temporal relationships between events and is reflected in the dopamine response to cues.学习依赖于事件之间时间关系所传递的信息,并反映在多巴胺对线索的反应中。
Sci Adv. 2024 Sep 6;10(36):eadi7137. doi: 10.1126/sciadv.adi7137.
6
Abstinence as Choice: Exploring Voluntary Abstinence from Alcohol Self-Administration Using the Resurgence-as-Choice Framework.将戒酒作为一种选择:运用“复现即选择”框架探索自愿戒酒自我管理行为
Perspect Behav Sci. 2024 May 6;47(2):335-363. doi: 10.1007/s40614-024-00405-5. eCollection 2024 Jun.
7
Time-scale invariant contingency yields one-shot reinforcement learning despite extremely long delays to reinforcement.时间不变协变量尽管强化延迟非常长,但仍能产生单次强化学习。
Proc Natl Acad Sci U S A. 2024 Jul 23;121(30):e2405451121. doi: 10.1073/pnas.2405451121. Epub 2024 Jul 15.
8
The role of prospective contingency in the control of behavior and dopamine signals during associative learning.前瞻性偶然性在联想学习过程中对行为和多巴胺信号的控制作用。
bioRxiv. 2024 Feb 6:2024.02.05.578961. doi: 10.1101/2024.02.05.578961.
9
Language models and psychological sciences.语言模型与心理科学。
Front Psychol. 2023 Oct 20;14:1279317. doi: 10.3389/fpsyg.2023.1279317. eCollection 2023.
10
The relation between implicit statistical learning and proactivity as revealed by EEG.脑电揭示内隐统计学习与主动性之间的关系。
Sci Rep. 2023 Sep 22;13(1):15787. doi: 10.1038/s41598-023-42116-y.