• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

鸽子在操作性囚徒困境中与针锋相对策略对抗时的学习。

Learning by pigeons playing against tit-for-tat in an operant prisoner's dilemma.

作者信息

Sanabria Federico, Baker Forest, Rachlin Howard

机构信息

Department of Psychology, State University of New York, Stony Brook, New York 11794-2500, USA.

出版信息

Learn Behav. 2003 Nov;31(4):318-31. doi: 10.3758/bf03195994.

DOI:10.3758/bf03195994
PMID:14733481
Abstract

Each of four pigeons was exposed to a single random-ratio schedule of reinforcement in which the probability of reinforcement for a peck on either of two keys was 1/25. Reinforcer amounts were determined by an iterated prisoner's dilemma (IPD) matrix in which the "other player" (a computer) played tit-for-tat. One key served as the cooperation (C) key; the other served as the defection (D) key. If a peck was scheduled to be reinforced and the D-key was pecked, the immediate reinforcer of that peck was always higher than it would have been had the C-key been pecked. However, if the C-key was pecked and the following peck was scheduled to be reinforced, reinforcement amount for pecks on either key were higher than they would have been if the previous peck had been on the D-key. Although immediate reinforcement was always higher for D-pecks, the overall reinforcement rate increased linearly with the proportion of C-pecks. C-pecks thus constituted a form of self-control. All the pigeons initially defected with this procedure. However, when feedback signals were introduced that indicated which key had last been pecked, cooperation (relative rate of C-pecks)--hence, self-control--increased for all the pigeons.

摘要

四只鸽子中的每一只都被置于一种单一的随机比率强化程序中,在该程序中,啄击两个按键中任意一个获得强化的概率为1/25。强化量由一个重复囚徒困境(IPD)矩阵决定,其中“另一个参与者”(一台计算机)采用针锋相对策略。一个按键用作合作(C)键;另一个用作背叛(D)键。如果一次啄击被安排为获得强化且啄的是D键,那么该啄击的即时强化物总是比啄C键时要高。然而,如果啄的是C键且接下来的啄击被安排为获得强化,那么无论啄哪个键,强化量都比前一次啄D键时要高。尽管对D键啄击的即时强化总是更高,但总体强化率随C键啄击比例的增加而呈线性上升。因此,C键啄击构成了一种自我控制形式。所有鸽子在这个程序开始时都选择背叛。然而,当引入反馈信号以表明最后啄的是哪个键时,所有鸽子的合作(C键啄击的相对比率)——也就是自我控制——都增强了。

相似文献

1
Learning by pigeons playing against tit-for-tat in an operant prisoner's dilemma.鸽子在操作性囚徒困境中与针锋相对策略对抗时的学习。
Learn Behav. 2003 Nov;31(4):318-31. doi: 10.3758/bf03195994.
2
Prisoner's dilemma and the free operant: John Nash, I'd like you to meet Fred Skinner.囚徒困境与自由操作:约翰·纳什,我想让你见见弗雷德·斯金纳。
J Exp Anal Behav. 2023 Nov;120(3):320-329. doi: 10.1002/jeab.874. Epub 2023 Jul 18.
3
Contingencies of reinforcement in a five-person prisoner's dilemma.五人囚徒困境中的强化偶然性
J Exp Anal Behav. 2004 Sep;82(2):161-76. doi: 10.1901/jeab.2004.82-161.
4
Mechanisms underlying the effects of unsignaled delayed reinforcement on key pecking of pigeons under variable-interval schedules.在可变间隔时间表下,无信号延迟强化对鸽子按键啄击行为影响的潜在机制。
J Exp Anal Behav. 1998 Mar;69(2):103-22. doi: 10.1901/jeab.1998.69-103.
5
Animal procrastination: Pigeons choose to defer experiencing an aversive gap or a peck requirement.动物的拖延行为:鸽子会选择推迟经历厌恶间隙或啄击要求。
Learn Behav. 2020 Jun;48(2):246-253. doi: 10.3758/s13420-019-00397-2.
6
Prisoner's dilemma and the pigeon: Control by immediate consequences.囚徒困境与鸽子:即时后果控制。
J Exp Anal Behav. 1995 Jul;64(1):1-17. doi: 10.1901/jeab.1995.64-1.
7
Categorical counting.分类计数
Behav Processes. 2010 Sep;85(1):28-35. doi: 10.1016/j.beproc.2010.06.001. Epub 2010 Jun 9.
8
Responding during reinforcement delay in a self-control paradigm.在自我控制范式中强化延迟期间的反应。
J Exp Anal Behav. 1984 May;41(3):267-77. doi: 10.1901/jeab.1984.41-267.
9
Matching, maximizing, and the behavioral unit: concurrent reinforcement of response sequences.匹配、最大化与行为单元:反应序列的并发强化
J Exp Anal Behav. 1982 Jan;37(1):97-114. doi: 10.1901/jeab.1982.37-97.
10
The emergence of symmetry in a conditional discrimination task using different responses as propioceptive samples in pigeons.在鸽子中,以不同反应作为本体感受样本的条件性辨别任务中对称性的出现。
J Exp Anal Behav. 2006 Jul;86(1):65-80. doi: 10.1901/jeab.2006.67-04.

引用本文的文献

1
Undiscounted costs and socially discounted benefits modulate cooperation in one-shot and iterated prisoner's dilemma games.未贴现成本和社会贴现收益会调节一次性和重复囚徒困境博弈中的合作。
J Exp Anal Behav. 2025 Sep;124(2):e70046. doi: 10.1002/jeab.70046.
2
Competitive and cooperative games for probing the neural basis of social decision-making in animals.用于探测动物社会决策神经基础的竞争与合作游戏。
Neurosci Biobehav Rev. 2023 Jun;149:105158. doi: 10.1016/j.neubiorev.2023.105158. Epub 2023 Apr 4.
3
Commitment and self-control in a prisoner's dilemma game.

本文引用的文献

1
Self-control and social cooperation.自我控制与社会合作。
Behav Processes. 1999 Sep;47(2):65-72. doi: 10.1016/s0376-6357(99)00054-6.
2
How to teach a pigeon to maximize overall reinforcement rate.如何教鸽子最大化整体强化率。
J Exp Anal Behav. 1995 Nov;64(3):277-97. doi: 10.1901/jeab.1995.64-277.
3
Prisoner's dilemma and the pigeon: Control by immediate consequences.囚徒困境与鸽子:即时后果控制。
囚徒困境游戏中的承诺和自我控制。
J Exp Anal Behav. 2012 Jul;98(1):89-103. doi: 10.1901/jeab.2012.98-89.
4
The Temporal Dynamics of Cooperation.合作的时间动态
J Behav Decis Mak. 2012 Jul 1;25(3):257-263. doi: 10.1002/bdm.729. Epub 2011 Jan 24.
5
Short-term gains, long-term pains: how cues about state aid learning in dynamic environments.短期收益,长期痛苦:动态环境中关于国家援助学习的线索是怎样的。
Cognition. 2009 Dec;113(3):293-313. doi: 10.1016/j.cognition.2009.03.013. Epub 2009 May 8.
J Exp Anal Behav. 1995 Jul;64(1):1-17. doi: 10.1901/jeab.1995.64-1.
4
The role of autoshaping in cooperative two-player games between starlings.自身塑造在椋鸟双人合作游戏中的作用。
J Exp Anal Behav. 1993 Jul;60(1):67-83. doi: 10.1901/jeab.1993.60-67.
5
Probability and delay in commitment.承诺的概率和延迟。
J Exp Anal Behav. 1987 Nov;48(3):347-53. doi: 10.1901/jeab.1987.48-347.
6
Commitment, choice and self-control.承诺、选择和自我控制。
J Exp Anal Behav. 1972 Jan;17(1):15-22. doi: 10.1901/jeab.1972.17-15.
7
Concurrent responding with fixed relative rate of reinforcement.同时响应与固定相对强化率。
J Exp Anal Behav. 1969 Nov;12(6):887-95. doi: 10.1901/jeab.1969.12-887.
8
Discounting and reciprocity in an Iterated Prisoner's Dilemma.重复囚徒困境中的折扣与互惠
Science. 2002 Dec 13;298(5601):2216-8. doi: 10.1126/science.1078498.