选择的最不频繁序列的增强。

Reinforcement of least-frequent sequences of choices.

出版信息

J Exp Anal Behav. 1967 Jan;10(1):57-65. doi: 10.1901/jeab.1967.10-57.

DOI:10.1901/jeab.1967.10-57

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1338318/

Abstract

When a pigeon's choices between two keys are probabilistically reinforced, as in discrete trial probability learning procedures and in concurrent variable-interval schedules, the bird tends to maximize, or to choose the alternative with the higher probability of reinforcement. In concurrent variable-interval schedules, steady-state matching, which is an approximate equality between the relative frequency of a response and the relative frequency of reinforcement of that response, has previously been obtained only as a consequence of maximizing. In the present experiment, maximizing was impossible. A choice of one of two keys was reinforced only if it formed, together with the three preceding choices, the sequence of four successive choices that had occurred least often. This sequence was determined by a Bernoulli-trials process with parameter p. Each of three pigeons matched when p was (1/2) or (1/4). Therefore, steady-state matching by individual birds is not always a consequence of maximizing. Choice probability varied between successive reinforcements, and sequential statistics revealed dependencies which were adequately described by a Bernoulli-trials process with p depending on the time since the preceding reinforcement.

摘要

当鸽子在两个键之间的选择是概率强化时，例如在离散试验概率学习程序和同时变时距程序中，鸟类往往会最大化，或者选择强化概率更高的选择。在同时变时距程序中，稳态匹配，即响应的相对频率与该响应的相对强化频率之间的近似相等，以前仅作为最大化的结果获得。在本实验中，最大化是不可能的。只有当选择两个键中的一个与之前的三个选择一起形成了出现次数最少的四个连续选择序列时，该选择才会得到强化。该序列由参数为 p 的伯努利试验过程决定。当 p 为 (1/2) 或 (1/4) 时，三只鸽子中的每一只都匹配。因此，个别鸟类的稳态匹配并不总是最大化的结果。选择概率在连续强化之间变化，序列统计揭示了依赖性，这些依赖性可以通过依赖于上次强化后时间的具有 p 的伯努利试验过程来充分描述。

相似文献

Reinforcement of least-frequent sequences of choices.

J Exp Anal Behav. 1967 Jan;10(1):57-65. doi: 10.1901/jeab.1967.10-57.

Probabilistically reinforced choice behavior in pigeons.

J Exp Anal Behav. 1966 Jul;9(4):443-55. doi: 10.1901/jeab.1966.9-443.

Interval reinforcement of choice behavior in discrete trials.

J Exp Anal Behav. 1969 Nov;12(6):875-85. doi: 10.1901/jeab.1969.12-875.

On some determinants of choice in pigeons.

J Exp Anal Behav. 1963 Jan;6(1):53-9. doi: 10.1901/jeab.1963.6-53.

Duration and rate of reinforcement as determinants of concurrent responding.

J Exp Anal Behav. 1977 Sep;28(2):145-53. doi: 10.1901/jeab.1977.28-145.

Matching, maximizing, and the behavioral unit: concurrent reinforcement of response sequences.

J Exp Anal Behav. 1982 Jan;37(1):97-114. doi: 10.1901/jeab.1982.37-97.

Concurrent variable-interval variable-ratio schedules can provide only weak evidence for matching.

J Exp Anal Behav. 1984 Jan;41(1):83-100. doi: 10.1901/jeab.1984.41-83.

The concurrent reinforcement of two interresponse times: the relative frequency of an interresponse time equals its relative harmonic length.

J Exp Anal Behav. 1969 May;12(3):403-11. doi: 10.1901/jeab.1969.12-403.

Hill-climbing by pigeons.

J Exp Anal Behav. 1983 Jan;39(1):25-47. doi: 10.1901/jeab.1983.39-25.

Choice behavior on discrete trials: a demonstration of the occurrence of a response strategy.

J Exp Anal Behav. 1974 Mar;21(2):315-22. doi: 10.1901/jeab.1974.21-315.

引用本文的文献

Probability matching is not the default decision making strategy in human and non-human primates.

Sci Rep. 2022 Jul 30;12(1):13092. doi: 10.1038/s41598-022-16983-w.

A Critical Review of the Support for Variability as an Operant Dimension.

Perspect Behav Sci. 2020 Jul 27;43(3):579-603. doi: 10.1007/s40614-020-00262-y. eCollection 2020 Sep.

Operant variability: procedures and processes.

Behav Anal. 2012 Fall;35(2):249-55. doi: 10.1007/BF03392284.

A runs-test algorithm: contingent reinforcement and response run structures.

J Exp Anal Behav. 2010 Jan;93(1):61-80. doi: 10.1901/jeab.2010.93-61.

Maximizing and matching on concurrent ratio schedules.

J Exp Anal Behav. 1975 Jul;24(1):107-16. doi: 10.1901/jeab.1975.24-107.

Operant variability: evidence, functions, and theory.

Psychon Bull Rev. 2002 Dec;9(4):672-705. doi: 10.3758/bf03196324.

Increasing the variability of response sequences in pigeons by adjusting the frequency of switching between two keys.

J Exp Anal Behav. 1997 Jul;68(1):1-25. doi: 10.1901/jeab.1997.68-1.

Probability and delay of reinforcement as factors in discrete-trial choice.

J Exp Anal Behav. 1985 May;43(3):341-51. doi: 10.1901/jeab.1985.43-341.

Choice between sequences of fixed-ratio schedules: effects of ratio values and probability of food delivery.

J Exp Anal Behav. 1987 Mar;47(2):225-32. doi: 10.1901/jeab.1987.47-225.

Behavioral variability and frequency-dependent selection.

J Exp Anal Behav. 1992 Sep;58(2):241-63. doi: 10.1901/jeab.1992.58-241.

本文引用的文献

THE LINC: A DESCRIPTION OF THE LABORATORY INSTRUMENT COMPUTER.

Ann N Y Acad Sci. 1964 Jul 31;115:653-68.

FURTHER EXPERIMENTS ON PROBABILITY-MATCHING IN THE PIGEON.

J Exp Anal Behav. 1964 Mar;7(2):151-7. doi: 10.1901/jeab.1964.7-151.

SECONDARY REINFORCEMENT AND RATE OF PRIMARY REINFORCEMENT.

J Exp Anal Behav. 1964 Jan;7(1):27-36. doi: 10.1901/jeab.1964.7-27.

Concurrent performances: reinforcement interaction and response independence.

J Exp Anal Behav. 1963 Apr;6(2):253-63. doi: 10.1901/jeab.1963.6-253.

On some determinants of choice in pigeons.

J Exp Anal Behav. 1963 Jan;6(1):53-9. doi: 10.1901/jeab.1963.6-53.

Relativity of response rate and reinforcement frequency in a multiple schedule.

J Exp Anal Behav. 1961 Apr;4(2):179-84. doi: 10.1901/jeab.1961.4-179.

The dependence of interresponse times upon the relative reinforcement of different interresponse times.

J Exp Psychol. 1956 Sep;52(3):145-61. doi: 10.1037/h0041255.

The reinforcement of least-frequent interresponse times.

J Exp Anal Behav. 1966 Sep;9(5):581-91. doi: 10.1901/jeab.1966.9-581.

Probabilistically reinforced choice behavior in pigeons.

J Exp Anal Behav. 1966 Jul;9(4):443-55. doi: 10.1901/jeab.1966.9-443.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

选择的最不频繁序列的增强。

Reinforcement of least-frequent sequences of choices.

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献