• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

感知机中的概率匹配:条件依赖和线性不可分性的影响。

Probability matching in perceptrons: Effects of conditional dependence and linear nonseparability.

作者信息

Dawson Michael R W, Gupta Maya

机构信息

Department of Psychology, University of Alberta, Edmonton, Alberta, Canada.

出版信息

PLoS One. 2017 Feb 17;12(2):e0172431. doi: 10.1371/journal.pone.0172431. eCollection 2017.

DOI:10.1371/journal.pone.0172431
PMID:28212422
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5315326/
Abstract

Probability matching occurs when the behavior of an agent matches the likelihood of occurrence of events in the agent's environment. For instance, when artificial neural networks match probability, the activity in their output unit equals the past probability of reward in the presence of a stimulus. Our previous research demonstrated that simple artificial neural networks (perceptrons, which consist of a set of input units directly connected to a single output unit) learn to match probability when presented different cues in isolation. The current paper extends this research by showing that perceptrons can match probabilities when presented simultaneous cues, with each cue signaling different reward likelihoods. In our first simulation, we presented up to four different cues simultaneously; the likelihood of reward signaled by the presence of one cue was independent of the likelihood of reward signaled by other cues. Perceptrons learned to match reward probabilities by treating each cue as an independent source of information about the likelihood of reward. In a second simulation, we violated the independence between cues by making some reward probabilities depend upon cue interactions. We did so by basing reward probabilities on a logical combination (AND or XOR) of two of the four possible cues. We also varied the size of the reward associated with the logical combination. We discovered that this latter manipulation was a much better predictor of perceptron performance than was the logical structure of the interaction between cues. This indicates that when perceptrons learn to match probabilities, they do so by assuming that each signal of a reward is independent of any other; the best predictor of perceptron performance is a quantitative measure of the independence of these input signals, and not the logical structure of the problem being learned.

摘要

当一个智能体的行为与该智能体所处环境中事件发生的可能性相匹配时,就会出现概率匹配。例如,当人工神经网络进行概率匹配时,其输出单元的活动等于在有刺激的情况下过去获得奖励的概率。我们之前的研究表明,简单的人工神经网络(感知器,由一组直接连接到单个输出单元的输入单元组成)在单独呈现不同线索时会学习进行概率匹配。本文通过表明感知器在同时呈现线索时能够进行概率匹配来扩展这一研究,其中每个线索表示不同的奖励可能性。在我们的第一个模拟中,我们同时呈现多达四个不同的线索;一个线索出现所表示的奖励可能性与其他线索所表示的奖励可能性无关。感知器通过将每个线索视为关于奖励可能性的独立信息源来学习匹配奖励概率。在第二个模拟中,我们通过使一些奖励概率取决于线索之间的相互作用来打破线索之间的独立性。我们通过基于四个可能线索中的两个线索的逻辑组合(与或异或)来确定奖励概率做到了这一点。我们还改变了与逻辑组合相关的奖励大小。我们发现,与线索之间相互作用的逻辑结构相比,后一种操作能更好地预测感知器的性能。这表明,当感知器学习进行概率匹配时,它们是通过假设奖励的每个信号都独立于任何其他信号来做到这一点的;感知器性能的最佳预测指标是这些输入信号独立性的定量度量,而不是正在学习的问题的逻辑结构。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab27/5315326/549d943c411f/pone.0172431.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab27/5315326/549d943c411f/pone.0172431.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab27/5315326/549d943c411f/pone.0172431.g001.jpg

相似文献

1
Probability matching in perceptrons: Effects of conditional dependence and linear nonseparability.感知机中的概率匹配:条件依赖和线性不可分性的影响。
PLoS One. 2017 Feb 17;12(2):e0172431. doi: 10.1371/journal.pone.0172431. eCollection 2017.
2
Simple artificial neural networks that match probability and exploit and explore when confronting a multiarmed bandit.简单的人工神经网络,在面对多臂老虎机问题时匹配概率并进行利用和探索。
IEEE Trans Neural Netw. 2009 Aug;20(8):1368-71. doi: 10.1109/TNN.2009.2025588. Epub 2009 Jul 10.
3
A learning rule for very simple universal approximators consisting of a single layer of perceptrons.一种由单层感知器组成的非常简单的通用逼近器的学习规则。
Neural Netw. 2008 Jun;21(5):786-95. doi: 10.1016/j.neunet.2007.12.036. Epub 2007 Dec 31.
4
Base rates versus sample accuracy: competition for control in human matching to sample.基础概率与样本准确性:人类样本匹配中控制权的竞争
J Exp Anal Behav. 1999 Mar;71(2):155-69. doi: 10.1901/jeab.1999.71-155.
5
Reward associations and spatial probabilities produce additive effects on attentional selection.奖励关联和空间概率对注意力选择产生累加效应。
Atten Percept Psychophys. 2014 Nov;76(8):2315-25. doi: 10.3758/s13414-014-0720-5.
6
Key-finding by artificial neural networks that learn about key profiles.通过学习关键特征的人工神经网络进行关键发现。
Can J Exp Psychol. 2018 Sep;72(3):153-170. doi: 10.1037/cep0000135. Epub 2017 May 8.
7
On the equivalence of two-layered perceptrons with binary neurons.关于具有二元神经元的两层感知器的等价性
Int J Neural Syst. 1995 Sep;6(3):225-31. doi: 10.1142/s0129065795000160.
8
'Proactive' use of cue-context congruence for building reinforcement learning's reward function.“主动”利用线索-情境一致性构建强化学习的奖励函数。
BMC Neurosci. 2016 Oct 28;17(1):70. doi: 10.1186/s12868-016-0302-7.
9
Using perceptrons to explore the reorientation task.利用感知机探索重新定向任务。
Cognition. 2010 Feb;114(2):207-26. doi: 10.1016/j.cognition.2009.09.006. Epub 2009 Sep 30.
10
An fMRI study of reward-related probability learning.一项关于奖励相关概率学习的功能磁共振成像研究。
Neuroimage. 2005 Feb 1;24(3):862-73. doi: 10.1016/j.neuroimage.2004.10.002. Epub 2004 Nov 18.

本文引用的文献

1
Unified pre- and postsynaptic long-term plasticity enables reliable and flexible learning.统一的突触前和突触后长期可塑性使可靠且灵活的学习成为可能。
Elife. 2015 Aug 26;4:e09457. doi: 10.7554/eLife.09457.
2
Equilibria of perceptrons for simple contingency problems.简单二项问题的感知器平衡。
IEEE Trans Neural Netw Learn Syst. 2012 Aug;23(8):1340-4. doi: 10.1109/TNNLS.2012.2199766.
3
Probabilistic inference of short-term synaptic plasticity in neocortical microcircuits.新皮层微电路中短期突触可塑性的概率推理。
Front Comput Neurosci. 2013 Jun 6;7:75. doi: 10.3389/fncom.2013.00075. eCollection 2013.
4
Differentiating models of associative learning: reorientation, superconditioning, and the role of inhibition.区分联想学习模型:重新定向、超条件作用及抑制的作用。
J Exp Psychol Anim Behav Process. 2013 Jul;39(3):273-86. doi: 10.1037/a0032174. Epub 2013 Apr 29.
5
Using perceptrons to explore the reorientation task.利用感知机探索重新定向任务。
Cognition. 2010 Feb;114(2):207-26. doi: 10.1016/j.cognition.2009.09.006. Epub 2009 Sep 30.
6
Simple artificial neural networks that match probability and exploit and explore when confronting a multiarmed bandit.简单的人工神经网络,在面对多臂老虎机问题时匹配概率并进行利用和探索。
IEEE Trans Neural Netw. 2009 Aug;20(8):1368-71. doi: 10.1109/TNN.2009.2025588. Epub 2009 Jul 10.
7
Learning about environmental geometry: a flaw in Miller and Shettleworth's (2007) operant model.了解环境几何学:米勒和谢特沃思(2007年)操作模型中的一个缺陷。
J Exp Psychol Anim Behav Process. 2008 Jul;34(3):415-8. doi: 10.1037/0097-7403.34.3.415.
8
JUDGMENT OF CONTINGENCY BETWEEN RESPONSES AND OUTCOMES.反应与结果之间的偶然性判断
Psychol Monogr. 1965;79:SUPPL 1:1-17. doi: 10.1037/h0093874.
9
The perceptron: a probabilistic model for information storage and organization in the brain.感知器:大脑中信息存储与组织的概率模型。
Psychol Rev. 1958 Nov;65(6):386-408. doi: 10.1037/h0042519.
10
A review of recent developments in research and theories on human contingency learning.对人类偶然性学习的研究与理论最新进展的综述。
Q J Exp Psychol B. 2002 Oct;55(4):289-310. doi: 10.1080/02724990244000034.