长短期记忆网络中的条件作用与时间表征

Conditioning and time representation in long short-term memory networks.

作者信息

Rivest Francois, Kalaska John F, Bengio Yoshua

机构信息

Department of Mathematics and Computer Science, Royal Military College of Canada, PO Box 17000, Station Forces, Kingston, ON, K7K 7B4, Canada,

出版信息

Biol Cybern. 2014 Feb;108(1):23-48. doi: 10.1007/s00422-013-0575-1. Epub 2013 Nov 21.

DOI:10.1007/s00422-013-0575-1

PMID:24258005

Abstract

Dopaminergic models based on the temporal-difference learning algorithm usually do not differentiate trace from delay conditioning. Instead, they use a fixed temporal representation of elapsed time since conditioned stimulus onset. Recently, a new model was proposed in which timing is learned within a long short-term memory (LSTM) artificial neural network representing the cerebral cortex (Rivest et al. in J Comput Neurosci 28(1):107-130, 2010). In this paper, that model's ability to reproduce and explain relevant data, as well as its ability to make interesting new predictions, are evaluated. The model reveals a strikingly different temporal representation between trace and delay conditioning since trace conditioning requires working memory to remember the past conditioned stimulus while delay conditioning does not. On the other hand, the model predicts no important difference in DA responses between those two conditions when trained on one conditioning paradigm and tested on the other. The model predicts that in trace conditioning, animal timing starts with the conditioned stimulus offset as opposed to its onset. In classical conditioning, it predicts that if the conditioned stimulus does not disappear after the reward, the animal may expect a second reward. Finally, the last simulation reveals that the buildup of activity of some units in the networks can adapt to new delays by adjusting their rate of integration. Most importantly, the paper shows that it is possible, with the proposed architecture, to acquire discharge patterns similar to those observed in dopaminergic neurons and in the cerebral cortex on those tasks simply by minimizing a predictive cost function.

摘要

基于时间差分学习算法的多巴胺能模型通常不区分痕迹条件反射和延迟条件反射。相反，它们使用自条件刺激开始后经过时间的固定时间表示。最近，有人提出了一种新模型，其中时间是在代表大脑皮层的长短期记忆（LSTM）人工神经网络中学习的（里韦斯特等人，《计算神经科学杂志》，2010年，第28卷第1期，第107 - 130页）。在本文中，评估了该模型再现和解释相关数据的能力，以及做出有趣新预测的能力。该模型揭示了痕迹条件反射和延迟条件反射之间显著不同的时间表示，因为痕迹条件反射需要工作记忆来记住过去的条件刺激，而延迟条件反射则不需要。另一方面，当在一种条件反射范式上训练并在另一种范式上测试时，该模型预测这两种条件下多巴胺反应没有重要差异。该模型预测，在痕迹条件反射中，动物的计时从条件刺激消失开始，而不是从其开始。在经典条件反射中，它预测如果条件刺激在奖励后不消失，动物可能会期待第二次奖励。最后，最后的模拟表明，网络中一些单元的活动积累可以通过调整它们的整合速率来适应新的延迟。最重要的是，本文表明，使用所提出的架构，仅通过最小化预测成本函数，就有可能获得与在这些任务中多巴胺能神经元和大脑皮层中观察到的放电模式相似的模式。

相似文献

Conditioning and time representation in long short-term memory networks.

Biol Cybern. 2014 Feb;108(1):23-48. doi: 10.1007/s00422-013-0575-1. Epub 2013 Nov 21.

Alternative time representation in dopamine models.

J Comput Neurosci. 2010 Feb;28(1):107-30. doi: 10.1007/s10827-009-0191-1. Epub 2009 Oct 22.

Why trace and delay conditioning are sometimes (but not always) hippocampal dependent: a computational model.

Brain Res. 2013 Feb 1;1493:48-67. doi: 10.1016/j.brainres.2012.11.020. Epub 2012 Nov 23.

Conditioned stimulus duration in classical trace conditioning: test of a real-time neural network model.

Behav Brain Res. 1991 Apr 18;43(1):73-8. doi: 10.1016/s0166-4328(05)80054-3.

Neural substrates mediating human delay and trace fear conditioning.

J Neurosci. 2004 Jan 7;24(1):218-28. doi: 10.1523/JNEUROSCI.0433-03.2004.

Altering the synchrony of stimulus trace processes: tests of a neural-network model.

Biol Cybern. 1991;65(3):161-9. doi: 10.1007/BF00198087.

Medial Auditory Thalamus Is Necessary for Expression of Auditory Trace Eyelid Conditioning.

J Neurosci. 2018 Oct 10;38(41):8831-8844. doi: 10.1523/JNEUROSCI.1009-18.2018. Epub 2018 Aug 17.

Cerebellar Processing Common to Delay and Trace Eyelid Conditioning.

J Neurosci. 2018 Aug 15;38(33):7221-7236. doi: 10.1523/JNEUROSCI.0430-18.2018. Epub 2018 Jul 16.

Differential mechanisms underlie trace and delay conditioning in Drosophila.

Nature. 2022 Mar;603(7900):302-308. doi: 10.1038/s41586-022-04433-6. Epub 2022 Feb 16.

Model-Driven Analysis of Eyeblink Classical Conditioning Reveals the Underlying Structure of Cerebellar Plasticity and Neuronal Activity.

IEEE Trans Neural Netw Learn Syst. 2017 Nov;28(11):2748-2762. doi: 10.1109/TNNLS.2016.2598190.

引用本文的文献

From eye-blinks to state construction: Diagnostic benchmarks for online representation learning.

Adapt Behav. 2023 Feb;31(1):3-19. doi: 10.1177/10597123221085039. Epub 2022 Apr 27.

Lateral Hypothalamic Control of the Ventral Tegmental Area: Reward Evaluation and the Driving of Motivated Behavior.

Front Syst Neurosci. 2017 Jul 6;11:50. doi: 10.3389/fnsys.2017.00050. eCollection 2017.

Arithmetic and local circuitry underlying dopamine prediction errors.

Nature. 2015 Sep 10;525(7568):243-6. doi: 10.1038/nature14855. Epub 2015 Aug 31.

Timing and expectation of reward: a neuro-computational model of the afferents to the ventral tegmental area.

Front Neurorobot. 2014 Jan 31;8:4. doi: 10.3389/fnbot.2014.00004. eCollection 2014.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

长短期记忆网络中的条件作用与时间表征

Conditioning and time representation in long short-term memory networks.

作者信息

Rivest Francois, Kalaska John F, Bengio Yoshua

机构信息

Department of Mathematics and Computer Science, Royal Military College of Canada, PO Box 17000, Station Forces, Kingston, ON, K7K 7B4, Canada,

出版信息

Biol Cybern. 2014 Feb;108(1):23-48. doi: 10.1007/s00422-013-0575-1. Epub 2013 Nov 21.

DOI:10.1007/s00422-013-0575-1

PMID:24258005

Abstract

摘要

长短期记忆网络中的条件作用与时间表征

Conditioning and time representation in long short-term memory networks.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

长短期记忆网络中的条件作用与时间表征

Conditioning and time representation in long short-term memory networks.

作者信息

机构信息

出版信息

相似文献

引用本文的文献