Reinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks.

Authors

Tobias Brosch, Heiko Neumann, Pieter R. Roelfsema

Affiliations

University of Ulm, Institute of Neural Information Processing, Ulm, Germany.

Department of Vision & Cognition, Netherlands Institute for Neuroscience (KNAW), Amsterdam, The Netherlands; Department of Integrative Neurophysiology, Center for Neurogenomics and Cognitive Research, VU University, Amsterdam, The Netherlands; Psychiatry Department, Academic Medical Center, Amsterdam, The Netherlands.

Publication information

PLoS Comput Biol. 2015 Oct 23;11(10):e1004489. doi: 10.1371/journal.pcbi.1004489. eCollection 2015 Oct.

DOI:10.1371/journal.pcbi.1004489

PMID:26496502

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4619762/

Abstract

The processing of a visual stimulus can be subdivided into a number of stages. Upon stimulus presentation there is an early phase of feedforward processing where the visual information is propagated from lower to higher visual areas for the extraction of basic and complex stimulus features. This is followed by a later phase where horizontal connections within areas and feedback connections from higher areas back to lower areas come into play. In this later phase, image elements that are behaviorally relevant are grouped by Gestalt grouping rules and are labeled in the cortex with enhanced neuronal activity (object-based attention in psychology). Recent neurophysiological studies revealed that reward-based learning influences these recurrent grouping processes, but it is not well understood how rewards train recurrent circuits for perceptual organization. This paper examines the mechanisms for reward-based learning of new grouping rules. We derive a learning rule that can explain how rewards influence the information flow through feedforward, horizontal and feedback connections. We illustrate the efficiency with two tasks that have been used to study the neuronal correlates of perceptual organization in early visual cortex. The first task is called contour-integration and demands the integration of collinear contour elements into an elongated curve. We show how reward-based learning causes an enhancement of the representation of the to-be-grouped elements at early levels of a recurrent neural network, just as is observed in the visual cortex of monkeys. The second task is curve-tracing where the aim is to determine the endpoint of an elongated curve composed of connected image elements. If trained with the new learning rule, neural networks learn to propagate enhanced activity over the curve, in accordance with neurophysiological data. We close the paper with a number of model predictions that can be tested in future neurophysiological and computational studies.
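
The curve-tracing task described in the abstract can be illustrated with a toy sketch. This is not the authors' network or their learning rule, which the abstract does not specify. It is a hand-written illustration, under stated assumptions, of the end behaviour a trained network is said to show: an "enhanced activity" label spreads, one recurrent step per iteration, from a known starting pixel to neighbouring curve pixels until the endpoint is reached. The function name `trace_curve`, the `'#'` pixel encoding, the 8-connected neighbourhood, and the example image are all hypothetical choices made for this example.

```python
def trace_curve(image, start):
    """Spread an 'enhanced activity' label from `start` along a curve.

    image: list of equal-length strings; '#' marks a curve pixel (assumed encoding).
    start: (row, col) of the known beginning of the target curve.
    Returns (endpoint, steps): the pixel labelled last and the number of
    recurrent iterations needed to reach it.
    """
    rows, cols = len(image), len(image[0])
    if image[start[0]][start[1]] != "#":
        raise ValueError("start must lie on a curve pixel")

    enhanced = {start}   # units that already carry the enhanced label
    frontier = [start]   # units that became enhanced in the previous step
    endpoint, steps = start, 0

    while frontier:
        new_frontier = []
        for r, c in frontier:
            # Horizontal spread to the 8-connected neighbourhood (an assumption).
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    nr, nc = r + dr, c + dc
                    if (0 <= nr < rows and 0 <= nc < cols
                            and image[nr][nc] == "#"
                            and (nr, nc) not in enhanced):
                        enhanced.add((nr, nc))
                        new_frontier.append((nr, nc))
        if new_frontier:
            steps += 1
            endpoint = new_frontier[-1]
        frontier = new_frontier
    return endpoint, steps


# Hypothetical stimulus: a target curve (top) and an unconnected distractor (bottom).
IMAGE = [
    "#......",
    ".#.....",
    "..##...",
    ".......",
    "####...",
]

print(trace_curve(IMAGE, (0, 0)))  # ((2, 3), 3)
```

Because the label only spreads between adjacent curve pixels, the distractor in the bottom row never becomes enhanced when tracing starts on the target curve. The sketch assumes thin, non-branching curves; with thick or branching curves the "last labelled pixel" is no longer a well-defined endpoint.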

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a75/4619762/600f9495cb57/pcbi.1004489.g001.jpg
