Guigon E, Burnod Y
INSERM CREARE, Institut des Neurosciences, Université Pierre et Marie Curie, Paris, France.
Int J Psychophysiol. 1995 Mar;19(2):103-13. doi: 10.1016/0167-8760(94)00079-t.
Recent neurophysiological studies have revealed the patterns of neuronal activity during the acquisition of goal-directed behaviors, both in single cells, and in large populations of neurons. We propose a model which helps three sets of experimental results in the monkey to be understood: (1) activity of single cells vary greatly and only population activities are causally related to behavior. The model shows how a population of stochastic neurons, whose behaviors vary widely, can learn a skilled conditioned movement with only local activity-dependent synaptic changes. (2) typical changes in neuronal activity occur when the rules governing the behavior are changed, i.e. when the relationship between cues and actions to reach a goal changes over time. There are two types of neuronal patterns during changes in reward contingency: a monotonic increasing pattern and a non-monotonic pattern which follows the change in the way the reward is obtained. Units in the model display these two types of change, which correspond to synaptic modifications related to the encoding of the behavioral significance of sensory and motor events. (3) These two patterns of neuronal activity define two populations whose anatomical distributions in the frontal lobe overlap with a gradient organized in the rostro-caudal direction. The model consists of two artificial neural networks, defined by the same set of equations, but which differ in the values of two parameters (P and Q). P defines the adaptive properties of processing units and Q describes the coding of information. The model suggests that a balance in the relative strengths of these parameters distributed along a rostro-caudal gradient can explain the distribution of neuronal types in the frontal lobe of the monkey.
最近的神经生理学研究揭示了在目标导向行为习得过程中,单细胞以及大量神经元群体的神经活动模式。我们提出了一个模型,有助于理解猴子身上的三组实验结果:(1)单细胞的活动变化很大,只有群体活动与行为存在因果关系。该模型展示了一群行为差异很大的随机神经元如何仅通过局部活动依赖的突触变化来学习熟练的条件性运动。(2)当行为规则发生变化时,即线索与实现目标的动作之间的关系随时间改变时,神经元活动会出现典型变化。在奖励偶然性变化期间有两种神经元模式:单调增加模式和一种跟随奖励获取方式变化的非单调模式。模型中的单元展示了这两种变化类型,它们对应于与感觉和运动事件的行为意义编码相关的突触修饰。(3)这两种神经元活动模式定义了两个群体,它们在额叶中的解剖分布与沿前后方向组织的梯度重叠。该模型由两个人工神经网络组成,由同一组方程定义,但在两个参数(P和Q)的值上有所不同。P定义了处理单元的自适应特性,Q描述了信息编码。该模型表明,沿着前后梯度分布的这些参数的相对强度平衡可以解释猴子额叶中神经元类型的分布。