Biomedical Engineering Department, Tekirdağ Namık Kemal University, 59030, Tekirdağ, Turkey.
Institute of Biomedical Engineering, Boğaziçi University, Kandilli Campus, Çengelköy, 34684, Istanbul, Turkey.
J Comput Neurosci. 2020 Aug;48(3):333-353. doi: 10.1007/s10827-020-00751-8. Epub 2020 Jul 8.
We present a stochastic learning model that combines the essential elements of Hebbian and Rescorla-Wagner theories for operant conditioning. The model was used to predict the behavioral data of rats performing a vibrotactile yes/no detection task. Probabilistic nature of learning was implemented by trial-by-trial variability in the random distributions of associative strengths between the sensory and the response representations. By using measures derived from log-likelihoods (corrected Akaike and Bayesian information criteria), the proposed model and its subtypes were compared with each other, and with previous models in the literature, including reinforcement learning model with softmax rule and drift diffusion model. The main difference between these models was the level of stochasticity which was implemented as associative variation or response selection. The proposed model with subject-dependent variance coefficient (SVC) and with trial-dependent variance coefficient (TVC) resulted in better trial-by-trial fits to experimental data than the other tested models based on information criteria. Additionally, surrogate data were simulated with estimated parameters and the performance of the models were compared based on psychophysical measures (A': non-parametric sensitivity index, hits and false alarms on receiver operating characteristics). Especially the TVC model could produce psychophysical measures closer to those of the experimental data than the alternative models. The presented approach is novel for linking psychophysical response measures with learning in a yes/no detection task, and may be used in neural engineering applications.
我们提出了一个随机学习模型,该模型结合了操作性条件反射的赫布和雷斯考拉-瓦格纳理论的基本要素。该模型用于预测在进行振动触觉检测任务时大鼠的行为数据。通过在感觉和反应表示之间的关联强度的随机分布中逐次变化来实现学习的概率性质。通过使用来自对数似然度的度量(校正的 Akaike 和贝叶斯信息准则),将提出的模型及其亚型与彼此以及与文献中的先前模型进行了比较,包括具有软最大化规则的强化学习模型和漂移扩散模型。这些模型之间的主要区别在于实现关联变化或反应选择的随机性水平。与基于信息准则的其他测试模型相比,具有主体相关方差系数(SVC)和具有试验相关方差系数(TVC)的提出的模型在逐次拟合实验数据方面表现更好。此外,根据估计的参数模拟了替代数据,并基于心理物理度量(A':非参数灵敏度指数,接收者操作特征上的命中和误报)比较了模型的性能。特别是 TVC 模型可以产生比替代模型更接近实验数据的心理物理度量。提出的方法是将心理物理响应度量与检测任务中的学习联系起来的一种新方法,并且可用于神经工程应用。