IEEE Trans Biomed Circuits Syst. 2024 Oct;18(5):1166-1177. doi: 10.1109/TBCAS.2024.3388673. Epub 2024 Sep 26.
Most operant conditioning circuits predominantly focus on simple feedback process, few studies consider the intricacies of feedback outcomes and the uncertainty of feedback time. This paper proposes a neuromorphic circuit based on operant conditioning with addictiveness and time memory for automatic learning. The circuit is mainly composed of hunger output module, neuron module, excitement output module, memristor-based decision module, and memory and feedback generation module. In the circuit, the process of output excitement and addiction in stochastic feedback is achieved. The memory of interval between the two rewards is formed. The circuit can adapt to complex scenarios with these functions. In addition, hunger and satiety are introduced to realize the interaction between biological behavior and exploration desire, which enables the circuit to continuously reshape its memories and actions. The process of operant conditioning theory for automatic learning is accomplished. The study of operant conditioning can serve as a reference for more intelligent brain-inspired neural systems.
大多数操作性条件反射回路主要关注简单的反馈过程,很少有研究考虑反馈结果的复杂性和反馈时间的不确定性。本文提出了一种基于操作性条件反射的神经形态电路,具有成瘾性和时间记忆,用于自动学习。该电路主要由饥饿输出模块、神经元模块、兴奋输出模块、基于忆阻器的决策模块以及记忆和反馈生成模块组成。在该电路中,实现了随机反馈中输出兴奋和成瘾的过程,形成了两个奖励之间的间隔记忆。该电路可以通过这些功能适应复杂场景。此外,引入饥饿和饱腹感来实现生物行为和探索欲望之间的交互,使电路能够不断重塑其记忆和行为。完成了自动学习的操作性条件反射理论过程。操作性条件反射的研究可以为更智能的基于大脑的神经系统提供参考。