Nagashima Kazuma, Morita Junya, Takeuchi Yugo
Department of Information Science and Technology, Graduate School of Science and Technology, Shizuoka University, Hamamatsu, Japan.
Department of Behavior Informatics, Faculty of Informatics, Shizuoka University, Hamamatsu, Japan.
Front Artif Intell. 2024 Oct 17;7:1397860. doi: 10.3389/frai.2024.1397860. eCollection 2024.
Studies on reinforcement learning have developed the representation of curiosity, which is a type of intrinsic motivation that leads to high performance in a certain type of tasks. However, these studies have not thoroughly examined the internal cognitive mechanisms leading to this performance. In contrast to this previous framework, we propose a mechanism of intrinsic motivation focused on pattern discovery from the perspective of human cognition. This study deals with intellectual curiosity as a type of intrinsic motivation, which finds novel compressible patterns in the data. We represented the process of continuation and boredom of tasks driven by intellectual curiosity using "pattern matching," "utility," and "production compilation," which are general functions of the adaptive control of thought-rational (ACT-R) architecture. We implemented three ACT-R models with different levels of thinking to navigate multiple mazes of different sizes in simulations, manipulating the intensity of intellectual curiosity. The results indicate that intellectual curiosity negatively affects task completion rates in models with lower levels of thinking, while positively impacting models with higher levels of thinking. In addition, comparisons with a model developed by a conventional framework of reinforcement learning (intrinsic curiosity module: ICM) indicate the advantage of representing the agent's intention toward a goal in the proposed mechanism. In summary, the reported models, developed using functions linked to a general cognitive architecture, can contribute to our understanding of intrinsic motivation within the broader context of human innovation driven by pattern discovery.
强化学习的研究已经发展出了好奇心的表征,好奇心是一种内在动机,能在特定类型的任务中带来高性能表现。然而,这些研究尚未全面考察导致这种表现的内部认知机制。与先前的这个框架不同,我们从人类认知的角度提出了一种专注于模式发现的内在动机机制。本研究将求知欲作为一种内在动机来探讨,它能在数据中发现新颖的可压缩模式。我们使用思维自适应控制的理性(ACT-R)架构的通用功能“模式匹配”“效用”和“生产编译”来表示由求知欲驱动的任务延续和厌倦过程。我们在模拟中实现了三个具有不同思维水平的ACT-R模型,以操控求知欲的强度来穿越不同大小的多个迷宫。结果表明,求知欲对思维水平较低的模型中的任务完成率有负面影响,而对思维水平较高的模型有积极影响。此外,与由强化学习的传统框架(内在好奇心模块:ICM)开发的模型进行比较,表明了在所提出的机制中表征智能体对目标的意图的优势。总之,所报告的使用与通用认知架构相关联的功能开发的模型,有助于我们在由模式发现驱动的人类创新的更广泛背景下理解内在动机。