基于深度强化学习的好奇心驱动推荐策略，用于自适应学习。

Curiosity-driven recommendation strategy for adaptive learning via deep reinforcement learning.

机构信息

Department of Mathematics, Hong Kong University of Science and Technology, Kowloon, Hong Kong.

出版信息

Br J Math Stat Psychol. 2020 Nov;73(3):522-540. doi: 10.1111/bmsp.12199. Epub 2020 Feb 21.

PMID:32080828

Abstract

The design of recommendation strategies in the adaptive learning systems focuses on utilizing currently available information to provide learners with individual-specific learning instructions. As a critical motivate for human behaviours, curiosity is essentially the drive to explore knowledge and seek information. In a psychologically inspired view, we propose a curiosity-driven recommendation policy within the reinforcement learning framework, allowing for an efficient and enjoyable personalized learning path. Specifically, a curiosity reward from a well-designed predictive model is generated to model one's familiarity with the knowledge space. Given such curiosity rewards, we apply the actor-critic method to approximate the policy directly through neural networks. Numerical analyses with a large continuous knowledge state space and concrete learning scenarios are provided to further demonstrate the efficiency of the proposed method.

摘要

自适应学习系统中的推荐策略设计侧重于利用当前可用的信息为学习者提供个性化的学习指导。好奇心是人类行为的一个关键动机，它本质上是探索知识和寻求信息的驱动力。在受心理学启发的观点下，我们在强化学习框架内提出了一种基于好奇心的推荐策略，以实现高效和愉快的个性化学习路径。具体来说，我们从精心设计的预测模型中生成好奇心奖励，以建模学习者对知识空间的熟悉程度。有了这样的好奇心奖励，我们应用演员-评论家方法通过神经网络直接逼近策略。我们提供了具有大连续知识状态空间和具体学习场景的数值分析，以进一步证明所提出方法的效率。

相似文献

Curiosity-driven recommendation strategy for adaptive learning via deep reinforcement learning.基于深度强化学习的好奇心驱动推荐策略，用于自适应学习。

Br J Math Stat Psychol. 2020 Nov;73(3):522-540. doi: 10.1111/bmsp.12199. Epub 2020 Feb 21.

A reinforcement learning approach to personalized learning recommendation systems.一种用于个性化学习推荐系统的强化学习方法。

Br J Math Stat Psychol. 2019 Feb;72(1):108-135. doi: 10.1111/bmsp.12144. Epub 2018 Sep 12.

Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network.基于辅助评论家网络的自动驾驶策略深度强化学习

IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3680-3690. doi: 10.1109/TNNLS.2021.3116063. Epub 2023 Jul 6.

Contributions of expected learning progress and perceptual novelty to curiosity-driven exploration.预期学习进展和感知新颖性对好奇心驱动探索的贡献。

Cognition. 2022 Aug;225:105119. doi: 10.1016/j.cognition.2022.105119. Epub 2022 Apr 12.

Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning.通过深度强化学习的仿生水下航行器目标跟踪控制。

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3741-3752. doi: 10.1109/TNNLS.2021.3054402. Epub 2022 Aug 3.

Intrinsic Rewards for Exploration Without Harm From Observational Noise: A Simulation Study Based on the Free Energy Principle.探索的内在奖励，无需因观测噪声而产生危害：基于自由能原理的模拟研究。

Neural Comput. 2024 Aug 19;36(9):1854-1885. doi: 10.1162/neco_a_01690.

LJIR: Learning Joint-Action Intrinsic Reward in cooperative multi-agent reinforcement learning.LJIR：在合作多智能体强化学习中学习联合行动内在奖励

Neural Netw. 2023 Oct;167:450-459. doi: 10.1016/j.neunet.2023.08.016. Epub 2023 Aug 22.

Multi-agent reinforcement learning with approximate model learning for competitive games.多智能体强化学习与近似模型学习在竞争性游戏中的应用。

PLoS One. 2019 Sep 11;14(9):e0222215. doi: 10.1371/journal.pone.0222215. eCollection 2019.

Adaptive Learning Recommendation Strategy Based on Deep Q-learning.基于深度Q学习的自适应学习推荐策略

Appl Psychol Meas. 2020 Jun;44(4):251-266. doi: 10.1177/0146621619858674. Epub 2019 Jul 25.

Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards.使用人类生成的奖励训练用于手臂运动的 Actor-Critic 强化学习控制器。

IEEE Trans Neural Syst Rehabil Eng. 2017 Oct;25(10):1892-1905. doi: 10.1109/TNSRE.2017.2700395. Epub 2017 May 2.

引用本文的文献

An adaptive testing item selection strategy via a deep reinforcement learning approach.基于深度强化学习的自适应测验项目选择策略。

Behav Res Methods. 2024 Dec;56(8):8695-8714. doi: 10.3758/s13428-024-02498-x. Epub 2024 Sep 13.

Allosteric Regulation at the Crossroads of New Technologies: Multiscale Modeling, Networks, and Machine Learning.新技术交叉点上的变构调节：多尺度建模、网络与机器学习

Front Mol Biosci. 2020 Jul 9;7:136. doi: 10.3389/fmolb.2020.00136. eCollection 2020.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验