Adaptive Learning Recommendation Strategy Based on Deep Q-learning.

Authors

Tan Chunxi, Han Ruijian, Ye Rougang, Chen Kani

Affiliation

The Hong Kong University of Science and Technology, Kowloon, Hong Kong.

Publication information

Appl Psychol Meas. 2020 Jun;44(4):251-266. doi: 10.1177/0146621619858674. Epub 2019 Jul 25.

Abstract

Personalized recommendation systems that adapt to each learner's own pace have been widely adopted in the e-learning field. Drawing on learning behavior data, psychometric assessment models track the learner's proficiency on knowledge points, and a well-designed recommendation strategy then selects a sequence of actions to maximize the learner's learning efficiency. This article proposes a novel adaptive recommendation strategy under the framework of reinforcement learning. The proposed strategy is realized by deep Q-learning algorithms, the techniques that contributed to the success of AlphaGo Zero in reaching a superhuman level in the game of Go. The proposed algorithm incorporates early stopping to account for the possibility that learners may choose to stop learning. It can properly deal with missing data and can handle more individual-specific features for better recommendations. The recommendation strategy guides individual learners along efficient learning paths that vary from person to person. The authors present concrete examples with numerical analysis of substantive learning scenarios to further demonstrate the power of the proposed method.
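The idea of learning a recommendation policy by Q-learning while allowing for learner dropout can be illustrated with a minimal sketch. This is not the authors' algorithm: it swaps the deep Q-network for a plain lookup table, and the two-point prerequisite learner model, the dropout probability `P_QUIT`, the per-item cost `STEP_COST`, and all function names are hypothetical choices made only for illustration.

```python
import random

N_POINTS = 2      # hypothetical number of knowledge points
P_QUIT = 0.1      # assumed chance that the learner stops after any step
STEP_COST = 0.1   # assumed cost of each recommended item


def step(state, action, rng):
    """Toy learner model: point 1 can only be mastered after point 0."""
    mastery = list(state)
    prerequisite_met = action == 0 or mastery[action - 1] == 1
    gained = 0
    if prerequisite_met and mastery[action] == 0:
        mastery[action] = 1
        gained = 1
    next_state = tuple(mastery)
    # The episode ends on full mastery or when the learner drops out.
    done = all(next_state) or rng.random() < P_QUIT
    return next_state, gained - STEP_COST, done


def greedy_action(q, state):
    """Recommend the knowledge point with the highest estimated return."""
    return max(range(N_POINTS), key=lambda a: q.get((state, a), 0.0))


def train(episodes=2000, alpha=0.05, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {}  # maps (state, action) to the estimated return
    for _ in range(episodes):
        state, done = (0,) * N_POINTS, False
        while not done:
            if rng.random() < epsilon:
                action = rng.randrange(N_POINTS)
            else:
                action = greedy_action(q, state)
            next_state, reward, done = step(state, action, rng)
            # No bootstrapping once the episode has ended (mastery or dropout).
            best_next = 0.0 if done else max(
                q.get((next_state, a), 0.0) for a in range(N_POINTS))
            old = q.get((state, action), 0.0)
            q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
            state = next_state
    return q


if __name__ == "__main__":
    q = train()
    for state in [(0, 0), (1, 0)]:
        print(state, "-> recommend point", greedy_action(q, state))
```

Because a dropout ends the episode with no further reward, the learned values discount recommendations that delay progress, which is one simple way to reflect the possibility that learners stop early. A deep Q-learning variant would replace the table `q` with a neural network over a richer learner state.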

Similar articles

5
Research on MOOC Teaching Mode in Higher Education Based on Deep Learning.
Comput Intell Neurosci. 2022 Jan 29;2022:8031602. doi: 10.1155/2022/8031602. eCollection 2022.
6
Optimal Hierarchical Learning Path Design With Reinforcement Learning.
Appl Psychol Meas. 2021 Jan;45(1):54-70. doi: 10.1177/0146621620947171. Epub 2020 Aug 22.
8
Mastering the game of Go without human knowledge.
Nature. 2017 Oct 18;550(7676):354-359. doi: 10.1038/nature24270.
10
A perimetric learner's index.
Acta Ophthalmol Scand. 1997 Dec;75(6):665-8. doi: 10.1111/j.1600-0420.1997.tb00627.x.
