Suppr超能文献

基于人类引导的优先经验强化学习在自动驾驶中的应用

Prioritized Experience-Based Reinforcement Learning With Human Guidance for Autonomous Driving.

作者信息

Wu Jingda, Huang Zhiyu, Huang Wenhui, Lv Chen

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Jun 10;PP. doi: 10.1109/TNNLS.2022.3177685.

Abstract

Reinforcement learning (RL) requires skillful definition and remarkable computational efforts to solve optimization and control problems, which could impair its prospect. Introducing human guidance into RL is a promising way to improve learning performance. In this article, a comprehensive human guidance-based RL framework is established. A novel prioritized experience replay mechanism that adapts to human guidance in the RL process is proposed to boost the efficiency and performance of the RL algorithm. To relieve the heavy workload on human participants, a behavior model is established based on an incremental online learning method to mimic human actions. We design two challenging autonomous driving tasks for evaluating the proposed algorithm. Experiments are conducted to access the training and testing performance and learning mechanism of the proposed algorithm. Comparative results against the state-of-the-art methods suggest the advantages of our algorithm in terms of learning efficiency, performance, and robustness.

摘要

强化学习(RL)需要巧妙的定义和巨大的计算量来解决优化和控制问题,这可能会损害其前景。将人类指导引入强化学习是提高学习性能的一种有前途的方法。在本文中,建立了一个全面的基于人类指导的强化学习框架。提出了一种新颖的优先经验回放机制,该机制在强化学习过程中适应人类指导,以提高强化学习算法的效率和性能。为了减轻人类参与者的繁重工作量,基于增量在线学习方法建立了一个行为模型来模仿人类行为。我们设计了两个具有挑战性的自动驾驶任务来评估所提出的算法。进行实验以评估所提出算法的训练和测试性能以及学习机制。与最先进方法的比较结果表明了我们算法在学习效率、性能和鲁棒性方面的优势。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验