Suppr超能文献

迈向受大脑启发的系统:用于模拟自动驾驶智能体的深度循环强化学习

Toward a Brain-Inspired System: Deep Recurrent Reinforcement Learning for a Simulated Self-Driving Agent.

作者信息

Chen Jieneng, Chen Jingye, Zhang Ruiming, Hu Xiaobin

机构信息

Department of Computer Science, College of Electronics and Information Engineering, Tongji University, Shanghai, China.

School of Computer Science, Fudan University, Shanghai, China.

出版信息

Front Neurorobot. 2019 Jun 28;13:40. doi: 10.3389/fnbot.2019.00040. eCollection 2019.

Abstract

An effective way to achieve intelligence is to simulate various intelligent behaviors in the human brain. In recent years, bio-inspired learning methods have emerged, and they are different from the classical mathematical programming principle. From the perspective of brain inspiration, reinforcement learning has gained additional interest in solving decision-making tasks as increasing neuroscientific research demonstrates that significant links exist between reinforcement learning and specific neural substrates. Because of the tremendous research that focuses on human brains and reinforcement learning, scientists have investigated how robots can autonomously tackle complex tasks in the form of making a self-driving agent control in a human-like way. In this study, we propose an end-to-end architecture using novel deep-Q-network architecture in conjunction with a recurrence to resolve the problem in the field of simulated self-driving. The main contribution of this study is that we trained the driving agent using a brain-inspired trial-and-error technique, which was in line with the real world situation. Besides, there are three innovations in the proposed learning network: raw screen outputs are the only information which the driving agent can rely on, a weighted layer that enhances the differences of the lengthy episode, and a modified replay mechanism that overcomes the problem of sparsity and accelerates learning. The proposed network was trained and tested under a third-party OpenAI Gym environment. After training for several episodes, the resulting driving agent performed advanced behaviors in the given scene. We hope that in the future, the proposed brain-inspired learning system would inspire practicable self-driving control solutions.

摘要

实现智能的一种有效方法是模拟人类大脑中的各种智能行为。近年来,受生物启发的学习方法应运而生,它们不同于经典的数学编程原理。从大脑启发的角度来看,强化学习在解决决策任务方面获得了更多关注,因为越来越多的神经科学研究表明,强化学习与特定的神经基质之间存在着重要联系。由于对人类大脑和强化学习的大量研究,科学家们研究了机器人如何以类似人类的方式进行自动驾驶代理控制的形式自主处理复杂任务。在本研究中,我们提出了一种端到端架构,该架构使用新颖的深度Q网络架构并结合循环来解决模拟自动驾驶领域中的问题。本研究的主要贡献在于,我们使用受大脑启发的试错技术训练驾驶代理,这与现实世界的情况相符。此外,所提出的学习网络有三项创新:原始屏幕输出是驾驶代理唯一可以依赖的信息,一个加权层增强了长情节的差异,以及一个改进的重放机制,克服了稀疏性问题并加速了学习。所提出的网络在第三方OpenAI Gym环境下进行了训练和测试。经过几轮训练后,生成的驾驶代理在给定场景中表现出先进的行为。我们希望在未来,所提出的受大脑启发的学习系统能够激发切实可行的自动驾驶控制解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db67/6611356/0bbce8f32d1f/fnbot-13-00040-g0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验