用于自主水下航行器三维路径跟踪与避碰的深度强化学习控制器

Deep Reinforcement Learning Controller for 3D Path Following and Collision Avoidance by Autonomous Underwater Vehicles.

作者信息

Havenstrøm Simen Theie, Rasheed Adil, San Omer

机构信息

Department of Engineering Cybernetics, Norwegian University of Science and Technology, Trondheim, Norway.

Mathematics and Cybernetics, SINTEF Digital, Trondheim, Norway.

出版信息

Front Robot AI. 2021 Jan 25;7:566037. doi: 10.3389/frobt.2020.566037. eCollection 2020.

DOI:10.3389/frobt.2020.566037

PMID:33585570

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7874127/

Abstract

Control theory provides engineers with a multitude of tools to design controllers that manipulate the closed-loop behavior and stability of dynamical systems. These methods rely heavily on insights into the mathematical model governing the physical system. However, in complex systems, such as autonomous underwater vehicles performing the dual objective of path following and collision avoidance, decision making becomes nontrivial. We propose a solution using state-of-the-art Deep Reinforcement Learning (DRL) techniques to develop autonomous agents capable of achieving this hybrid objective without having a priori knowledge about the goal or the environment. Our results demonstrate the viability of DRL in path following and avoiding collisions towards achieving human-level decision making in autonomous vehicle systems within extreme obstacle configurations.

摘要

控制理论为工程师提供了大量工具，用于设计控制器，以操纵动态系统的闭环行为和稳定性。这些方法严重依赖于对控制物理系统的数学模型的深入理解。然而，在复杂系统中，例如执行路径跟踪和避碰双重目标的自主水下航行器，决策变得并非易事。我们提出一种解决方案，利用最先进的深度强化学习（DRL）技术来开发自主智能体，使其能够在无需关于目标或环境的先验知识的情况下实现这一混合目标。我们的结果证明了DRL在路径跟踪和避碰方面的可行性，朝着在极端障碍物配置下的自主车辆系统中实现人类水平的决策迈进。