• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

未知户外环境中自主飞行机器人端到端局部运动规划的深度强化学习:实时飞行实验

Deep Reinforcement Learning for End-to-End Local Motion Planning of Autonomous Aerial Robots in Unknown Outdoor Environments: Real-Time Flight Experiments.

作者信息

Doukhi Oualid, Lee Deok-Jin

机构信息

Center for Artificial Intelligence & Autonomous Systems, Kunsan National University, 558 Daehak-ro, Naun 2(i)-dong, Gunsan 54150, Jeollabuk-do, Korea.

School of Mechanical Design Engineering, Smart e-Mobilty Lab, Center for Artificial Intelligence & Autonomous Systems, Jeonbuk National University, 567, Baekje-daero, Deokjin-gu, Jeonju-si 54896, Jeollabuk-do, Korea.

出版信息

Sensors (Basel). 2021 Apr 4;21(7):2534. doi: 10.3390/s21072534.

DOI:10.3390/s21072534
PMID:33916624
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8038595/
Abstract

Autonomous navigation and collision avoidance missions represent a significant challenge for robotics systems as they generally operate in dynamic environments that require a high level of autonomy and flexible decision-making capabilities. This challenge becomes more applicable in micro aerial vehicles (MAVs) due to their limited size and computational power. This paper presents a novel approach for enabling a micro aerial vehicle system equipped with a laser range finder to autonomously navigate among obstacles and achieve a user-specified goal location in a GPS-denied environment, without the need for mapping or path planning. The proposed system uses an actor-critic-based reinforcement learning technique to train the aerial robot in a Gazebo simulator to perform a point-goal navigation task by directly mapping the noisy MAV's state and laser scan measurements to continuous motion control. The obtained policy can perform collision-free flight in the real world while being trained entirely on a 3D simulator. Intensive simulations and real-time experiments were conducted and compared with a nonlinear model predictive control technique to show the generalization capabilities to new unseen environments, and robustness against localization noise. The obtained results demonstrate our system's effectiveness in flying safely and reaching the desired points by planning smooth forward linear velocity and heading rates.

摘要

自主导航和避碰任务对机器人系统来说是一项重大挑战,因为它们通常在需要高度自主性和灵活决策能力的动态环境中运行。由于其尺寸和计算能力有限,这一挑战在微型飞行器(MAV)中更为突出。本文提出了一种新颖的方法,使配备激光测距仪的微型飞行器系统能够在无GPS环境中在障碍物之间自主导航并到达用户指定的目标位置,而无需地图绘制或路径规划。所提出的系统使用基于演员-评论家的强化学习技术,在Gazebo模拟器中训练空中机器人,通过直接将有噪声的微型飞行器状态和激光扫描测量映射到连续运动控制来执行点目标导航任务。所获得的策略能够在现实世界中执行无碰撞飞行,同时完全在三维模拟器上进行训练。进行了大量模拟和实时实验,并与非线性模型预测控制技术进行了比较,以展示其对新的未知环境的泛化能力以及对定位噪声的鲁棒性。所获得的结果证明了我们的系统通过规划平滑的前向线速度和航向速率,在安全飞行和到达期望点方面的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/6b5a21421e22/sensors-21-02534-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/928f9ff06fe6/sensors-21-02534-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/90c660e7e585/sensors-21-02534-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/1f28f5870e84/sensors-21-02534-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/e6da4fbd9a38/sensors-21-02534-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/65bfd7ff2fe7/sensors-21-02534-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/2f911e72d95c/sensors-21-02534-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/cc44319582ba/sensors-21-02534-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/2cf2acb77843/sensors-21-02534-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/275870e0bf3d/sensors-21-02534-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/e46b2e696c8d/sensors-21-02534-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/f73c3592413e/sensors-21-02534-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/06ecc352f2df/sensors-21-02534-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/6b5a21421e22/sensors-21-02534-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/928f9ff06fe6/sensors-21-02534-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/90c660e7e585/sensors-21-02534-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/1f28f5870e84/sensors-21-02534-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/e6da4fbd9a38/sensors-21-02534-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/65bfd7ff2fe7/sensors-21-02534-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/2f911e72d95c/sensors-21-02534-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/cc44319582ba/sensors-21-02534-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/2cf2acb77843/sensors-21-02534-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/275870e0bf3d/sensors-21-02534-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/e46b2e696c8d/sensors-21-02534-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/f73c3592413e/sensors-21-02534-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/06ecc352f2df/sensors-21-02534-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a3/8038595/6b5a21421e22/sensors-21-02534-g013.jpg

相似文献

1
Deep Reinforcement Learning for End-to-End Local Motion Planning of Autonomous Aerial Robots in Unknown Outdoor Environments: Real-Time Flight Experiments.未知户外环境中自主飞行机器人端到端局部运动规划的深度强化学习:实时飞行实验
Sensors (Basel). 2021 Apr 4;21(7):2534. doi: 10.3390/s21072534.
2
Optimal Motion Planning in GPS-Denied Environments Using Nonlinear Model Predictive Horizon.基于非线性模型预测视野的 GPS 拒止环境下最优运动规划。
Sensors (Basel). 2021 Aug 18;21(16):5547. doi: 10.3390/s21165547.
3
Distributed Non-Communicating Multi-Robot Collision Avoidance via Map-Based Deep Reinforcement Learning.基于地图的深度强化学习实现分布式非通信多机器人避碰
Sensors (Basel). 2020 Aug 27;20(17):4836. doi: 10.3390/s20174836.
4
Autonomous localized path planning algorithm for UAVs based on TD3 strategy.基于TD3策略的无人机自主局部路径规划算法
Sci Rep. 2024 Jan 8;14(1):763. doi: 10.1038/s41598-024-51349-4.
5
Navigation Aiding by a Hybrid Laser-Camera Motion Estimator for Micro Aerial Vehicles.用于微型飞行器的混合激光-相机运动估计器辅助导航
Sensors (Basel). 2016 Sep 16;16(9):1516. doi: 10.3390/s16091516.
6
Enabling UAV Navigation with Sensor and Environmental Uncertainty in Cluttered and GPS-Denied Environments.在复杂且无GPS信号的环境中利用传感器和环境不确定性实现无人机导航
Sensors (Basel). 2016 May 10;16(5):666. doi: 10.3390/s16050666.
7
Deep Learning-Based NMPC for Local Motion Planning of Last-Mile Delivery Robot.基于深度学习的最后一英里送货机器人局部运动规划的 NMPC。
Sensors (Basel). 2022 Oct 22;22(21):8101. doi: 10.3390/s22218101.
8
Reinforcement learning-based dynamic obstacle avoidance and integration of path planning.基于强化学习的动态避障与路径规划集成
Intell Serv Robot. 2021;14(5):663-677. doi: 10.1007/s11370-021-00387-2. Epub 2021 Oct 6.
9
Research on obstacle avoidance optimization and path planning of autonomous vehicles based on attention mechanism combined with multimodal information decision-making thoughts of robots.基于注意力机制结合机器人多模态信息决策思想的自动驾驶车辆避障优化与路径规划研究
Front Neurorobot. 2023 Sep 22;17:1269447. doi: 10.3389/fnbot.2023.1269447. eCollection 2023.
10
Autonomous Vision-Based Aerial Grasping for Rotorcraft Unmanned Aerial Vehicles.基于视觉的旋翼机无人机自主空中抓取
Sensors (Basel). 2019 Aug 3;19(15):3410. doi: 10.3390/s19153410.

引用本文的文献

1
White shark optimizer with optimal deep learning based effective unmanned aerial vehicles communication and scene classification.基于最优深度学习的白鲨优化器在无人机有效通信与场景分类中的应用
Sci Rep. 2023 Dec 27;13(1):23041. doi: 10.1038/s41598-023-50064-w.
2
Reinforcement and Curriculum Learning for Off-Road Navigation of an UGV with a 3D LiDAR.基于 3D LiDAR 的 UGV 越野导航的增强与课程学习。
Sensors (Basel). 2023 Mar 18;23(6):3239. doi: 10.3390/s23063239.
3
A Mapless Local Path Planning Approach Using Deep Reinforcement Learning Framework.

本文引用的文献

1
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.
基于深度强化学习框架的无地图局部路径规划方法。
Sensors (Basel). 2023 Feb 10;23(4):2036. doi: 10.3390/s23042036.
4
End-to-End AUV Motion Planning Method Based on Soft Actor-Critic.基于软动作 - 批评家的端到端 AUV 运动规划方法。
Sensors (Basel). 2021 Sep 1;21(17):5893. doi: 10.3390/s21175893.