• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的改进 hTrihex 机器人全覆盖路径规划

Reinforcement Learning-Based Complete Area Coverage Path Planning for a Modified hTrihex Robot.

机构信息

ROAR Lab, Engineering Product Development, Singapore University of Technology and Design, Singapore 487372, Singapore.

Optoelectronics Research Group, Faculty of Electrical and Electronics Engineering, Ton Duc Thang University, Ho Chi Minh City 700000, Vietnam.

出版信息

Sensors (Basel). 2021 Feb 4;21(4):1067. doi: 10.3390/s21041067.

DOI:10.3390/s21041067
PMID:33557225
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7913922/
Abstract

One of the essential attributes of a cleaning robot is to achieve complete area coverage. Current commercial indoor cleaning robots have fixed morphology and are restricted to clean only specific areas in a house. The results of maximum area coverage are sub-optimal in this case. Tiling robots are innovative solutions for such a coverage problem. These new kinds of robots can be deployed in the cases of cleaning, painting, maintenance, and inspection, which require complete area coverage. Tiling robots' objective is to cover the entire area by reconfiguring to different shapes as per the area requirements. In this context, it is vital to have a framework that enables the robot to maximize the area coverage while minimizing energy consumption. That means it is necessary for the robot to cover the maximum area with the least number of shape reconfigurations possible. The current paper proposes a complete area coverage planning module for the modified hTrihex, a honeycomb-shaped tiling robot, based on the deep reinforcement learning technique. This framework simultaneously generates the tiling shapes and the trajectory with minimum overall cost. In this regard, a convolutional neural network (CNN) with long short term memory (LSTM) layer was trained using the actor-critic experience replay (ACER) reinforcement learning algorithm. The simulation results obtained from the current implementation were compared against the results that were generated through traditional tiling theory models that included zigzag, spiral, and greedy search schemes. The model presented in the current paper was also compared against other methods where this problem was considered as a traveling salesman problem (TSP) solved through genetic algorithm (GA) and ant colony optimization (ACO) approaches. Our proposed scheme generates a path with a minimized cost at a lesser time.

摘要

清洁机器人的一个基本属性是实现完全区域覆盖。当前的商用室内清洁机器人具有固定的形态,只能清洁房屋中的特定区域。在这种情况下,最大区域覆盖的结果是次优的。平铺机器人是解决这种覆盖问题的创新解决方案。这些新型机器人可以在需要完全区域覆盖的清洁、绘画、维护和检查等情况下部署。平铺机器人的目标是通过根据区域要求重新配置为不同的形状来覆盖整个区域。在这种情况下,拥有一个使机器人能够在最小化能耗的同时最大化区域覆盖的框架是至关重要的。这意味着机器人需要用尽可能少的形状重新配置来覆盖最大的区域。本文提出了一种基于深度强化学习技术的改进 hTrihex 蜂窝状平铺机器人的完全区域覆盖规划模块。该框架同时生成具有最小总成本的平铺形状和轨迹。在这方面,使用演员-评论家经验回放(ACER)强化学习算法对具有长短时记忆(LSTM)层的卷积神经网络(CNN)进行了训练。当前实现获得的模拟结果与包括之字形、螺旋形和贪婪搜索方案在内的传统平铺理论模型生成的结果进行了比较。本文提出的模型还与其他将此问题视为通过遗传算法(GA)和蚁群优化(ACO)方法解决的旅行商问题(TSP)的方法进行了比较。我们提出的方案以较少的时间生成了成本最小的路径。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/befc53128404/sensors-21-01067-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/590f35295d1f/sensors-21-01067-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/01f804d3642d/sensors-21-01067-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/8ef1df1bf4d3/sensors-21-01067-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/81cd5a624f5c/sensors-21-01067-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/98f20d70ce5b/sensors-21-01067-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/9bbf08f2e4d0/sensors-21-01067-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/7de99401e445/sensors-21-01067-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/98655ff39a2e/sensors-21-01067-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/cb8b0bbbca6f/sensors-21-01067-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/287d4fef525a/sensors-21-01067-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/7e3b1e090d46/sensors-21-01067-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/befc53128404/sensors-21-01067-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/590f35295d1f/sensors-21-01067-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/01f804d3642d/sensors-21-01067-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/8ef1df1bf4d3/sensors-21-01067-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/81cd5a624f5c/sensors-21-01067-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/98f20d70ce5b/sensors-21-01067-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/9bbf08f2e4d0/sensors-21-01067-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/7de99401e445/sensors-21-01067-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/98655ff39a2e/sensors-21-01067-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/cb8b0bbbca6f/sensors-21-01067-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/287d4fef525a/sensors-21-01067-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/7e3b1e090d46/sensors-21-01067-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7913922/befc53128404/sensors-21-01067-g012.jpg

相似文献

1
Reinforcement Learning-Based Complete Area Coverage Path Planning for a Modified hTrihex Robot.基于强化学习的改进 hTrihex 机器人全覆盖路径规划
Sensors (Basel). 2021 Feb 4;21(4):1067. doi: 10.3390/s21041067.
2
Optimization Complete Area Coverage by Reconfigurable hTrihex Tiling Robot.通过可重构 hTrihex 平铺机器人实现优化的全覆盖区域。
Sensors (Basel). 2020 Jun 3;20(11):3170. doi: 10.3390/s20113170.
3
Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran-A Polyabolo-Inspired Self-Reconfigurable Tiling Robot.基于强化学习的旅行商问题算法在hTetran - A多角形启发的自重构平铺机器人覆盖路径规划中的应用
Sensors (Basel). 2021 Apr 7;21(8):2577. doi: 10.3390/s21082577.
4
Evolutionary Algorithm-Based Complete Coverage Path Planning for Tetriamond Tiling Robots.基于进化算法的 Tetriamond 铺砖机器人全覆盖路径规划
Sensors (Basel). 2020 Jan 13;20(2):445. doi: 10.3390/s20020445.
5
Modified A-Star Algorithm for Efficient Coverage Path Planning in Tetris Inspired Self-Reconfigurable Robot with Integrated Laser Sensor.基于集成激光传感器的 Tetris 启发式自重构机器人的高效覆盖路径规划的改进 A-Star 算法。
Sensors (Basel). 2018 Aug 7;18(8):2585. doi: 10.3390/s18082585.
6
Hierarchical Area-Based and Path-Based Heuristic Approaches for Multirobot Coverage Path Planning with Performance Analysis in Surveillance Systems.基于层次区域和路径的启发式方法在监控系统多机器人覆盖路径规划中的应用及性能分析
Sensors (Basel). 2023 Oct 17;23(20):8533. doi: 10.3390/s23208533.
7
A Novel Cooperative Path Planning for Multi-robot Persistent Coverage with Obstacles and Coverage Period Constraints.一种具有障碍物和覆盖周期约束的多机器人持续覆盖的新型协同路径规划
Sensors (Basel). 2019 Apr 28;19(9):1994. doi: 10.3390/s19091994.
8
A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots.一种基于改进软演员-评论家算法的移动机器人路径规划方法。
Biomimetics (Basel). 2023 Oct 10;8(6):481. doi: 10.3390/biomimetics8060481.
9
Modelling and Control of a Reconfigurable Robot for Achieving Reconfiguration and Locomotion with Different Shapes.具有不同形状的可重构机器人的建模与控制,以实现重构和运动。
Sensors (Basel). 2021 Aug 9;21(16):5362. doi: 10.3390/s21165362.
10
Efficient Coverage Path Planning for Mobile Disinfecting Robots Using Graph-Based Representation of Environment.基于环境图形表示的移动消毒机器人高效覆盖路径规划
Front Robot AI. 2021 Mar 15;8:624333. doi: 10.3389/frobt.2021.624333. eCollection 2021.

引用本文的文献

1
Autonomous Sea Floor Coverage with Constrained Input Autonomous Underwater Vehicles: Integrated Path Planning and Control.具有受限输入的自主水下航行器的自主海底覆盖:集成路径规划与控制
Sensors (Basel). 2025 Feb 9;25(4):1023. doi: 10.3390/s25041023.
2
A Mapless Local Path Planning Approach Using Deep Reinforcement Learning Framework.基于深度强化学习框架的无地图局部路径规划方法。
Sensors (Basel). 2023 Feb 10;23(4):2036. doi: 10.3390/s23042036.
3
Modelling and Control of a Reconfigurable Robot for Achieving Reconfiguration and Locomotion with Different Shapes.

本文引用的文献

1
PPS: Energy-Aware Grid-Based Coverage Path Planning for UAVs Using Area Partitioning in the Presence of NFZs.PPS:在存在 NFZ 的情况下,使用区域划分的基于网格的能量感知无人机覆盖路径规划。
Sensors (Basel). 2020 Jul 3;20(13):3742. doi: 10.3390/s20133742.
2
A Self-Organizing Fuzzy Logic Classifier for Benchmarking Robot-Aided Blasting of Ship Hulls.一种用于基准测试船体机器人辅助爆破的自组织模糊逻辑分类器。
Sensors (Basel). 2020 Jun 5;20(11):3215. doi: 10.3390/s20113215.
3
Optimization Complete Area Coverage by Reconfigurable hTrihex Tiling Robot.
具有不同形状的可重构机器人的建模与控制,以实现重构和运动。
Sensors (Basel). 2021 Aug 9;21(16):5362. doi: 10.3390/s21165362.
4
A Dimensional Comparison between Evolutionary Algorithm and Deep Reinforcement Learning Methodologies for Autonomous Surface Vehicles with Water Quality Sensors.具有水质传感器的自主水面车辆的进化算法与深度强化学习方法的维度比较。
Sensors (Basel). 2021 Apr 19;21(8):2862. doi: 10.3390/s21082862.
5
Locomotion with Pedestrian Aware from Perception Sensor by Pavement Sweeping Reconfigurable Robot.基于感知传感器的行人感知的路面清扫可重构机器人的移动。
Sensors (Basel). 2021 Mar 3;21(5):1745. doi: 10.3390/s21051745.
通过可重构 hTrihex 平铺机器人实现优化的全覆盖区域。
Sensors (Basel). 2020 Jun 3;20(11):3170. doi: 10.3390/s20113170.
4
Table Cleaning Task by Human Support Robot Using Deep Learning Technique.人型支持机器人使用深度学习技术进行的表清洁任务。
Sensors (Basel). 2020 Mar 18;20(6):1698. doi: 10.3390/s20061698.
5
Evolutionary Algorithm-Based Complete Coverage Path Planning for Tetriamond Tiling Robots.基于进化算法的 Tetriamond 铺砖机器人全覆盖路径规划
Sensors (Basel). 2020 Jan 13;20(2):445. doi: 10.3390/s20020445.
6
Modified A-Star Algorithm for Efficient Coverage Path Planning in Tetris Inspired Self-Reconfigurable Robot with Integrated Laser Sensor.基于集成激光传感器的 Tetris 启发式自重构机器人的高效覆盖路径规划的改进 A-Star 算法。
Sensors (Basel). 2018 Aug 7;18(8):2585. doi: 10.3390/s18082585.
7
A neural network approach to complete coverage path planning.一种用于完全覆盖路径规划的神经网络方法。
IEEE Trans Syst Man Cybern B Cybern. 2004 Feb;34(1):718-25. doi: 10.1109/tsmcb.2003.811769.