• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的多机器人系统动态场探索与重建用于环境监测

Reinforcement learning-based dynamic field exploration and reconstruction using multi-robot systems for environmental monitoring.

作者信息

Lu Thinh, Sobti Divyam, Talwar Deepak, Wu Wencen

机构信息

Computer Engineering Department, San Jose State University, San Jose, CA, United States.

出版信息

Front Robot AI. 2025 Mar 25;12:1492526. doi: 10.3389/frobt.2025.1492526. eCollection 2025.

DOI:10.3389/frobt.2025.1492526
PMID:40201564
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11975907/
Abstract

In the realm of real-time environmental monitoring and hazard detection, multi-robot systems present a promising solution for exploring and mapping dynamic fields, particularly in scenarios where human intervention poses safety risks. This research introduces a strategy for path planning and control of a group of mobile sensing robots to efficiently explore and reconstruct a dynamic field consisting of multiple non-overlapping diffusion sources. Our approach integrates a reinforcement learning-based path planning algorithm to guide the multi-robot formation in identifying diffusion sources, with a clustering-based method for destination selection once a new source is detected, to enhance coverage and accelerate exploration in unknown environments. Simulation results and real-world laboratory experiments demonstrate the effectiveness of our approach in exploring and reconstructing dynamic fields. This study advances the field of multi-robot systems in environmental monitoring and has practical implications for rescue missions and field explorations.

摘要

在实时环境监测和危险检测领域,多机器人系统为探索和绘制动态区域提供了一种很有前景的解决方案,特别是在人类干预存在安全风险的场景中。本研究介绍了一种针对一组移动传感机器人的路径规划和控制策略,以有效地探索和重建由多个不重叠扩散源组成的动态区域。我们的方法集成了一种基于强化学习的路径规划算法,以指导多机器人编队识别扩散源,并采用一种基于聚类的方法在检测到新源时进行目的地选择,以提高未知环境中的覆盖范围并加速探索。仿真结果和实际实验室实验证明了我们的方法在探索和重建动态区域方面的有效性。这项研究推动了多机器人系统在环境监测领域的发展,对救援任务和野外探索具有实际意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/ac574bd8b658/frobt-12-1492526-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/c88bcfecb460/frobt-12-1492526-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/a9951a8711de/frobt-12-1492526-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/b1e73989fecc/frobt-12-1492526-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/3efb03cd6e46/frobt-12-1492526-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/8c2ee975a50e/frobt-12-1492526-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/798230b481bb/frobt-12-1492526-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/4174bb1bfa04/frobt-12-1492526-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/9fb5683a3d6b/frobt-12-1492526-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/69dae0af5835/frobt-12-1492526-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/8c657d8ef07b/frobt-12-1492526-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/131447172f3f/frobt-12-1492526-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/08321594a816/frobt-12-1492526-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/7d5e5d4b9272/frobt-12-1492526-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/5fc251552798/frobt-12-1492526-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/ac574bd8b658/frobt-12-1492526-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/c88bcfecb460/frobt-12-1492526-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/a9951a8711de/frobt-12-1492526-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/b1e73989fecc/frobt-12-1492526-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/3efb03cd6e46/frobt-12-1492526-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/8c2ee975a50e/frobt-12-1492526-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/798230b481bb/frobt-12-1492526-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/4174bb1bfa04/frobt-12-1492526-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/9fb5683a3d6b/frobt-12-1492526-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/69dae0af5835/frobt-12-1492526-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/8c657d8ef07b/frobt-12-1492526-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/131447172f3f/frobt-12-1492526-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/08321594a816/frobt-12-1492526-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/7d5e5d4b9272/frobt-12-1492526-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/5fc251552798/frobt-12-1492526-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/859d/11975907/ac574bd8b658/frobt-12-1492526-g015.jpg

相似文献

1
Reinforcement learning-based dynamic field exploration and reconstruction using multi-robot systems for environmental monitoring.基于强化学习的多机器人系统动态场探索与重建用于环境监测
Front Robot AI. 2025 Mar 25;12:1492526. doi: 10.3389/frobt.2025.1492526. eCollection 2025.
2
A Complete Coverage Path Planning Algorithm for Lawn Mowing Robots Based on Deep Reinforcement Learning.一种基于深度强化学习的草坪修剪机器人全覆盖路径规划算法
Sensors (Basel). 2025 Jan 12;25(2):416. doi: 10.3390/s25020416.
3
Path planning and collision avoidance methods for distributed multi-robot systems in complex dynamic environments.复杂动态环境下分布式多机器人系统的路径规划与避碰方法
Math Biosci Eng. 2023 Jan;20(1):145-178. doi: 10.3934/mbe.2023008. Epub 2022 Sep 30.
4
Collaborative Complete Coverage and Path Planning for Multi-Robot Exploration.多机器人协作全覆盖路径规划。
Sensors (Basel). 2021 May 26;21(11):3709. doi: 10.3390/s21113709.
5
Distributed Non-Communicating Multi-Robot Collision Avoidance via Map-Based Deep Reinforcement Learning.基于地图的深度强化学习实现分布式非通信多机器人避碰
Sensors (Basel). 2020 Aug 27;20(17):4836. doi: 10.3390/s20174836.
6
The Path Planning of Mobile Robot by Neural Networks and Hierarchical Reinforcement Learning.基于神经网络和分层强化学习的移动机器人路径规划
Front Neurorobot. 2020 Oct 2;14:63. doi: 10.3389/fnbot.2020.00063. eCollection 2020.
7
A survey on autonomous environmental monitoring approaches: towards unifying active sensing and reinforcement learning.自主环境监测方法综述:迈向主动感知与强化学习的统一
Front Robot AI. 2024 Mar 12;11:1336612. doi: 10.3389/frobt.2024.1336612. eCollection 2024.
8
RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning.RL-QPSO网络:用于高效移动机器人路径规划的深度强化学习增强型量子粒子群优化算法
Front Neurorobot. 2025 Jan 8;18:1464572. doi: 10.3389/fnbot.2024.1464572. eCollection 2024.
9
Resilient Multi-Robot Coverage Path Redistribution Using Boustrophedon Decomposition for Environmental Monitoring.基于双向换行分解的弹性多机器人覆盖路径重新分配用于环境监测
Sensors (Basel). 2024 Nov 23;24(23):7482. doi: 10.3390/s24237482.
10
Improved Hybrid Model for Obstacle Detection and Avoidance in Robot Operating System Framework (Rapidly Exploring Random Tree and Dynamic Windows Approach).机器人操作系统框架(快速探索随机树和动态窗口方法)中用于障碍物检测与规避的改进混合模型
Sensors (Basel). 2024 Apr 2;24(7):2262. doi: 10.3390/s24072262.

本文引用的文献

1
Graph Soft Actor-Critic Reinforcement Learning for Large-Scale Distributed Multirobot Coordination.用于大规模分布式多机器人协调的图软演员-评论家强化学习
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):665-676. doi: 10.1109/TNNLS.2023.3329530. Epub 2025 Jan 7.
2
Distributed cooperative Kalman filter constrained by advection-diffusion equation for mobile sensor networks.用于移动传感器网络的受对流扩散方程约束的分布式协作卡尔曼滤波器
Front Robot AI. 2023 Jun 7;10:1175418. doi: 10.3389/frobt.2023.1175418. eCollection 2023.
3
Adaptive Optimal Tracking Control of an Underactuated Surface Vessel Using Actor-Critic Reinforcement Learning.
基于智能体-评判强化学习的欠驱动水面舰艇自适应最优跟踪控制
IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):7520-7533. doi: 10.1109/TNNLS.2022.3214681. Epub 2024 Jun 3.
4
A Survey on Curriculum Learning.课程学习调查
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):4555-4576. doi: 10.1109/TPAMI.2021.3069908. Epub 2022 Aug 4.