• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多智能体避掠食者的共识、合作学习与群聚行为

Consensus, cooperative learning, and flocking for multiagent predator avoidance.

作者信息

Young Zachary, Manh La Hung

机构信息

Department of Computer Science and Engineering, Advanced Robotics and Automation (ARA) Laboratory, University of Nevada, Reno, NV, USA.

出版信息

Int J Adv Robot Syst. 2020 Sep 1;17(5). doi: 10.1177/1729881420960342. Epub 2020 Sep 24.

DOI:10.1177/1729881420960342
PMID:34819959
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8609419/
Abstract

Multiagent coordination is highly desirable with many uses in a variety of tasks. In nature, the phenomenon of coordinated flocking is highly common with applications related to defending or escaping from predators. In this article, a hybrid multiagent system that integrates consensus, cooperative learning, and flocking control to determine the direction of attacking predators and learns to flock away from them in a coordinated manner is proposed. This system is entirely distributed requiring only communication between neighboring agents. The fusion of consensus and collaborative reinforcement learning allows agents to cooperatively learn in a variety of multiagent coordination tasks, but this article focuses on flocking away from attacking predators. The results of the flocking show that the agents are able to effectively flock to a target without collision with each other or obstacles. Multiple reinforcement learning methods are evaluated for the task with cooperative learning utilizing function approximation for state-space reduction performing the best. The results of the proposed consensus algorithm show that it provides quick and accurate transmission of information between agents in the flock. Simulations are conducted to show and validate the proposed hybrid system in both one and two predator environments, resulting in an efficient cooperative learning behavior. In the future, the system of using consensus to determine the state and reinforcement learning to learn the states can be applied to additional multiagent tasks.

摘要

多智能体协调在各种任务中有许多用途,非常值得期待。在自然界中,协同集群现象在与防御或躲避捕食者相关的应用中非常常见。本文提出了一种混合多智能体系统,该系统整合了一致性、合作学习和集群控制,以确定攻击捕食者的方向,并学会以协调的方式远离它们。该系统完全分布式,仅需要相邻智能体之间进行通信。一致性和协作强化学习的融合使智能体能够在各种多智能体协调任务中进行合作学习,但本文重点关注远离攻击捕食者的集群行为。集群结果表明,智能体能够有效地聚集到目标位置,且彼此之间或与障碍物无碰撞。针对该任务评估了多种强化学习方法,其中利用函数逼近进行状态空间缩减的合作学习表现最佳。所提出的一致性算法结果表明,它能在集群中的智能体之间快速准确地传输信息。进行了仿真以展示和验证所提出的混合系统在单捕食者和双捕食者环境中的有效性,从而产生高效的合作学习行为。未来,使用一致性来确定状态和强化学习来学习状态的系统可应用于其他多智能体任务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/6f252bc23658/nihms-1638864-f0015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/be38bc9c678a/nihms-1638864-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/12a86be94aa3/nihms-1638864-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/1ef9ddfd6125/nihms-1638864-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/a929040792da/nihms-1638864-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/52afcae75480/nihms-1638864-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/a1956b868b8d/nihms-1638864-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/318e6028e4b1/nihms-1638864-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/923bc76728fd/nihms-1638864-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/a4acd3fb91ad/nihms-1638864-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/262901933482/nihms-1638864-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/4de30469a1fd/nihms-1638864-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/218548fd21df/nihms-1638864-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/6f252bc23658/nihms-1638864-f0015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/be38bc9c678a/nihms-1638864-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/12a86be94aa3/nihms-1638864-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/1ef9ddfd6125/nihms-1638864-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/a929040792da/nihms-1638864-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/52afcae75480/nihms-1638864-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/a1956b868b8d/nihms-1638864-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/318e6028e4b1/nihms-1638864-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/923bc76728fd/nihms-1638864-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/a4acd3fb91ad/nihms-1638864-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/262901933482/nihms-1638864-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/4de30469a1fd/nihms-1638864-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/218548fd21df/nihms-1638864-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0058/8609419/6f252bc23658/nihms-1638864-f0015.jpg

相似文献

1
Consensus, cooperative learning, and flocking for multiagent predator avoidance.多智能体避掠食者的共识、合作学习与群聚行为
Int J Adv Robot Syst. 2020 Sep 1;17(5). doi: 10.1177/1729881420960342. Epub 2020 Sep 24.
2
Leader-Based Flocking of Multiple Swarm Robots in Underwater Environments.基于领导者的多群体水下机器人聚集。
Sensors (Basel). 2023 Jun 2;23(11):5305. doi: 10.3390/s23115305.
3
Distributed Adaptive Flocking Control for Large-Scale Multiagent Systems.大规模多智能体系统的分布式自适应群体控制
IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):3126-3135. doi: 10.1109/TNNLS.2023.3343666. Epub 2025 Feb 6.
4
Collision-Avoiding Flocking With Multiple Fixed-Wing UAVs in Obstacle-Cluttered Environments: A Task-Specific Curriculum- Based MADRL Approach.在障碍物密集环境中多架固定翼无人机的避碰集群:一种基于特定任务课程的多智能体深度强化学习方法
IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):10894-10908. doi: 10.1109/TNNLS.2023.3245124. Epub 2024 Aug 5.
5
Reinforcement Learning With Task Decomposition for Cooperative Multiagent Systems.用于协作多智能体系统的基于任务分解的强化学习
IEEE Trans Neural Netw Learn Syst. 2021 May;32(5):2054-2065. doi: 10.1109/TNNLS.2020.2996209. Epub 2021 May 3.
6
Simulation of a Bio-Inspired Flocking-Based Aggregation Behaviour in Swarm Robotics.群体机器人中基于生物启发的聚集行为的模拟
Biomimetics (Basel). 2024 Nov 1;9(11):668. doi: 10.3390/biomimetics9110668.
7
Learning to flock through reinforcement.通过强化学习来聚集。
Phys Rev E. 2020 Jul;102(1-1):012601. doi: 10.1103/PhysRevE.102.012601.
8
A Local-and-Global Attention Reinforcement Learning Algorithm for Multiagent Cooperative Navigation.一种用于多智能体协作导航的局部与全局注意力强化学习算法
IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):7767-7777. doi: 10.1109/TNNLS.2022.3220798. Epub 2024 Jun 3.
9
Intelligent multiagent coordination based on reinforcement hierarchical neuro-fuzzy models.基于强化分层神经模糊模型的智能多智能体协调
Int J Neural Syst. 2014 Dec;24(8):1450031. doi: 10.1142/S0129065714500312. Epub 2014 Nov 18.
10
Self-organizing neural architectures and cooperative learning in a multiagent environment.多智能体环境中的自组织神经架构与协同学习
IEEE Trans Syst Man Cybern B Cybern. 2007 Dec;37(6):1567-80. doi: 10.1109/tsmcb.2007.907040.

引用本文的文献

1
Collective evolution learning model for vision-based collective motion with collision avoidance.基于视觉的避碰集体运动的集体进化学习模型。
PLoS One. 2023 May 10;18(5):e0270318. doi: 10.1371/journal.pone.0270318. eCollection 2023.
2
Predefined Time Synchronization of Multi-Agent Systems: A Passivity Based Analysis.多智能体系统的预定义时间同步:基于无源的分析。
Sensors (Basel). 2023 Apr 10;23(8):3865. doi: 10.3390/s23083865.
3
Dynamics and risk sharing in groups of selfish individuals.群体中自私个体的动态与风险分担。

本文引用的文献

1
Learning to flock through reinforcement.通过强化学习来聚集。
Phys Rev E. 2020 Jul;102(1-1):012601. doi: 10.1103/PhysRevE.102.012601.
2
Distributed Sensor Fusion for Scalar Field Mapping Using Mobile Sensor Networks.利用移动传感器网络进行标量场映射的分布式传感器融合。
IEEE Trans Cybern. 2013 Apr;43(2):766-78. doi: 10.1109/TSMCB.2012.2215919. Epub 2013 Mar 7.
J Theor Biol. 2023 Apr 7;562:111433. doi: 10.1016/j.jtbi.2023.111433. Epub 2023 Feb 2.