• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度强化学习中的人工代理协同捕猎。

Collaborative hunting in artificial agents with deep reinforcement learning.

机构信息

Graduate School of Informatics, Nagoya University, Nagoya, Japan.

Institute for Advanced Research, Nagoya University, Nagoya, Japan.

出版信息

Elife. 2024 May 7;13:e85694. doi: 10.7554/eLife.85694.

DOI:10.7554/eLife.85694
PMID:38711355
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11076040/
Abstract

Collaborative hunting, in which predators play different and complementary roles to capture prey, has been traditionally believed to be an advanced hunting strategy requiring large brains that involve high-level cognition. However, recent findings that collaborative hunting has also been documented in smaller-brained vertebrates have placed this previous belief under strain. Here, using computational multi-agent simulations based on deep reinforcement learning, we demonstrate that decisions underlying collaborative hunts do not necessarily rely on sophisticated cognitive processes. We found that apparently elaborate coordination can be achieved through a relatively simple decision process of mapping between states and actions related to distance-dependent internal representations formed by prior experience. Furthermore, we confirmed that this decision rule of predators is robust against unknown prey controlled by humans. Our computational ecological results emphasize that collaborative hunting can emerge in various intra- and inter-specific interactions in nature, and provide insights into the evolution of sociality.

摘要

合作狩猎,即捕食者扮演不同且互补的角色来捕获猎物,传统上被认为是一种需要大脑发达并涉及高级认知的先进狩猎策略。然而,最近的发现表明,合作狩猎也存在于大脑较小的脊椎动物中,这使得之前的观点受到了挑战。在这里,我们使用基于深度强化学习的计算多主体模拟,证明了合作狩猎的决策不一定依赖于复杂的认知过程。我们发现,通过一种相对简单的决策过程,即根据先前经验形成的与距离相关的内部表示的状态和动作之间的映射,可以实现明显复杂的协调。此外,我们证实了捕食者的这种决策规则对人类控制的未知猎物具有鲁棒性。我们的计算生态学结果强调了合作狩猎可以在自然界中各种种内和种间的相互作用中出现,并为社会性的进化提供了新的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/eb8f8597af12/elife-85694-fig4-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/af570972dbeb/elife-85694-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/c8cc1cebbb90/elife-85694-fig1-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/f1b716dcb6e5/elife-85694-fig1-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/628435f70d99/elife-85694-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/a2e502f54a26/elife-85694-fig2-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/0104673e7ab0/elife-85694-fig2-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/35cd2f7cfb91/elife-85694-fig2-figsupp3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/3fa897e32b82/elife-85694-fig2-figsupp4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/8147adef2d00/elife-85694-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/bc14c8442ac8/elife-85694-fig3-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/74875c22926d/elife-85694-fig3-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/a609120b0ab4/elife-85694-fig3-figsupp3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/60cc356cc752/elife-85694-fig3-figsupp4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/80f237281472/elife-85694-fig3-figsupp5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/ffc3fc033ff1/elife-85694-fig3-figsupp6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/65a0922ae9a0/elife-85694-fig3-figsupp7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/d3c9706625f4/elife-85694-fig3-figsupp8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/5e0085ad66f4/elife-85694-fig3-figsupp9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/eb80acfb8ad4/elife-85694-fig3-figsupp10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/f9997723b97d/elife-85694-fig3-figsupp11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/ec2d6dd67198/elife-85694-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/12192cf50709/elife-85694-fig4-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/eb8f8597af12/elife-85694-fig4-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/af570972dbeb/elife-85694-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/c8cc1cebbb90/elife-85694-fig1-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/f1b716dcb6e5/elife-85694-fig1-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/628435f70d99/elife-85694-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/a2e502f54a26/elife-85694-fig2-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/0104673e7ab0/elife-85694-fig2-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/35cd2f7cfb91/elife-85694-fig2-figsupp3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/3fa897e32b82/elife-85694-fig2-figsupp4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/8147adef2d00/elife-85694-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/bc14c8442ac8/elife-85694-fig3-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/74875c22926d/elife-85694-fig3-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/a609120b0ab4/elife-85694-fig3-figsupp3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/60cc356cc752/elife-85694-fig3-figsupp4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/80f237281472/elife-85694-fig3-figsupp5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/ffc3fc033ff1/elife-85694-fig3-figsupp6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/65a0922ae9a0/elife-85694-fig3-figsupp7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/d3c9706625f4/elife-85694-fig3-figsupp8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/5e0085ad66f4/elife-85694-fig3-figsupp9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/eb80acfb8ad4/elife-85694-fig3-figsupp10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/f9997723b97d/elife-85694-fig3-figsupp11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/ec2d6dd67198/elife-85694-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/12192cf50709/elife-85694-fig4-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abc3/11076040/eb8f8597af12/elife-85694-fig4-figsupp2.jpg

相似文献

1
Collaborative hunting in artificial agents with deep reinforcement learning.深度强化学习中的人工代理协同捕猎。
Elife. 2024 May 7;13:e85694. doi: 10.7554/eLife.85694.
2
Cultural transmission, competition for prey, and the evolution of cooperative hunting.文化传播、猎物竞争与合作狩猎的进化。
Theor Popul Biol. 2024 Apr;156:12-21. doi: 10.1016/j.tpb.2023.12.005. Epub 2024 Jan 7.
3
Mechanisms of group-hunting in vertebrates.脊椎动物群体捕猎的机制。
Biol Rev Camb Philos Soc. 2023 Oct;98(5):1687-1711. doi: 10.1111/brv.12973. Epub 2023 May 18.
4
Sharing Rewards Undermines Coordinated Hunting.分享奖励会破坏协作狩猎。
J Comput Biol. 2022 Sep;29(9):1022-1030. doi: 10.1089/cmb.2021.0549. Epub 2022 Jun 24.
5
Wolf-pack (Canis lupus) hunting strategies emerge from simple rules in computational simulations.狼群(犬狼疮)的狩猎策略源自计算模拟中的简单规则。
Behav Processes. 2011 Nov;88(3):192-7. doi: 10.1016/j.beproc.2011.09.006. Epub 2011 Sep 28.
6
Co-Evolution of Predator-Prey Ecosystems by Reinforcement Learning Agents.强化学习智能体驱动的捕食者-猎物生态系统的协同进化
Entropy (Basel). 2021 Apr 13;23(4):461. doi: 10.3390/e23040461.
7
Lionfish predators use flared fin displays to initiate cooperative hunting.狮子鱼的捕食者利用展开鱼鳍的展示行为来发起合作捕猎。
Biol Lett. 2014 Jun;10(6). doi: 10.1098/rsbl.2014.0281.
8
Influence of group size on the success of wolves hunting bison.群体规模对狼捕猎野牛成功率的影响。
PLoS One. 2014 Nov 12;9(11):e112884. doi: 10.1371/journal.pone.0112884. eCollection 2014.
9
Simple decision rules underlie collaborative hunting in yellow saddle goatfish.简单的决策规则是黄鞍羊鱼协作狩猎的基础。
Proc Biol Sci. 2018 Jan 31;285(1871). doi: 10.1098/rspb.2017.2488.
10
Echolocating toothed whales use ultra-fast echo-kinetic responses to track evasive prey.使用回声定位的齿鲸利用超快速回声动力学反应来追踪逃避的猎物。
Elife. 2021 Oct 26;10:e68825. doi: 10.7554/eLife.68825.

引用本文的文献

1
Analyzing coordinated group behavior through role-sharing: a pilot study in female 3-on-3 basketball with practical application.通过角色分担分析团队协作行为:女子三人制篮球的初步研究及实际应用
Front Sports Act Living. 2025 May 27;7:1513982. doi: 10.3389/fspor.2025.1513982. eCollection 2025.

本文引用的文献

1
Flexible prediction of opponent motion with internal representation in interception behavior.在拦截行为中,通过内部表示灵活预测对手的运动。
Biol Cybern. 2021 Oct;115(5):473-485. doi: 10.1007/s00422-021-00891-9. Epub 2021 Aug 11.
2
Formalizing planning and information search in naturalistic decision-making.自然决策中的规划与信息搜索形式化。
Nat Neurosci. 2021 Aug;24(8):1051-1064. doi: 10.1038/s41593-021-00866-w. Epub 2021 Jun 21.
3
Promises and challenges of human computational ethology.人类计算行为学的前景与挑战。
Neuron. 2021 Jul 21;109(14):2224-2238. doi: 10.1016/j.neuron.2021.05.021. Epub 2021 Jun 17.
4
Octopuses punch fishes during collaborative interspecific hunting events.章鱼在合作性种间捕猎活动中攻击鱼类。
Ecology. 2021 Mar;102(3):e03266. doi: 10.1002/ecy.3266. Epub 2021 Feb 2.
5
Deep Reinforcement Learning and Its Neuroscientific Implications.深度强化学习及其神经科学意义。
Neuron. 2020 Aug 19;107(4):603-616. doi: 10.1016/j.neuron.2020.06.014. Epub 2020 Jul 13.
6
Spatial planning with long visual range benefits escape from visual predators in complex naturalistic environments.具有长视觉范围的空间规划有利于在复杂的自然环境中躲避视觉捕食者。
Nat Commun. 2020 Jun 16;11(1):3057. doi: 10.1038/s41467-020-16102-1.
7
Human Navigational Strategy for Intercepting an Erratically Moving Target in Chase and Escape Interactions.人类在追逐和逃避互动中拦截不稳定移动目标的导航策略。
J Mot Behav. 2020;52(6):750-760. doi: 10.1080/00222895.2019.1692331. Epub 2019 Dec 2.
8
Human-level performance in 3D multiplayer games with population-based reinforcement learning.基于群体强化学习的 3D 多人游戏中的人类水平表现。
Science. 2019 May 31;364(6443):859-865. doi: 10.1126/science.aau6249.
9
Cognitive Control of Escape Behaviour.逃避行为的认知控制。
Trends Cogn Sci. 2019 Apr;23(4):334-348. doi: 10.1016/j.tics.2019.01.012. Epub 2019 Mar 6.
10
Vector-based navigation using grid-like representations in artificial agents.基于向量的人工代理中使用网格表示的导航。
Nature. 2018 May;557(7705):429-433. doi: 10.1038/s41586-018-0102-6. Epub 2018 May 9.