• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度强化学习的捕食者-猎物系统协同进化

Deep-Reinforcement Learning-Based Co-Evolution in a Predator-Prey System.

作者信息

Wang Xueting, Cheng Jun, Wang Lei

机构信息

CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.

Department of Mechanical and Automation Engineering, The Chinese University of Hong Kong, Shatin 999077, Hong Kong, China.

出版信息

Entropy (Basel). 2019 Aug 8;21(8):773. doi: 10.3390/e21080773.

DOI:10.3390/e21080773
PMID:33267487
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7515302/
Abstract

Understanding or estimating the co-evolution processes is critical in ecology, but very challenging. Traditional methods are difficult to deal with the complex processes of evolution and to predict their consequences on nature. In this paper, we use the deep-reinforcement learning algorithms to endow the organism with learning ability, and simulate their evolution process by using the Monte Carlo simulation algorithm in a large-scale ecosystem. The combination of the two algorithms allows organisms to use experiences to determine their behavior through interaction with that environment, and to pass on experience to their offspring. Our research showed that the predators' reinforcement learning ability contributed to the stability of the ecosystem and helped predators obtain a more reasonable behavior pattern of coexistence with its prey. The reinforcement learning effect of prey on its own population was not as good as that of predators and increased the risk of extinction of predators. The inconsistent learning periods and speed of prey and predators aggravated that risk. The co-evolution of the two species had resulted in fewer numbers of their populations due to their potentially antagonistic evolutionary networks. If the learnable predators and prey invade an ecosystem at the same time, prey had an advantage. Thus, the proposed model illustrates the influence of learning mechanism on a predator-prey ecosystem and demonstrates the feasibility of predicting the behavior evolution in a predator-prey ecosystem using AI approaches.

摘要

理解或估计协同进化过程在生态学中至关重要,但极具挑战性。传统方法难以处理复杂的进化过程,也难以预测其对自然界的影响。在本文中,我们使用深度强化学习算法赋予生物体学习能力,并在大规模生态系统中利用蒙特卡罗模拟算法模拟它们的进化过程。这两种算法的结合使生物体能够通过与环境的交互利用经验来决定其行为,并将经验传递给后代。我们的研究表明,捕食者的强化学习能力有助于生态系统的稳定,并帮助捕食者获得与其猎物共存的更合理行为模式。猎物对自身种群的强化学习效果不如捕食者,且增加了捕食者灭绝的风险。猎物和捕食者不一致的学习周期和速度加剧了这种风险。由于它们潜在的对抗性进化网络,这两个物种的协同进化导致了它们种群数量的减少。如果可学习的捕食者和猎物同时侵入一个生态系统,猎物具有优势。因此,所提出的模型说明了学习机制对捕食者 - 猎物生态系统的影响,并证明了使用人工智能方法预测捕食者 - 猎物生态系统中行为进化的可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/cedba49ddbbf/entropy-21-00773-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/7d3e4c73ac59/entropy-21-00773-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/105950ccdf83/entropy-21-00773-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/f2e2a05971c8/entropy-21-00773-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/0e82d298224c/entropy-21-00773-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/cedba49ddbbf/entropy-21-00773-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/7d3e4c73ac59/entropy-21-00773-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/105950ccdf83/entropy-21-00773-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/f2e2a05971c8/entropy-21-00773-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/0e82d298224c/entropy-21-00773-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd62/7515302/cedba49ddbbf/entropy-21-00773-g005.jpg

相似文献

1
Deep-Reinforcement Learning-Based Co-Evolution in a Predator-Prey System.基于深度强化学习的捕食者-猎物系统协同进化
Entropy (Basel). 2019 Aug 8;21(8):773. doi: 10.3390/e21080773.
2
Co-Evolution of Predator-Prey Ecosystems by Reinforcement Learning Agents.强化学习智能体驱动的捕食者-猎物生态系统的协同进化
Entropy (Basel). 2021 Apr 13;23(4):461. doi: 10.3390/e23040461.
3
Supercritical and subcritical Hopf-bifurcations in a two-delayed prey-predator system with density-dependent mortality of predator and strong Allee effect in prey.具有捕食者密度依赖死亡率和猎物强阿利效应的双延迟捕食-食饵系统中的超临界和亚临界霍普夫分岔
Biosystems. 2019 Jun;180:19-37. doi: 10.1016/j.biosystems.2019.02.011. Epub 2019 Mar 7.
4
Collaborative hunting in artificial agents with deep reinforcement learning.深度强化学习中的人工代理协同捕猎。
Elife. 2024 May 7;13:e85694. doi: 10.7554/eLife.85694.
5
Predator coexistence through emergent fitness equalization.通过新兴适应度均衡实现捕食者共存。
Ecology. 2020 May;101(5):e02995. doi: 10.1002/ecy.2995. Epub 2020 Feb 28.
6
Co-evolution as an important component explaining microbial predator-prey interaction.协同进化是解释微生物捕食者 - 猎物相互作用的一个重要组成部分。
J Theor Biol. 2020 Feb 7;486:110095. doi: 10.1016/j.jtbi.2019.110095. Epub 2019 Nov 26.
7
Background evolution in camouflage systems: a predator-prey/pollinator-flower game.背景进化在伪装系统中:捕食者-猎物/传粉者-花的博弈。
J Theor Biol. 2010 Feb 21;262(4):662-78. doi: 10.1016/j.jtbi.2009.09.001. Epub 2009 Sep 10.
8
Evolution of handling time can destroy the coexistence of cycling predators.处理时间的演变会破坏周期性捕食者的共存。
J Evol Biol. 2006 Jan;19(1):49-58. doi: 10.1111/j.1420-9101.2005.00993.x.
9
Spatial Heterogeneity, Indirect Interactions, and the Coexistence of Prey Species.空间异质性、间接相互作用与猎物物种的共存
Am Nat. 1984;124(3):377-406. doi: 10.1086/284280.
10
Influence of Cat Odor on Reproductive Behavior and Physiology in the House Mouse: (Mus Musculus)猫的气味对家鼠(小家鼠)生殖行为和生理的影响

引用本文的文献

1
Co-Evolution of Predator-Prey Ecosystems by Reinforcement Learning Agents.强化学习智能体驱动的捕食者-猎物生态系统的协同进化
Entropy (Basel). 2021 Apr 13;23(4):461. doi: 10.3390/e23040461.

本文引用的文献

1
Enriching behavioral ecology with reinforcement learning methods.用强化学习方法丰富行为生态学。
Behav Processes. 2019 Apr;161:94-100. doi: 10.1016/j.beproc.2018.01.008. Epub 2018 Feb 13.
2
Mastering the game of Go without human knowledge.无需人类知识即可掌握围棋游戏。
Nature. 2017 Oct 18;550(7676):354-359. doi: 10.1038/nature24270.
3
Evolution of Swarming Behavior Is Shaped by How Predators Attack.群体行为的进化受捕食者攻击方式的影响。
Artif Life. 2016 Summer;22(3):299-318. doi: 10.1162/ARTL_a_00206. Epub 2016 May 3.
4
Population and Evolutionary Dynamics based on Predator-Prey Relationships in a 3D Physical Simulation.基于三维物理模拟中捕食者 - 猎物关系的种群与进化动力学
Artif Life. 2016 Spring;22(2):226-40. doi: 10.1162/ARTL_a_00201. Epub 2016 Mar 2.
5
Reinforcement learning improves behaviour from evaluative feedback.强化学习通过评估反馈来改善行为。
Nature. 2015 May 28;521(7553):445-51. doi: 10.1038/nature14540.
6
The learning of action sequences through social transmission.通过社会传播学习动作序列。
Anim Cogn. 2015 Sep;18(5):1093-103. doi: 10.1007/s10071-015-0877-x. Epub 2015 May 26.
7
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.
8
Collective behavior and predation success in a predator-prey model inspired by hunting bats.受捕食蝙蝠启发的捕食者 - 猎物模型中的集体行为与捕食成功率
Phys Rev E Stat Nonlin Soft Matter Phys. 2013 Dec;88(6):062724. doi: 10.1103/PhysRevE.88.062724. Epub 2013 Dec 30.
9
Status and ecological effects of the world's largest carnivores.世界上最大的食肉动物的现状和生态影响。
Science. 2014 Jan 10;343(6167):1241484. doi: 10.1126/science.1241484.
10
Spontaneous emergence of spatial patterns in a predator-prey model.捕食者 - 猎物模型中空间模式的自发出现。
Phys Rev E Stat Nonlin Soft Matter Phys. 2007 Dec;76(6 Pt 1):061902. doi: 10.1103/PhysRevE.76.061902. Epub 2007 Dec 4.