• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用受秃鹫启发的深度强化学习揭示有风条件下自主热气流翱翔的原理。

Revealing principles of autonomous thermal soaring in windy conditions using vulture-inspired deep reinforcement-learning.

作者信息

Flato Yoav, Harel Roi, Tamar Aviv, Nathan Ran, Beatus Tsevi

机构信息

Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, 9190401, Israel.

Department of Ecology, Evolution, and Behavior, Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, 9190401, Israel.

出版信息

Nat Commun. 2024 Jun 10;15(1):4942. doi: 10.1038/s41467-024-48670-x.

DOI:10.1038/s41467-024-48670-x
PMID:38858356
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11164704/
Abstract

Thermal soaring, a technique used by birds and gliders to utilize updrafts of hot air, is an appealing model-problem for studying motion control and how it is learned by animals and engineered autonomous systems. Thermal soaring has rich dynamics and nontrivial constraints, yet it uses few control parameters and is becoming experimentally accessible. Following recent developments in applying reinforcement learning methods for training deep neural-network (deep-RL) models to soar autonomously both in simulation and real gliders, here we develop a simulation-based deep-RL system to study the learning process of thermal soaring. We find that this process has learning bottlenecks, we define a new efficiency metric and use it to characterize learning robustness, we compare the learned policy to data from soaring vultures, and find that the neurons of the trained network divide into function clusters that evolve during learning. These results pose thermal soaring as a rich yet tractable model-problem for the learning of motion control.

摘要

热气流翱翔是鸟类和滑翔机利用热空气上升气流的一种技术,是研究运动控制以及动物和工程自主系统如何学习运动控制的一个有吸引力的模型问题。热气流翱翔具有丰富的动力学特性和非平凡的约束条件,但它使用的控制参数很少,并且在实验上越来越容易实现。随着最近在应用强化学习方法训练深度神经网络(深度强化学习)模型以在模拟和实际滑翔机中自主翱翔方面的进展,我们在此开发了一个基于模拟的深度强化学习系统来研究热气流翱翔的学习过程。我们发现这个过程存在学习瓶颈,定义了一种新的效率指标并使用它来表征学习的稳健性,将学习到的策略与翱翔秃鹰的数据进行比较,发现训练网络的神经元会分成在学习过程中不断演变的功能簇。这些结果表明,热气流翱翔是学习运动控制的一个丰富但易于处理的模型问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/6203889878d2/41467_2024_48670_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/b229ba2ad5dc/41467_2024_48670_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/e219b1a5fbd4/41467_2024_48670_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/e292921f9513/41467_2024_48670_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/2bcf374c5373/41467_2024_48670_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/714ed13a5ec0/41467_2024_48670_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/e4654bc0c08c/41467_2024_48670_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/d7f28d53a37b/41467_2024_48670_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/8bf176fe87f5/41467_2024_48670_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/4b20682034d8/41467_2024_48670_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/6203889878d2/41467_2024_48670_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/b229ba2ad5dc/41467_2024_48670_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/e219b1a5fbd4/41467_2024_48670_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/e292921f9513/41467_2024_48670_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/2bcf374c5373/41467_2024_48670_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/714ed13a5ec0/41467_2024_48670_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/e4654bc0c08c/41467_2024_48670_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/d7f28d53a37b/41467_2024_48670_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/8bf176fe87f5/41467_2024_48670_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/4b20682034d8/41467_2024_48670_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9158/11164704/6203889878d2/41467_2024_48670_Fig10_HTML.jpg

相似文献

1
Revealing principles of autonomous thermal soaring in windy conditions using vulture-inspired deep reinforcement-learning.利用受秃鹫启发的深度强化学习揭示有风条件下自主热气流翱翔的原理。
Nat Commun. 2024 Jun 10;15(1):4942. doi: 10.1038/s41467-024-48670-x.
2
Learning to soar in turbulent environments.学会在动荡环境中翱翔。
Proc Natl Acad Sci U S A. 2016 Aug 16;113(33):E4877-84. doi: 10.1073/pnas.1606075113. Epub 2016 Aug 1.
3
Opportunistic soaring by birds suggests new opportunities for atmospheric energy harvesting by flying robots.鸟类的伺机翱翔为飞行机器人的大气能量收集提供了新的机会。
J R Soc Interface. 2022 Nov;19(196):20220671. doi: 10.1098/rsif.2022.0671. Epub 2022 Nov 23.
4
Vultures respond to challenges of near-ground thermal soaring by varying bank angle.兀鹫通过改变倾斜角度来应对近地面热上升气流的挑战。
J Exp Biol. 2018 Dec 3;221(Pt 23):jeb174995. doi: 10.1242/jeb.174995.
5
Glider soaring via reinforcement learning in the field.通过强化学习在野外滑翔。
Nature. 2018 Oct;562(7726):236-239. doi: 10.1038/s41586-018-0533-0. Epub 2018 Sep 19.
6
Thermal soaring flight of birds and unmanned aerial vehicles.鸟类和无人机的热升飞行。
Bioinspir Biomim. 2010 Dec;5(4):045003. doi: 10.1088/1748-3182/5/4/045003. Epub 2010 Nov 24.
7
Analysis of UAV Thermal Soaring via Hawk-Inspired Swarm Interaction.基于鹰群交互启发的无人机热气流翱翔分析
Biomimetics (Basel). 2023 Mar 17;8(1):124. doi: 10.3390/biomimetics8010124.
8
How cheap is soaring flight in raptors? A preliminary investigation in freely-flying vultures.猛禽 soaring flight 的成本有多低?对自由飞行的秃鹫的初步调查。
PLoS One. 2014 Jan 15;9(1):e84887. doi: 10.1371/journal.pone.0084887. eCollection 2014.
9
How did extinct giant birds and pterosaurs fly? A comprehensive modeling approach to evaluate soaring performance.已灭绝的巨型鸟类和翼龙是如何飞行的?一种评估翱翔性能的综合建模方法。
PNAS Nexus. 2022 Mar 10;1(1):pgac023. doi: 10.1093/pnasnexus/pgac023. eCollection 2022 Mar.
10
Match between soaring modes of black kites and the fine-scale distribution of updrafts.黑鸢翱翔模式与上升气流精细分布的匹配。
Sci Rep. 2017 Jul 25;7(1):6421. doi: 10.1038/s41598-017-05319-8.

引用本文的文献

1
Optimal dynamic soaring trades off energy harvest and directional flight.最佳动态翱翔在能量获取与定向飞行之间进行权衡。
iScience. 2025 Apr 28;28(6):112540. doi: 10.1016/j.isci.2025.112540. eCollection 2025 Jun 20.

本文引用的文献

1
An artificial neural network explains how bats might use vision for navigation.人工神经网络解释了蝙蝠如何利用视觉进行导航。
Commun Biol. 2022 Dec 3;5(1):1325. doi: 10.1038/s42003-022-04260-5.
2
A reinforcement-based mechanism for discontinuous learning.基于强化的非连续学习机制。
Proc Natl Acad Sci U S A. 2022 Dec 6;119(49):e2215352119. doi: 10.1073/pnas.2215352119. Epub 2022 Nov 28.
3
MoËT: Mixture of Expert Trees and its application to verifiable reinforcement learning.Moët:混合专家树及其在可验证强化学习中的应用。
Neural Netw. 2022 Jul;151:34-47. doi: 10.1016/j.neunet.2022.03.022. Epub 2022 Mar 23.
4
Learning agile and dynamic motor skills for legged robots.学习用于腿部机器人的敏捷和动态运动技能。
Sci Robot. 2019 Jan 16;4(26). doi: 10.1126/scirobotics.aau5872.
5
Reinforcement Learning Enables Resource Partitioning in Foraging Bats.强化学习使觅食蝙蝠实现资源分配。
Curr Biol. 2020 Oct 19;30(20):4096-4102.e6. doi: 10.1016/j.cub.2020.07.079. Epub 2020 Aug 20.
6
Physical limits of flight performance in the heaviest soaring bird.飞行性能的物理极限在最重的翱翔鸟类中。
Proc Natl Acad Sci U S A. 2020 Jul 28;117(30):17884-17890. doi: 10.1073/pnas.1907360117. Epub 2020 Jul 13.
7
An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning.基于深度强化学习的无人船自主路径规划模型。
Sensors (Basel). 2020 Jan 11;20(2):426. doi: 10.3390/s20020426.
8
Vultures respond to challenges of near-ground thermal soaring by varying bank angle.兀鹫通过改变倾斜角度来应对近地面热上升气流的挑战。
J Exp Biol. 2018 Dec 3;221(Pt 23):jeb174995. doi: 10.1242/jeb.174995.
9
Glider soaring via reinforcement learning in the field.通过强化学习在野外滑翔。
Nature. 2018 Oct;562(7726):236-239. doi: 10.1038/s41586-018-0533-0. Epub 2018 Sep 19.
10
Mastering the game of Go without human knowledge.无需人类知识即可掌握围棋游戏。
Nature. 2017 Oct 18;550(7676):354-359. doi: 10.1038/nature24270.