
Learning efficient navigation in vortical flow fields.

Affiliations

Graduate Aerospace Laboratories, California Institute of Technology, 1200 E California Blvd, Pasadena, CA, 91125, USA.

Computational Science and Engineering Laboratory, ETH Zurich, 8093, Zurich, Switzerland.

Publication information

Nat Commun. 2021 Dec 8;12(1):7143. doi: 10.1038/s41467-021-27015-y.

DOI:10.1038/s41467-021-27015-y
PMID:34880221
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8654940/
Abstract

Efficient point-to-point navigation in the presence of a background flow field is important for robotic applications such as ocean surveying. In such applications, robots may only have knowledge of their immediate surroundings or be faced with time-varying currents, which limits the use of optimal control techniques. Here, we apply a recently introduced Reinforcement Learning algorithm to discover time-efficient navigation policies to steer a fixed-speed swimmer through unsteady two-dimensional flow fields. The algorithm entails inputting environmental cues into a deep neural network that determines the swimmer's actions, and deploying Remember and Forget Experience Replay. We find that the resulting swimmers successfully exploit the background flow to reach the target, but that this success depends on the sensed environmental cue. Surprisingly, a velocity sensing approach significantly outperformed a bio-mimetic vorticity sensing approach, and achieved a near 100% success rate in reaching the target locations while approaching the time-efficiency of optimal navigation trajectories.
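The navigation setting in the abstract (a fixed-speed swimmer advected by an unsteady two-dimensional background flow) can be illustrated with a minimal kinematic sketch. Everything below is an assumption made for illustration: the analytic `vortex_flow` field, the forward-Euler integration, the parameter values, and the two hand-written heading rules are hypothetical stand-ins. They are not the flow, the deep neural network, or the Remember and Forget Experience Replay training used in the paper. The second rule, `compensate_drift`, only hints at why a locally sensed velocity is a useful cue.

```python
import math

def vortex_flow(x, y, t, strength=0.5, drift=0.1):
    """Hypothetical unsteady flow: a divergence-free vortex array drifting in x."""
    k = math.pi
    xs = x - drift * t  # the vortex pattern translates with speed `drift`
    u = strength * math.sin(k * xs) * math.cos(k * y)
    v = -strength * math.cos(k * xs) * math.sin(k * y)
    return u, v

def aim_at_target(dx, dy, u, v, swim_speed):
    """Naive baseline: point straight at the target and ignore the flow."""
    return math.atan2(dy, dx)

def compensate_drift(dx, dy, u, v, swim_speed):
    """Hand-written heuristic (not a learned policy): use the sensed local
    velocity (u, v) to cancel the flow component perpendicular to the bearing."""
    bearing = math.atan2(dy, dx)
    cross = -u * math.sin(bearing) + v * math.cos(bearing)
    ratio = max(-1.0, min(1.0, -cross / swim_speed))  # clamp for asin
    return bearing + math.asin(ratio)

def navigate(start, target, policy, flow=vortex_flow,
             swim_speed=1.0, dt=0.01, radius=0.05, t_max=20.0):
    """Integrate the swimmer position until it is within `radius` of the target.

    Swimmer velocity = fixed-speed self-propulsion along `heading` + local flow.
    Returns (reached, elapsed_time).
    """
    x, y = start
    t = 0.0
    while t < t_max:
        dx, dy = target[0] - x, target[1] - y
        if math.hypot(dx, dy) <= radius:
            return True, t
        u, v = flow(x, y, t)  # the only environmental cue the swimmer senses
        heading = policy(dx, dy, u, v, swim_speed)
        x += (swim_speed * math.cos(heading) + u) * dt
        y += (swim_speed * math.sin(heading) + v) * dt
        t += dt
    return False, t

if __name__ == "__main__":
    for policy in (aim_at_target, compensate_drift):
        reached, elapsed = navigate((0.0, 0.0), (2.0, 1.0), policy)
        print(policy.__name__, reached, round(elapsed, 2))
```

In this toy setting the flow is weaker than the swimming speed, so both rules reach the target. The paper's question is harder: which sensed cue lets a learned policy succeed and approach time-optimal trajectories. A learned policy would replace the `policy` argument here.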


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee3/8654940/4a51d4712e23/41467_2021_27015_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee3/8654940/0f215ba22934/41467_2021_27015_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee3/8654940/9da434216245/41467_2021_27015_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee3/8654940/665914949f15/41467_2021_27015_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee3/8654940/79afb991802c/41467_2021_27015_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee3/8654940/3080b0f9f25f/41467_2021_27015_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee3/8654940/a63ac162c408/41467_2021_27015_Fig7_HTML.jpg
