• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于预测强化学习的仿生自主系统目标跟踪控制

Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning.

作者信息

Wang Yu, Wang Jian, Kang Song, Yu Junzhi

机构信息

Department of Automation, Tsinghua University, Beijing 100084, China.

The Laboratory of Cognitive and Decision Intelligence for Complex System, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China.

出版信息

Biomimetics (Basel). 2024 Jan 4;9(1):0. doi: 10.3390/biomimetics9010033.

DOI:10.3390/biomimetics9010033
PMID:38248607
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11154344/
Abstract

Biological fish often swim in a schooling manner, the mechanism of which comes from the fact that these schooling movements can improve the fishes' hydrodynamic efficiency. Inspired by this phenomenon, a target-following control framework for a biomimetic autonomous system is proposed in this paper. Firstly, a following motion model is established based on the mechanism of fish schooling swimming, in which the follower robotic fish keeps a certain distance and orientation from the leader robotic fish. Second, by incorporating a predictive concept into reinforcement learning, a predictive deep deterministic policy gradient-following controller is provided with the normalized state space, action space, reward, and prediction design. It can avoid overshoot to a certain extent. A nonlinear model predictive controller is designed and can be selected for the follower robotic fish, together with the predictive reinforcement learning. Finally, extensive simulations are conducted, including the fix point and dynamic target following for single robotic fish, as well as cooperative following with the leader robotic fish. The obtained results indicate the effectiveness of the proposed methods, providing a valuable sight for the cooperative control of underwater robots to explore the ocean.

摘要

生物鱼常常以群体游动的方式游动,其机制源于这些群体运动能够提高鱼的流体动力学效率。受此现象启发,本文提出了一种用于仿生自主系统的目标跟踪控制框架。首先,基于鱼群游动的机制建立了跟随运动模型,其中跟随的机器鱼与领头的机器鱼保持一定的距离和方向。其次,通过将预测概念融入强化学习,为归一化状态空间、动作空间、奖励和预测设计提供了一种预测深度确定性策略梯度跟随控制器。它可以在一定程度上避免超调。设计了一种非线性模型预测控制器,可与预测强化学习一起用于跟随的机器鱼。最后,进行了广泛的仿真,包括单个机器鱼的定点和动态目标跟踪,以及与领头机器鱼的协同跟踪。所获得的结果表明了所提方法的有效性,为水下机器人探索海洋的协同控制提供了有价值的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/bc254845ce59/biomimetics-09-00033-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/8dfc4b84edb5/biomimetics-09-00033-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/9dbc7d8e9c8e/biomimetics-09-00033-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/09cbb393af1c/biomimetics-09-00033-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/294d7da345f8/biomimetics-09-00033-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/bc254845ce59/biomimetics-09-00033-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/8dfc4b84edb5/biomimetics-09-00033-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/9dbc7d8e9c8e/biomimetics-09-00033-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/09cbb393af1c/biomimetics-09-00033-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/294d7da345f8/biomimetics-09-00033-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7456/11154344/bc254845ce59/biomimetics-09-00033-g008.jpg

相似文献

1
Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning.基于预测强化学习的仿生自主系统目标跟踪控制
Biomimetics (Basel). 2024 Jan 4;9(1):0. doi: 10.3390/biomimetics9010033.
2
Design, Modeling, and Visual Learning-Based Control of Soft Robotic Fish Driven by Super-Coiled Polymers.基于超螺旋聚合物驱动的软机器人鱼的设计、建模与视觉学习控制
Front Robot AI. 2022 Mar 4;8:809427. doi: 10.3389/frobt.2021.809427. eCollection 2021.
3
Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning.通过深度强化学习的仿生水下航行器目标跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3741-3752. doi: 10.1109/TNNLS.2021.3054402. Epub 2022 Aug 3.
4
Efficient collective swimming by harnessing vortices through deep reinforcement learning.通过深度强化学习利用涡旋实现高效集体游动。
Proc Natl Acad Sci U S A. 2018 Jun 5;115(23):5849-5854. doi: 10.1073/pnas.1800923115. Epub 2018 May 21.
5
Toward the Intelligent, Safe Exploration of a Biomimetic Underwater Robot: Modeling, Planning, and Control.迈向仿生水下机器人的智能、安全探索:建模、规划与控制
Biomimetics (Basel). 2024 Feb 21;9(3):126. doi: 10.3390/biomimetics9030126.
6
Bottom-level motion control for robotic fish to swim in groups: modeling and experiments.群体游动仿生机器鱼底层运动控制:建模与实验
Bioinspir Biomim. 2019 May 2;14(4):046001. doi: 10.1088/1748-3190/ab1052.
7
Synchronisation through learning for two self-propelled swimmers.两名自主游泳者通过学习实现同步。
Bioinspir Biomim. 2017 Mar 29;12(3):036001. doi: 10.1088/1748-3190/aa6311.
8
Learning to school in dense configurations with multi-agent deep reinforcement learning.多智能体深度强化学习在密集配置中学习。
Bioinspir Biomim. 2022 Nov 16;18(1). doi: 10.1088/1748-3190/ac9fb5.
9
Exploration of swimming performance for a biomimetic multi-joint robotic fish with a compliant passive joint.具有柔顺被动关节的仿生多关节机器鱼的游泳性能研究。
Bioinspir Biomim. 2020 Dec 21;16(2). doi: 10.1088/1748-3190/abc494.
10
A bio-inspired electrocommunication system for small underwater robots.一种用于小型水下机器人的仿生电通信系统。
Bioinspir Biomim. 2017 Mar 29;12(3):036002. doi: 10.1088/1748-3190/aa61c3.

引用本文的文献

1
Trajectory Tracking Control of Variable Sweep Aircraft Based on Reinforcement Learning.基于强化学习的变后掠翼飞机轨迹跟踪控制
Biomimetics (Basel). 2024 Apr 27;9(5):263. doi: 10.3390/biomimetics9050263.
2
Toward the Intelligent, Safe Exploration of a Biomimetic Underwater Robot: Modeling, Planning, and Control.迈向仿生水下机器人的智能、安全探索:建模、规划与控制
Biomimetics (Basel). 2024 Feb 21;9(3):126. doi: 10.3390/biomimetics9030126.

本文引用的文献

1
In-line swimming dynamics revealed by fish interacting with a robotic mechanism.鱼类与机器人机制相互作用揭示的直线游动动力学。
Elife. 2023 Feb 6;12:e81392. doi: 10.7554/eLife.81392.
2
Quantifying the Leaping Motion Using a Self-Propelled Bionic Robotic Dolphin Platform.使用自推进式仿生机器人海豚平台对跳跃运动进行量化。
Biomimetics (Basel). 2023 Jan 5;8(1):21. doi: 10.3390/biomimetics8010021.
3
Hydrodynamic Modeling and Parameter Identification of a Bionic Underwater Vehicle: RobDact.仿生水下航行器RobDact的流体动力学建模与参数识别
Cyborg Bionic Syst. 2022 May 31;2022:9806328. doi: 10.34133/2022/9806328. eCollection 2022.
4
Thrust Improvement of a Biomimetic Robotic Fish by Using a Deformable Caudal Fin.基于可变形尾鳍的仿生机器人鱼推进性能提升研究
Biomimetics (Basel). 2022 Aug 14;7(3):113. doi: 10.3390/biomimetics7030113.
5
Using a robotic platform to study the influence of relative tailbeat phase on the energetic costs of side-by-side swimming in fish.利用机器人平台研究相对尾鳍摆动相位对鱼类并排游动能量消耗的影响。
Proc Math Phys Eng Sci. 2021 May;477(2249):20200810. doi: 10.1098/rspa.2020.0810. Epub 2021 May 12.
6
Attention-Based Meta-Reinforcement Learning for Tracking Control of AUV With Time-Varying Dynamics.基于注意力的元强化学习用于时变动力学自主水下航行器的跟踪控制
IEEE Trans Neural Netw Learn Syst. 2022 Nov;33(11):6388-6401. doi: 10.1109/TNNLS.2021.3079148. Epub 2022 Oct 27.
7
Exploration of underwater life with an acoustically controlled soft robotic fish.使用声学控制的软体机器鱼探索水下生命。
Sci Robot. 2018 Mar 21;3(16). doi: 10.1126/scirobotics.aar3449.
8
Tuna robotics: A high-frequency experimental platform exploring the performance space of swimming fishes.金枪鱼机器人:一个探索游泳鱼类性能空间的高频实验平台。
Sci Robot. 2019 Sep 18;4(34). doi: 10.1126/scirobotics.aax4615.
9
Vortex phase matching as a strategy for schooling in robots and in fish.涡旋相位匹配作为机器人和鱼类群体游动的策略。
Nat Commun. 2020 Oct 26;11(1):5408. doi: 10.1038/s41467-020-19086-0.
10
Tunabot Flex: a tuna-inspired robot with body flexibility improves high-performance swimming.金枪鱼启发的柔性机器人提高高性能游泳能力
Bioinspir Biomim. 2021 Mar 5;16(2). doi: 10.1088/1748-3190/abb86d.