• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于柯尔莫哥洛夫-阿诺德网络,使用深度强化学习模拟鱼类自主游泳行为。

Simulating fish autonomous swimming behaviours using deep reinforcement learning based on Kolmogorov-Arnold Networks.

作者信息

Li Tao, Zhang Chunze, Zhang Guibin, Zhou Qin, Hou Ji, Diao Wei, Meng Wanwan, Zhang Xujin

机构信息

Southwest Research Institute for Hydraulic and Water Transport Engineering, Chongqing Jiaotong University, Chongqing, People's Republic of China.

The College of River and Ocean Engineering, Chongqing Jiaotong University, Chongqing, People's Republic of China.

出版信息

Bioinspir Biomim. 2025 Jan 16;20(2). doi: 10.1088/1748-3190/ada59c.

DOI:10.1088/1748-3190/ada59c
PMID:39752883
Abstract

The study of fish swimming behaviours and locomotion mechanisms holds significant scientific and engineering value. With the rapid advancements in artificial intelligence, a new method combining deep reinforcement learning (DRL) with computational fluid dynamics has emerged and been applied to simulate the fish's adaptive swimming behaviour, where the complex fish behaviour is decoupled to focus on the fish's response to the hydrodynamic field, and the simulation is driven by reward-based objectives to model the fish's swimming behaviour. However, the scale of this cross-disciplinary method is directly affected by the efficiency of the DRL model. To promote it to more general application scenarios, there is a pressing need for further research on more efficient and economical network architectures to address the challenge of approximating state-value function in high-dimensional, dynamic, and uncertain environments. Building upon a previously proposed computational platform for the simulation of fish autonomous swimming behaviour, we integrated Kolmogorov-Arnold Networks(KANs) and tested their performance in point-to-point swimming and Kármán gait swimming environments. Experimental results demonstrated that, compared to long short-term memory Networks(LSTMs) and multilayer perceptron networks(MLPs), the introduction of KANs significantly enhanced the perception and decision-making abilities of the intelligent fish in complex fluid environments. With a smaller network scale, in the point-to-point swimming case, KANs effectively approximated the state-value function, achieving average reward improvements of up to 88.0% and 94.1% over MLPs and LSTMs networks, respectively, and increased by 766.7% and 105.6% in the Kármán gait swimming case. Under comparable network sizes, the intelligent fish with KANs exhibited faster learning capabilities and more stable swimming performance in complex fluid settings.

摘要

鱼类游泳行为和运动机制的研究具有重要的科学和工程价值。随着人工智能的快速发展,一种将深度强化学习(DRL)与计算流体动力学相结合的新方法应运而生,并被应用于模拟鱼类的自适应游泳行为,其中复杂的鱼类行为被解耦,以专注于鱼类对流体动力场的响应,并且模拟由基于奖励的目标驱动,以对鱼类的游泳行为进行建模。然而,这种跨学科方法的规模直接受到DRL模型效率的影响。为了将其推广到更广泛的应用场景,迫切需要对更高效、更经济的网络架构进行进一步研究,以应对在高维、动态和不确定环境中逼近状态值函数的挑战。基于先前提出的用于模拟鱼类自主游泳行为的计算平台,我们集成了柯尔莫哥洛夫 - 阿诺德网络(KANs),并在点对点游泳和卡门步态游泳环境中测试了它们的性能。实验结果表明,与长短期记忆网络(LSTMs)和多层感知器网络(MLPs)相比,引入KANs显著增强了智能鱼在复杂流体环境中的感知和决策能力。在较小的网络规模下,在点对点游泳情况下,KANs有效地逼近了状态值函数,分别比MLPs和LSTMs网络实现了高达88.0%和94.1%的平均奖励提升,在卡门步态游泳情况下分别提升了766.7%和105.6%。在可比的网络规模下,具有KANs的智能鱼在复杂流体环境中表现出更快的学习能力和更稳定的游泳性能。

相似文献

1
Simulating fish autonomous swimming behaviours using deep reinforcement learning based on Kolmogorov-Arnold Networks.基于柯尔莫哥洛夫-阿诺德网络,使用深度强化学习模拟鱼类自主游泳行为。
Bioinspir Biomim. 2025 Jan 16;20(2). doi: 10.1088/1748-3190/ada59c.
2
Learning obstacle avoidance and predation in complex reef environments with deep reinforcement learning.运用深度强化学习在复杂的礁岩环境中学习回避和捕食。
Bioinspir Biomim. 2024 Aug 7;19(5). doi: 10.1088/1748-3190/ad6544.
3
Kolmogorov-Arnold networks for genomic tasks.用于基因组任务的柯尔莫哥洛夫-阿诺德网络。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf129.
4
A numerical study of fish adaption behaviors in complex environments with a deep reinforcement learning and immersed boundary-lattice Boltzmann method.基于深度强化学习和浸入边界-格子玻尔兹曼方法的复杂环境中鱼类适应行为的数值研究。
Sci Rep. 2021 Jan 18;11(1):1691. doi: 10.1038/s41598-021-81124-8.
5
Efficient collective swimming by harnessing vortices through deep reinforcement learning.通过深度强化学习利用涡旋实现高效集体游动。
Proc Natl Acad Sci U S A. 2018 Jun 5;115(23):5849-5854. doi: 10.1073/pnas.1800923115. Epub 2018 May 21.
6
Design, Modeling, and Visual Learning-Based Control of Soft Robotic Fish Driven by Super-Coiled Polymers.基于超螺旋聚合物驱动的软机器人鱼的设计、建模与视觉学习控制
Front Robot AI. 2022 Mar 4;8:809427. doi: 10.3389/frobt.2021.809427. eCollection 2021.
7
Learning to school in dense configurations with multi-agent deep reinforcement learning.多智能体深度强化学习在密集配置中学习。
Bioinspir Biomim. 2022 Nov 16;18(1). doi: 10.1088/1748-3190/ac9fb5.
8
Using deep reinforcement learning to investigate stretch feedback during swimming of the lamprey.利用深度强化学习研究七鳃鳗游泳时的伸展反馈。
Bioinspir Biomim. 2025 Mar 5;20(2). doi: 10.1088/1748-3190/adb8b1.
9
Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network.基于辅助评论家网络的自动驾驶策略深度强化学习
IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3680-3690. doi: 10.1109/TNNLS.2021.3116063. Epub 2023 Jul 6.
10
An intrusion detection model based on Convolutional Kolmogorov-Arnold Networks.一种基于卷积柯尔莫哥洛夫-阿诺德网络的入侵检测模型。
Sci Rep. 2025 Jan 14;15(1):1917. doi: 10.1038/s41598-024-85083-8.