• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过无重置分层强化学习实现机器人游泳者的趋化导航。

Chemotactic navigation in robotic swimmers via reset-free hierarchical reinforcement learning.

作者信息

Xiong Tongzhao, Liu Zhaorong, Wang Yufei, Ong Chong Jin, Zhu Lailai

机构信息

Department of Mechanical Engineering, National University of Singapore, Singapore, 117575, Singapore.

Department of Physics, University of Science and Technology of China, Hefei, Anhui, 230026, People's Republic of China.

出版信息

Nat Commun. 2025 Jul 1;16(1):5441. doi: 10.1038/s41467-025-60646-z.

DOI:10.1038/s41467-025-60646-z
PMID:40593554
Abstract

Microorganisms have evolved diverse strategies to propel themselves in viscous fluids, navigate complex environments, and exhibit taxis in response to stimuli. This has inspired the development of miniature robots, where artificial intelligence (AI) is playing an increasingly important role. Can AI endow these synthetic systems with intelligence akin to that honed through natural evolution? Here, we demonstrate, in silico, chemotactic navigation in a multi-link robotic model using two-level hierarchical reinforcement learning (RL). The lower-level RL allows the model-configured as a chain or ring topology-to acquire topology-adapted swimming gaits: wave propagation characteristic of flagella or body oscillation akin to an amoebae. Such chain and ring swimmers, further enabled by the higher-level RL, accomplish chemotactic navigation in prototypical biologically relevant scenarios that feature conflicting chemoattractants, pursuing a swimming bacterial mimic, steering in vortical flows, and squeezing through tight constrictions. Additionally, we achieve reset-free RL under partial observability, where simulated robots rely solely on local scalar observations rather than global or vectorial data. This advancement illuminates potential solutions for overcoming persistent challenges of manual resets and partial observability in real-world microrobotic RL.

摘要

微生物已经进化出多种策略,以便在粘性流体中推动自身前进、在复杂环境中导航并对刺激做出趋性反应。这激发了微型机器人的发展,其中人工智能(AI)正发挥着越来越重要的作用。人工智能能否赋予这些合成系统类似于通过自然进化磨练出来的智能?在此,我们在计算机模拟中展示了使用两级分层强化学习(RL)在多连杆机器人模型中的趋化导航。较低级别的强化学习使配置为链状或环状拓扑结构的模型能够获得适应拓扑结构的游泳步态:类似于鞭毛的波传播特性或类似于变形虫的身体振荡。这种链状和环状游泳者在更高级别的强化学习的进一步支持下,在具有相互冲突的化学引诱剂的典型生物相关场景中完成趋化导航,追逐模仿游泳细菌的目标,在涡流中转向,并挤过狭窄通道。此外,我们在部分可观测性下实现了无重置强化学习,即模拟机器人仅依靠局部标量观测而不是全局或矢量数据。这一进展为克服现实世界微型机器人强化学习中手动重置和部分可观测性的持续挑战提供了潜在解决方案。

相似文献

1
Chemotactic navigation in robotic swimmers via reset-free hierarchical reinforcement learning.通过无重置分层强化学习实现机器人游泳者的趋化导航。
Nat Commun. 2025 Jul 1;16(1):5441. doi: 10.1038/s41467-025-60646-z.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
Are There Differences in Accuracy or Outcomes Scores Among Navigated, Robotic, Patient-specific Instruments or Standard Cutting Guides in TKA? A Network Meta-analysis.导航、机器人、患者特异性器械与标准截骨导板在 TKA 中准确性或结果评分是否存在差异?一项网络荟萃分析。
Clin Orthop Relat Res. 2020 Sep;478(9):2105-2116. doi: 10.1097/CORR.0000000000001324.
4
Prenatal detection of congenital heart defects using the deep learning-based image and video analysis: protocol for Clinical Artificial Intelligence in Fetal Echocardiography (CAIFE), an international multicentre multidisciplinary study.使用基于深度学习的图像和视频分析进行先天性心脏缺陷的产前检测:胎儿超声心动图临床人工智能(CAIFE)方案,一项国际多中心多学科研究。
BMJ Open. 2025 Jun 5;15(6):e101263. doi: 10.1136/bmjopen-2025-101263.
5
EORTC guidelines for the use of erythropoietic proteins in anaemic patients with cancer: 2006 update.欧洲癌症研究与治疗组织(EORTC)癌症贫血患者促红细胞生成蛋白使用指南:2006年更新版
Eur J Cancer. 2007 Jan;43(2):258-70. doi: 10.1016/j.ejca.2006.10.014. Epub 2006 Dec 19.
6
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
7
Carbon dioxide detection for diagnosis of inadvertent respiratory tract placement of enterogastric tubes in children.用于诊断儿童肠胃管意外置入呼吸道的二氧化碳检测
Cochrane Database Syst Rev. 2025 Feb 19;2(2):CD011196. doi: 10.1002/14651858.CD011196.pub2.
8
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
9
AI-Driven Antimicrobial Peptide Discovery: Mining and Generation.人工智能驱动的抗菌肽发现:挖掘与生成
Acc Chem Res. 2025 Jun 17;58(12):1831-1846. doi: 10.1021/acs.accounts.0c00594. Epub 2025 Jun 3.
10
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.

引用本文的文献

1
Reinforcement learning of a biflagellate model microswimmer.双鞭毛模型微游动器的强化学习
Eur Phys J E Soft Matter. 2025 Aug 13;48(8-9):50. doi: 10.1140/epje/s10189-025-00513-3.

本文引用的文献

1
Reinforcement learning of biomimetic navigation: a model problem for sperm chemotaxis.仿生导航的强化学习:精子趋化性的模型问题。
Eur Phys J E Soft Matter. 2024 Sep 27;47(9):59. doi: 10.1140/epje/s10189-024-00451-6.
2
Medical Microrobots.医疗微机器人。
Annu Rev Biomed Eng. 2024 Jul;26(1):561-591. doi: 10.1146/annurev-bioeng-081523-033131. Epub 2024 Jun 20.
3
Smart active particles learn and transcend bacterial foraging strategies.智能活性粒子学习并超越细菌觅食策略。
Proc Natl Acad Sci U S A. 2024 Apr 9;121(15):e2317618121. doi: 10.1073/pnas.2317618121. Epub 2024 Apr 1.
4
Smart Magnetic Microrobots Learn to Swim with Deep Reinforcement Learning.智能磁性微型机器人通过深度强化学习学会游泳。
Adv Intell Syst. 2022 Oct;4(10). doi: 10.1002/aisy.202200023. Epub 2022 Jul 6.
5
Delivering drugs with microrobots.用微型机器人投递药物。
Science. 2023 Dec 8;382(6675):1120-1122. doi: 10.1126/science.adh3073. Epub 2023 Dec 7.
6
Chemotaxis of an elastic flagellated microrobot.弹性鞭毛微型机器人的趋化性。
Phys Rev E. 2023 Oct;108(4-1):044408. doi: 10.1103/PhysRevE.108.044408.
7
Learning to swim efficiently in a nonuniform flow field.学习在非均匀流场中高效游泳。
Phys Rev E. 2023 Jun;107(6-2):065102. doi: 10.1103/PhysRevE.107.065102.
8
Optimal navigation of a smart active particle: directional and distance sensing.智能主动粒子的最优导航:方向和距离感知。
Eur Phys J E Soft Matter. 2023 Jun 19;46(6):48. doi: 10.1140/epje/s10189-023-00309-3.
9
Steering undulatory micro-swimmers in a fluid flow through reinforcement learning.通过强化学习在流场中引导波动微游泳者。
Eur Phys J E Soft Matter. 2023 Jun 12;46(6):43. doi: 10.1140/epje/s10189-023-00293-8.
10
Learning efficient navigation in vortical flow fields.学习在涡旋流场中的有效导航。
Nat Commun. 2021 Dec 8;12(1):7143. doi: 10.1038/s41467-021-27015-y.