• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于变分递归神经网络的主动推理对习惯化智能体进行目标导向规划

Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network.

作者信息

Matsumoto Takazumi, Tani Jun

机构信息

Okinawa Institute of Science and Technology, Okinawa 904-0495, Japan.

出版信息

Entropy (Basel). 2020 May 18;22(5):564. doi: 10.3390/e22050564.

DOI:10.3390/e22050564
PMID:33286336
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7517093/
Abstract

It is crucial to ask how agents can achieve goals by generating action plans using only partial models of the world acquired through habituated sensory-motor experiences. Although many existing robotics studies use a forward model framework, there are generalization issues with high degrees of freedom. The current study shows that the predictive coding (PC) and active inference (AIF) frameworks, which employ a generative model, can develop better generalization by learning a prior distribution in a low dimensional latent state space representing probabilistic structures extracted from well habituated sensory-motor trajectories. In our proposed model, learning is carried out by inferring optimal latent variables as well as synaptic weights for maximizing the evidence lower bound, while goal-directed planning is accomplished by inferring latent variables for maximizing the estimated lower bound. Our proposed model was evaluated with both simple and complex robotic tasks in simulation, which demonstrated sufficient generalization in learning with limited training data by setting an intermediate value for a regularization coefficient. Furthermore, comparative simulation results show that the proposed model outperforms a conventional forward model in goal-directed planning, due to the learned prior confining the search of motor plans within the range of habituated trajectories.

摘要

关键在于询问智能体如何通过仅使用通过习惯性感觉运动经验获得的世界局部模型来生成行动计划,从而实现目标。尽管许多现有的机器人研究使用前向模型框架,但存在高自由度的泛化问题。当前研究表明,采用生成模型的预测编码(PC)和主动推理(AIF)框架可以通过在低维潜在状态空间中学习先验分布来实现更好的泛化,该空间表示从习惯性感觉运动轨迹中提取的概率结构。在我们提出的模型中,学习是通过推断最优潜在变量以及突触权重来最大化证据下界来进行的,而目标导向规划则是通过推断潜在变量来最大化估计下界来完成的。我们提出的模型在模拟中用简单和复杂的机器人任务进行了评估,通过为正则化系数设置中间值,证明了在有限训练数据的学习中具有足够的泛化能力。此外,比较模拟结果表明,由于学习到的先验将运动计划的搜索限制在习惯性轨迹范围内,所提出的模型在目标导向规划方面优于传统的前向模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/4faf586adc87/entropy-22-00564-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/9890e41db287/entropy-22-00564-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/95c4a6db9ddd/entropy-22-00564-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/4a93e1c3a839/entropy-22-00564-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/47c6bec2b907/entropy-22-00564-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/e128ea49020a/entropy-22-00564-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/20704108cbfa/entropy-22-00564-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/dfde69a8d354/entropy-22-00564-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/efb5c93c72e2/entropy-22-00564-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/5aa8107d83eb/entropy-22-00564-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/d5253fb7f847/entropy-22-00564-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/3b6b66e0ae54/entropy-22-00564-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/3ec4836a0eeb/entropy-22-00564-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/d8b6c0e35235/entropy-22-00564-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/40cca4c0433e/entropy-22-00564-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/0eac4392ee8a/entropy-22-00564-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/f8e4930c2bd8/entropy-22-00564-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/82add072591c/entropy-22-00564-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/c5248a0a05d4/entropy-22-00564-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/4faf586adc87/entropy-22-00564-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/9890e41db287/entropy-22-00564-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/95c4a6db9ddd/entropy-22-00564-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/4a93e1c3a839/entropy-22-00564-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/47c6bec2b907/entropy-22-00564-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/e128ea49020a/entropy-22-00564-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/20704108cbfa/entropy-22-00564-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/dfde69a8d354/entropy-22-00564-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/efb5c93c72e2/entropy-22-00564-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/5aa8107d83eb/entropy-22-00564-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/d5253fb7f847/entropy-22-00564-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/3b6b66e0ae54/entropy-22-00564-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/3ec4836a0eeb/entropy-22-00564-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/d8b6c0e35235/entropy-22-00564-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/40cca4c0433e/entropy-22-00564-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/0eac4392ee8a/entropy-22-00564-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/f8e4930c2bd8/entropy-22-00564-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/82add072591c/entropy-22-00564-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/c5248a0a05d4/entropy-22-00564-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7517093/4faf586adc87/entropy-22-00564-g019.jpg

相似文献

1
Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network.基于变分递归神经网络的主动推理对习惯化智能体进行目标导向规划
Entropy (Basel). 2020 May 18;22(5):564. doi: 10.3390/e22050564.
2
Goal-Directed Planning and Goal Understanding by Extended Active Inference: Evaluation through Simulated and Physical Robot Experiments.通过扩展主动推理实现目标导向规划与目标理解:基于模拟和物理机器人实验的评估
Entropy (Basel). 2022 Mar 28;24(4):469. doi: 10.3390/e24040469.
3
Bidirectional interaction between visual and motor generative models using Predictive Coding and Active Inference.使用预测编码和主动推理实现视觉和运动生成模型的双向交互。
Neural Netw. 2021 Nov;143:638-656. doi: 10.1016/j.neunet.2021.07.016. Epub 2021 Jul 24.
4
Deep active inference.深度主动推理
Biol Cybern. 2018 Dec;112(6):547-573. doi: 10.1007/s00422-018-0785-7. Epub 2018 Oct 22.
5
A generative spiking neural-network model of goal-directed behaviour and one-step planning.一种用于目标导向行为和一步规划的生成尖峰神经网络模型。
PLoS Comput Biol. 2020 Dec 8;16(12):e1007579. doi: 10.1371/journal.pcbi.1007579. eCollection 2020 Dec.
6
A Novel Predictive-Coding-Inspired Variational RNN Model for Online Prediction and Recognition.一种基于新型预测编码的变分 RNN 模型,用于在线预测和识别。
Neural Comput. 2019 Nov;31(11):2025-2074. doi: 10.1162/neco_a_01228. Epub 2019 Sep 16.
7
History Marginalization Improves Forecasting in Variational Recurrent Neural Networks.历史边缘化改进了变分递归神经网络中的预测。
Entropy (Basel). 2021 Nov 24;23(12):1563. doi: 10.3390/e23121563.
8
Learning, planning, and control in a monolithic neural event inference architecture.整体式神经事件推理架构中的学习、规划和控制。
Neural Netw. 2019 Sep;117:135-144. doi: 10.1016/j.neunet.2019.05.001. Epub 2019 May 21.
9
Initialization of latent space coordinates random linear projections for learning robotic sensory-motor sequences.用于学习机器人感觉运动序列的潜在空间坐标随机线性投影的初始化。
Front Neurorobot. 2022 Sep 14;16:891031. doi: 10.3389/fnbot.2022.891031. eCollection 2022.
10
Anomaly Detection of Time Series With Smoothness-Inducing Sequential Variational Auto-Encoder.基于平滑诱导序列变分自编码器的时间序列异常检测
IEEE Trans Neural Netw Learn Syst. 2021 Mar;32(3):1177-1191. doi: 10.1109/TNNLS.2020.2980749. Epub 2021 Mar 1.

引用本文的文献

1
Active Inference with Dynamic Planning and Information Gain in Continuous Space by Inferring Low-Dimensional Latent States.通过推断低维潜在状态在连续空间中进行动态规划和信息增益的主动推理
Entropy (Basel). 2025 Aug 9;27(8):846. doi: 10.3390/e27080846.
2
Synergizing habits and goals with variational Bayes.协同习惯和目标与变分贝叶斯。
Nat Commun. 2024 May 25;15(1):4461. doi: 10.1038/s41467-024-48577-7.
3
Deep kinematic inference affords efficient and scalable control of bodily movements.深度运动学推理为身体运动提供了高效且可扩展的控制。

本文引用的文献

1
A Novel Predictive-Coding-Inspired Variational RNN Model for Online Prediction and Recognition.一种基于新型预测编码的变分 RNN 模型,用于在线预测和识别。
Neural Comput. 2019 Nov;31(11):2025-2074. doi: 10.1162/neco_a_01228. Epub 2019 Sep 16.
2
Learning, planning, and control in a monolithic neural event inference architecture.整体式神经事件推理架构中的学习、规划和控制。
Neural Netw. 2019 Sep;117:135-144. doi: 10.1016/j.neunet.2019.05.001. Epub 2019 May 21.
3
Does predictive coding have a future?预测编码有未来吗?
Proc Natl Acad Sci U S A. 2023 Dec 19;120(51):e2309058120. doi: 10.1073/pnas.2309058120. Epub 2023 Dec 12.
4
Incremental Learning of Goal-Directed Actions in a Dynamic Environment by a Robot Using Active Inference.机器人在动态环境中使用主动推理进行目标导向动作的增量学习。
Entropy (Basel). 2023 Oct 31;25(11):1506. doi: 10.3390/e25111506.
5
Free energy and inference in living systems.生命系统中的自由能与推理
Interface Focus. 2023 Apr 14;13(3):20220041. doi: 10.1098/rsfs.2022.0041. eCollection 2023 Jun 6.
6
Why it hurts: with freedom comes the biological need for pain.为何疼痛:自由伴随着对疼痛的生物需求。
Anim Cogn. 2023 Jul;26(4):1259-1275. doi: 10.1007/s10071-023-01773-2. Epub 2023 Apr 8.
7
Simulating developmental diversity: Impact of neural stochasticity on atypical flexibility and hierarchy.模拟发育多样性:神经随机性对非典型灵活性和层级结构的影响。
Front Psychiatry. 2023 Mar 15;14:1080668. doi: 10.3389/fpsyt.2023.1080668. eCollection 2023.
8
How the conception of control influences our understanding of actions.控制概念如何影响我们对行动的理解。
Nat Rev Neurosci. 2023 May;24(5):313-329. doi: 10.1038/s41583-023-00691-z. Epub 2023 Mar 30.
9
Goal-Directed Planning and Goal Understanding by Extended Active Inference: Evaluation through Simulated and Physical Robot Experiments.通过扩展主动推理实现目标导向规划与目标理解:基于模拟和物理机器人实验的评估
Entropy (Basel). 2022 Mar 28;24(4):469. doi: 10.3390/e24040469.
10
Neural Dynamics under Active Inference: Plausibility and Efficiency of Information Processing.主动推断下的神经动力学:信息处理的合理性与效率
Entropy (Basel). 2021 Apr 12;23(4):454. doi: 10.3390/e23040454.
Nat Neurosci. 2018 Aug;21(8):1019-1021. doi: 10.1038/s41593-018-0200-7.
4
Hierarchical Active Inference: A Theory of Motivated Control.分层主动推理:动机控制理论。
Trends Cogn Sci. 2018 Apr;22(4):294-306. doi: 10.1016/j.tics.2018.01.009. Epub 2018 Feb 20.
5
The Markov blankets of life: autonomy, active inference and the free energy principle.生命的马尔可夫毯:自主性、主动推断和自由能原理。
J R Soc Interface. 2018 Jan;15(138). doi: 10.1098/rsif.2017.0792.
6
Action understanding and active inference.动作理解与主动推理。
Biol Cybern. 2011 Feb;104(1-2):137-60. doi: 10.1007/s00422-011-0424-z. Epub 2011 Feb 17.
7
Action and behavior: a free-energy formulation.行动与行为:一种自由能表述
Biol Cybern. 2010 Mar;102(3):227-60. doi: 10.1007/s00422-010-0364-z. Epub 2010 Feb 11.
8
Reinforcement learning or active inference?强化学习还是主动推理?
PLoS One. 2009 Jul 29;4(7):e6421. doi: 10.1371/journal.pone.0006421.
9
Emergence of functional hierarchy in a multiple timescale neural network model: a humanoid robot experiment.多时间尺度神经网络模型中功能层次结构的出现:一项仿人机器人实验。
PLoS Comput Biol. 2008 Nov;4(11):e1000220. doi: 10.1371/journal.pcbi.1000220. Epub 2008 Nov 7.
10
Model-based learning for mobile robot navigation from the dynamical systems perspective.从动态系统视角看移动机器人导航的基于模型的学习。
IEEE Trans Syst Man Cybern B Cybern. 1996;26(3):421-36. doi: 10.1109/3477.499793.