通过同态 - 异稳态价值梯度进行的无聊驱动的好奇学习

Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients.

作者信息

Yu Yen, Chang Acer Y C, Kanai Ryota

机构信息

Araya, Inc., Tokyo, Japan.

出版信息

Front Neurorobot. 2019 Jan 22;12:88. doi: 10.3389/fnbot.2018.00088. eCollection 2018.

DOI:10.3389/fnbot.2018.00088

PMID:30723402

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6349823/

Abstract

This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account on the constructive interplay between boredom and curiosity which gives rise to effective exploration and superior forward model learning. We offer an instrumental view of action selection, in which an action serves to disclose outcomes that have intrinsic meaningfulness to an agent itself. This motivated two central algorithmic ingredients: devaluation and devaluation progress, both underpin agent's cognition concerning intrinsically generated rewards. The two serve as an instantiation of homeostatic and heterostatic intrinsic motivation. A key insight from our algorithm is that the two seemingly opposite motivations can be reconciled-without which exploration and information-gathering cannot be effectively carried out. We supported this claim with empirical evidence, showing that boredom-enabled agents consistently outperformed other curious or explorative agent variants in model building benchmarks based on self-assisted experience accumulation.

摘要

本文提出了同态-异稳态价值梯度（HHVG）算法，作为对无聊和好奇心之间建设性相互作用的一种形式化解释，这种相互作用产生了有效的探索和卓越的前向模型学习。我们提供了一种关于行动选择的工具性观点，其中一个行动旨在揭示对智能体自身具有内在意义的结果。这激发了两个核心算法要素：贬值和贬值进展，二者都支撑着智能体关于内在产生的奖励的认知。这两者是稳态和异稳态内在动机的一种实例化。我们算法的一个关键见解是，这两种看似相反的动机可以协调一致——没有这一点，探索和信息收集就无法有效进行。我们用实证证据支持了这一说法，表明在基于自我辅助经验积累的模型构建基准测试中，受无聊驱动的智能体始终优于其他好奇或探索性的智能体变体。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2d3/6349823/8c05119913e2/fnbot-12-00088-g0001.jpg

相似文献

Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients.通过同态 - 异稳态价值梯度进行的无聊驱动的好奇学习

Front Neurorobot. 2019 Jan 22;12:88. doi: 10.3389/fnbot.2018.00088. eCollection 2018.

Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots.基于置信度的驱动自生成目标的发展机器人技能获取。

Front Psychol. 2013 Nov 26;4:833. doi: 10.3389/fpsyg.2013.00833. eCollection 2013.

Motivation to Learn.学习动机。

Exp Psychol. 2019 Sep;66(5):319-330. doi: 10.1027/1618-3169/a000455. Epub 2019 Oct 11.

Self-organization of early vocal development in infants and machines: the role of intrinsic motivation.婴儿和机器早期发声发展的自组织：内在动机的作用。

Front Psychol. 2014 Jan 16;4:1006. doi: 10.3389/fpsyg.2013.01006. eCollection 2013.

Intrinsically motivated action-outcome learning and goal-based action recall: a system-level bio-constrained computational model.内在动机驱动的动作-结果学习和基于目标的动作回忆：一种系统级的生物约束计算模型。

Neural Netw. 2013 May;41:168-87. doi: 10.1016/j.neunet.2012.09.015. Epub 2012 Oct 4.

An intrinsic value system for developing multiple invariant representations with incremental slowness learning.具有增量缓慢学习功能的多不变表示的内在价值系统。

Front Neurorobot. 2013 May 30;7:9. doi: 10.3389/fnbot.2013.00009. eCollection 2013.

The Emerging Neuroscience of Intrinsic Motivation: A New Frontier in Self-Determination Research.内在动机的新兴神经科学：自我决定研究的新前沿。

Front Hum Neurosci. 2017 Mar 24;11:145. doi: 10.3389/fnhum.2017.00145. eCollection 2017.

Intrinsic motivation, curiosity, and learning: Theory and applications in educational technologies.内在动机、好奇心与学习：教育技术中的理论与应用

Prog Brain Res. 2016;229:257-284. doi: 10.1016/bs.pbr.2016.05.005. Epub 2016 Jul 29.

The measurement and conceptualization of curiosity.好奇心的测量与概念化。

J Genet Psychol. 2006 Jun;167(2):117-35. doi: 10.3200/GNTP.167.2.117-135.

Humans monitor learning progress in curiosity-driven exploration.人类通过好奇心驱动的探索来监测学习进度。

Nat Commun. 2021 Oct 13;12(1):5972. doi: 10.1038/s41467-021-26196-w.

引用本文的文献

Boredom and curiosity: the hunger and the appetite for information.无聊与好奇：对信息的渴望与欲求。

Front Psychol. 2024 Dec 11;15:1514348. doi: 10.3389/fpsyg.2024.1514348. eCollection 2024.

Modeling fashion as an emergent collective behavior of bored individuals.将时尚建模为无聊的个体的一种新兴集体行为。

Sci Rep. 2023 Nov 22;13(1):20480. doi: 10.1038/s41598-023-47749-7.

本文引用的文献

A Failure to Launch: Regulatory Modes and Boredom Proneness.启动失败：监管模式与无聊倾向。

Front Psychol. 2018 Jul 17;9:1126. doi: 10.3389/fpsyg.2018.01126. eCollection 2018.

Boredom: Under-aroused and restless.无聊：缺乏刺激和烦躁不安。

Conscious Cogn. 2018 May;61:24-37. doi: 10.1016/j.concog.2018.03.014. Epub 2018 Apr 7.

Active Inference, Curiosity and Insight.主动推理、好奇心与洞察力。

Neural Comput. 2017 Oct;29(10):2633-2683. doi: 10.1162/neco_a_00999. Epub 2017 Aug 4.

Boredom begets creativity: A solution to the exploitation-exploration trade-off in predictive coding.无聊催生创造力：预测编码中开发-探索权衡问题的一种解决方案。

Biosystems. 2017 Dec;162:168-176. doi: 10.1016/j.biosystems.2017.04.006. Epub 2017 May 4.

Goal-Directed Behavior and Instrumental Devaluation: A Neural System-Level Computational Model.目标导向行为与工具性贬值：一种神经系统层面的计算模型。

Front Behav Neurosci. 2016 Oct 18;10:181. doi: 10.3389/fnbeh.2016.00181. eCollection 2016.

The Unengaged Mind: Defining Boredom in Terms of Attention.未投入的心智：从注意力的角度定义无聊。

Perspect Psychol Sci. 2012 Sep;7(5):482-95. doi: 10.1177/1745691612456044.

The bright side of boredom.无聊的好处。

Front Psychol. 2014 Nov 3;5:1245. doi: 10.3389/fpsyg.2014.01245. eCollection 2014.

On the function of boredom.论无聊感的功能。

Behav Sci (Basel). 2013 Aug 15;3(3):459-472. doi: 10.3390/bs3030459. eCollection 2013 Sep.

An opportunity cost model of subjective effort and task performance.主观努力与任务绩效的机会成本模型。

Behav Brain Sci. 2013 Dec;36(6):661-79. doi: 10.1017/S0140525X12003196.

Characterizing the psychophysiological signature of boredom.描述无聊时的心理生理特征。

Exp Brain Res. 2014 Feb;232(2):481-91. doi: 10.1007/s00221-013-3755-2. Epub 2013 Nov 8.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过同态 - 异稳态价值梯度进行的无聊驱动的好奇学习

Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献