在嘈杂环境中进行迁移学习研究：使用 T 迷宫进行空间学习中的先驱和后继特征研究。

Investigating Transfer Learning in Noisy Environments: A Study of Predecessor and Successor Features in Spatial Learning Using a T-Maze.

机构信息

Department of Immunology, Kyungpook National University School of Medicine, Daegu 41944, Republic of Korea.

Department of Physiology, Pusan National University School of Medicine, Yangsan 50612, Republic of Korea.

出版信息

Sensors (Basel). 2024 Oct 3;24(19):6419. doi: 10.3390/s24196419.

DOI:10.3390/s24196419

PMID:39409459

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11479366/

Abstract

In this study, we investigate the adaptability of artificial agents within a noisy T-maze that use Markov decision processes (MDPs) and successor feature (SF) and predecessor feature (PF) learning algorithms. Our focus is on quantifying how varying the hyperparameters, specifically the reward learning rate (αr) and the eligibility trace decay rate (λ), can enhance their adaptability. Adaptation is evaluated by analyzing the hyperparameters of cumulative reward, step length, adaptation rate, and adaptation step length and the relationships between them using Spearman's correlation tests and linear regression. Our findings reveal that an αr of 0.9 consistently yields superior adaptation across all metrics at a noise level of 0.05. However, the optimal setting for λ varies by metric and context. In discussing these results, we emphasize the critical role of hyperparameter optimization in refining the performance and transfer learning efficacy of learning algorithms. This research advances our understanding of the functionality of PF and SF algorithms, particularly in navigating the inherent uncertainty of transfer learning tasks. By offering insights into the optimal hyperparameter configurations, this study contributes to the development of more adaptive and robust learning algorithms, paving the way for future explorations in artificial intelligence and neuroscience.

摘要

在这项研究中，我们研究了在使用马尔可夫决策过程 (MDP) 和后继特征 (SF) 和前继特征 (PF) 学习算法的嘈杂 T 迷宫中，人工代理的适应性。我们的重点是量化改变超参数，特别是奖励学习率 (αr) 和资格迹衰减率 (λ)，如何增强它们的适应性。通过分析累积奖励、步长、适应率和适应步长的超参数，以及使用 Spearman 相关检验和线性回归分析它们之间的关系，来评估适应能力。我们的研究结果表明，在噪声水平为 0.05 时，αr 为 0.9 始终在所有指标上产生优越的适应能力。然而，λ 的最佳设置因指标和上下文而异。在讨论这些结果时，我们强调了超参数优化在改进学习算法的性能和迁移学习效果方面的关键作用。这项研究增进了我们对 PF 和 SF 算法功能的理解，特别是在处理迁移学习任务中的固有不确定性方面。通过提供有关最佳超参数配置的见解，本研究有助于开发更具适应性和鲁棒性的学习算法，为人工智能和神经科学的未来探索铺平道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97e/11479366/c1d28726cd26/sensors-24-06419-g001.jpg

相似文献

Investigating Transfer Learning in Noisy Environments: A Study of Predecessor and Successor Features in Spatial Learning Using a T-Maze.在嘈杂环境中进行迁移学习研究：使用 T 迷宫进行空间学习中的先驱和后继特征研究。

Sensors (Basel). 2024 Oct 3;24(19):6419. doi: 10.3390/s24196419.

Machine Learning-Based Boosted Regression Ensemble Combined with Hyperparameter Tuning for Optimal Adaptive Learning.基于机器学习的增强回归集成与超参数调整相结合，实现最优自适应学习。

Sensors (Basel). 2022 May 16;22(10):3776. doi: 10.3390/s22103776.

Optimizing Machine Learning Algorithms for Landslide Susceptibility Mapping along the Karakoram Highway, Gilgit Baltistan, Pakistan: A Comparative Study of Baseline, Bayesian, and Metaheuristic Hyperparameter Optimization Techniques.优化巴基斯坦吉尔吉特-巴尔蒂斯坦喀喇昆仑公路沿线滑坡易发性制图的机器学习算法：基线、贝叶斯和元启发式超参数优化技术的比较研究

Sensors (Basel). 2023 Aug 1;23(15):6843. doi: 10.3390/s23156843.

Context transfer in reinforcement learning using action-value functions.基于动作值函数的强化学习中的上下文转移

Comput Intell Neurosci. 2014;2014:428567. doi: 10.1155/2014/428567. Epub 2014 Dec 31.

One-shot learning and behavioral eligibility traces in sequential decision making.序列决策中的单次学习和行为资格痕迹。

Elife. 2019 Nov 11;8:e47463. doi: 10.7554/eLife.47463.

Integrated Evolutionary Learning: An Artificial Intelligence Approach to Joint Learning of Features and Hyperparameters for Optimized, Explainable Machine Learning.集成进化学习：一种用于特征和超参数联合学习以实现优化、可解释机器学习的人工智能方法。

Front Artif Intell. 2022 Apr 5;5:832530. doi: 10.3389/frai.2022.832530. eCollection 2022.

Improving sepsis classification performance with artificial intelligence algorithms: A comprehensive overview of healthcare applications.利用人工智能算法提高脓毒症分类性能：医疗保健应用的全面综述。

J Crit Care. 2024 Oct;83:154815. doi: 10.1016/j.jcrc.2024.154815. Epub 2024 May 8.

Online learning of shaping rewards in reinforcement learning.强化学习中的塑造奖励在线学习。

Neural Netw. 2010 May;23(4):541-50. doi: 10.1016/j.neunet.2010.01.001. Epub 2010 Jan 11.

Meta-learning in reinforcement learning.强化学习中的元学习。

Neural Netw. 2003 Jan;16(1):5-9. doi: 10.1016/s0893-6080(02)00228-9.

Toward optimal classifier system performance in non-Markov environments.迈向非马尔可夫环境下的最优分类器系统性能。

Evol Comput. 2000 Winter;8(4):393-418. doi: 10.1162/106365600568239.

引用本文的文献

Noise Resilience of Successor and Predecessor Feature Algorithms in One- and Two-Dimensional Environments.一维和二维环境中后继与前驱特征算法的抗噪能力

Sensors (Basel). 2025 Feb 6;25(3):979. doi: 10.3390/s25030979.

本文引用的文献

Transfer Learning in Deep Reinforcement Learning: A Survey.深度强化学习中的迁移学习：一项综述。

IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13344-13362. doi: 10.1109/TPAMI.2023.3292075. Epub 2023 Oct 3.

Emergence of a predictive model in the hippocampus.海马体中预测模型的出现。

Neuron. 2023 Jun 21;111(12):1952-1965.e5. doi: 10.1016/j.neuron.2023.03.011. Epub 2023 Apr 3.

Neural learning rules for generating flexible predictions and computing the successor representation.用于生成灵活预测和计算后继表示的神经学习规则。

Elife. 2023 Mar 16;12:e80680. doi: 10.7554/eLife.80680.

Rapid learning of predictive maps with STDP and theta phase precession.具有 STDP 和 theta 相位进动的预测图的快速学习。

Elife. 2023 Mar 16;12:e80663. doi: 10.7554/eLife.80663.

Learning predictive cognitive maps with spiking neurons during behavior and replays.在行为和重放期间使用尖峰神经元学习预测性认知图。

Elife. 2023 Mar 16;12:e80671. doi: 10.7554/eLife.80671.

Complementary task representations in hippocampus and prefrontal cortex for generalizing the structure of problems.海马体和前额叶皮层中的互补任务表示，用于推广问题的结构。

Nat Neurosci. 2022 Oct;25(10):1314-1326. doi: 10.1038/s41593-022-01149-8. Epub 2022 Sep 28.

Toward the biological model of the hippocampus as the successor representation agent.朝着海马体的生物模型作为后继表示代理的方向发展。

Biosystems. 2022 Mar;213:104612. doi: 10.1016/j.biosystems.2022.104612. Epub 2022 Jan 29.

Serotonin neurons modulate learning rate through uncertainty.血清素神经元通过不确定性来调节学习率。

Curr Biol. 2022 Feb 7;32(3):586-599.e7. doi: 10.1016/j.cub.2021.12.006. Epub 2021 Dec 21.

The learning of prospective and retrospective cognitive maps within neural circuits.在神经回路中学习前瞻性和回溯性认知图。

Neuron. 2021 Nov 17;109(22):3552-3575. doi: 10.1016/j.neuron.2021.09.034. Epub 2021 Oct 21.

Multi-task reinforcement learning in humans.人类的多任务强化学习。

Nat Hum Behav. 2021 Jun;5(6):764-773. doi: 10.1038/s41562-020-01035-y. Epub 2021 Jan 28.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

在嘈杂环境中进行迁移学习研究：使用 T 迷宫进行空间学习中的先驱和后继特征研究。

Investigating Transfer Learning in Noisy Environments: A Study of Predecessor and Successor Features in Spatial Learning Using a T-Maze.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献