Suppr超能文献

在嘈杂环境中进行迁移学习研究:使用 T 迷宫进行空间学习中的先驱和后继特征研究。

Investigating Transfer Learning in Noisy Environments: A Study of Predecessor and Successor Features in Spatial Learning Using a T-Maze.

机构信息

Department of Immunology, Kyungpook National University School of Medicine, Daegu 41944, Republic of Korea.

Department of Physiology, Pusan National University School of Medicine, Yangsan 50612, Republic of Korea.

出版信息

Sensors (Basel). 2024 Oct 3;24(19):6419. doi: 10.3390/s24196419.

Abstract

In this study, we investigate the adaptability of artificial agents within a noisy T-maze that use Markov decision processes (MDPs) and successor feature (SF) and predecessor feature (PF) learning algorithms. Our focus is on quantifying how varying the hyperparameters, specifically the reward learning rate (αr) and the eligibility trace decay rate (λ), can enhance their adaptability. Adaptation is evaluated by analyzing the hyperparameters of cumulative reward, step length, adaptation rate, and adaptation step length and the relationships between them using Spearman's correlation tests and linear regression. Our findings reveal that an αr of 0.9 consistently yields superior adaptation across all metrics at a noise level of 0.05. However, the optimal setting for λ varies by metric and context. In discussing these results, we emphasize the critical role of hyperparameter optimization in refining the performance and transfer learning efficacy of learning algorithms. This research advances our understanding of the functionality of PF and SF algorithms, particularly in navigating the inherent uncertainty of transfer learning tasks. By offering insights into the optimal hyperparameter configurations, this study contributes to the development of more adaptive and robust learning algorithms, paving the way for future explorations in artificial intelligence and neuroscience.

摘要

在这项研究中,我们研究了在使用马尔可夫决策过程 (MDP) 和后继特征 (SF) 和前继特征 (PF) 学习算法的嘈杂 T 迷宫中,人工代理的适应性。我们的重点是量化改变超参数,特别是奖励学习率 (αr) 和资格迹衰减率 (λ),如何增强它们的适应性。通过分析累积奖励、步长、适应率和适应步长的超参数,以及使用 Spearman 相关检验和线性回归分析它们之间的关系,来评估适应能力。我们的研究结果表明,在噪声水平为 0.05 时,αr 为 0.9 始终在所有指标上产生优越的适应能力。然而,λ 的最佳设置因指标和上下文而异。在讨论这些结果时,我们强调了超参数优化在改进学习算法的性能和迁移学习效果方面的关键作用。这项研究增进了我们对 PF 和 SF 算法功能的理解,特别是在处理迁移学习任务中的固有不确定性方面。通过提供有关最佳超参数配置的见解,本研究有助于开发更具适应性和鲁棒性的学习算法,为人工智能和神经科学的未来探索铺平道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c97e/11479366/c1d28726cd26/sensors-24-06419-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验