Adaptive learning rate in dynamical binary environments: the signature of adaptive information processing.

Authors

Zhu Changbo, Zhou Ke, Tang Yandong, Tang Fengzhen, Si Bailu

Affiliations

State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, 110016 Liaoning China.

University of Chinese Academy of Sciences, Beijing, 100049 China.

Publication information

Cogn Neurodyn. 2024 Dec;18(6):4009-4031. doi: 10.1007/s11571-024-10128-7. Epub 2024 Oct 21.

DOI:10.1007/s11571-024-10128-7
PMID:39712114
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11655807/
Abstract

Adaptive mechanisms of learning models play critical roles in interpreting the adaptive behavior of humans and animals. Different learning models, ranging from Bayesian models, deep learning, and regression models to reward-based reinforcement learning models, adopt similar update rules. These update rules can be reduced to the same generalized mathematical form: the Rescorla-Wagner equation. In this paper, we construct a hierarchical Bayesian model with an adaptive learning rate for inferring a hidden probability in a dynamical binary environment, and analyze the adaptive behavior of the model on synthetic data. The update rule of the model state turns out to be an extension of the Rescorla-Wagner equation. The adaptive learning rate is modulated by beliefs and environmental uncertainty. Our results underscore the adaptive learning rate as a mechanistic component of efficient and accurate inference, and as a signature of information processing in adaptive machine learning models.
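
As a rough illustration of the update-rule form the abstract refers to, the sketch below compares a fixed-rate Rescorla-Wagner update with a Beta-Bernoulli posterior-mean update rewritten in the same "estimate plus rate times prediction error" form. This is not the authors' hierarchical model: the Beta-Bernoulli learner assumes a stationary hidden probability, and the function names, the prior Beta(1, 1), and the default learning rate are all assumptions chosen for illustration. It only shows how a Bayesian update can reduce to a Rescorla-Wagner-style rule whose learning rate depends on how uncertain the current belief is.

```python
def rescorla_wagner(outcomes, alpha=0.1, v0=0.5):
    """Track a hidden probability from binary outcomes with a fixed learning rate.

    alpha and v0 are illustrative defaults, not values taken from the paper.
    """
    v = v0
    trace = []
    for o in outcomes:
        v += alpha * (o - v)  # estimate moves by alpha times the prediction error
        trace.append(v)
    return trace


def beta_bernoulli(outcomes, a=1.0, b=1.0):
    """Posterior-mean updates of a Beta(a, b) belief in Rescorla-Wagner form.

    The learning rate 1 / (a + b + 1) shrinks as evidence accumulates, so it is
    set by the uncertainty of the belief. Assumes a stationary environment.
    """
    mean = a / (a + b)
    trace, rates = [], []
    for o in outcomes:
        rate = 1.0 / (a + b + 1.0)  # belief-dependent learning rate
        mean += rate * (o - mean)   # equals the exact posterior mean (a + o) / (a + b + 1)
        a += o                      # count of observed ones
        b += 1 - o                  # count of observed zeros
        trace.append(mean)
        rates.append(rate)
    return trace, rates


if __name__ == "__main__":
    observations = [1, 1, 0, 1]
    print(rescorla_wagner(observations, alpha=0.5))
    print(beta_bernoulli(observations))
```

In a changing environment the shrinking rate of the stationary learner would eventually respond too slowly, which is the kind of situation in which a learning rate that also tracks environmental uncertainty, as described in the abstract, becomes relevant.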
