Suppr 超能文献



Joint modeling of reaction times and choice improves parameter identifiability in reinforcement learning models.

Affiliations

Neurosciences Graduate Training Program, Stanford University, Stanford, CA 94305, USA; Helen Wills Neuroscience Institute, University of California, Berkeley, CA, 94720, USA; Department of Psychology, Arizona State University, Tempe, AZ 85287, USA.

Department of Psychology, Arizona State University, Tempe, AZ 85287, USA.

Publication Information

J Neurosci Methods. 2019 Apr 1;317:37-44. doi: 10.1016/j.jneumeth.2019.01.006. Epub 2019 Jan 18.

DOI: 10.1016/j.jneumeth.2019.01.006
PMID: 30664916
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8930195/
Abstract

BACKGROUND

Reinforcement learning models provide excellent descriptions of learning in multiple species across a variety of tasks. Many researchers are interested in relating parameters of reinforcement learning models to neural measures, psychological variables, or experimental manipulations. We demonstrate that parameter identification is difficult because a range of parameter values provide approximately equal-quality fits to data. This identification problem has a large impact on power: we show that a researcher who wants to detect a medium-sized correlation (r = .3) between a variable and learning rate with 80% power must collect 60% more subjects than specified by a typical power analysis, to account for the noise introduced by model fitting.
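The baseline figure behind this power claim can be checked with the standard Fisher z approximation for correlation tests. A minimal sketch — the function name and hard-coded normal quantiles are ours, and the 60% inflation factor is simply the figure reported in the abstract, not derived here:

```python
import math

def n_for_correlation(r):
    """Sample size needed to detect correlation r at two-sided alpha = .05
    with 80% power, via the Fisher z approximation."""
    z_alpha = 1.959964  # standard normal quantile for two-sided alpha = .05
    z_beta = 0.841621   # standard normal quantile for 80% power
    fisher_z = 0.5 * math.log((1 + r) / (1 - r))
    return math.ceil(((z_alpha + z_beta) / fisher_z) ** 2 + 3)

n_typical = n_for_correlation(0.3)       # a typical power analysis: ~85 subjects
n_adjusted = math.ceil(n_typical * 1.6)  # 60% more to absorb model-fitting noise
print(n_typical, n_adjusted)
```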

NEW METHOD

We derive a Bayesian optimal model fitting technique that takes advantage of information contained in choices and reaction times to constrain parameter estimates.
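The general idea of a joint likelihood over choices and reaction times can be sketched as a softmax choice term plus a log-normal RT term whose expected log-RT shrinks as the value difference between options grows. This is an illustration of the approach, not the authors' exact model; the parameter names (`rt_scale`, `rt_sigma`) and the specific RT distribution are our assumptions:

```python
import math

def joint_loglik(choices, rts, rewards, alpha, beta, rt_scale, rt_sigma):
    """Joint log-likelihood of choices (0/1) and RTs (seconds) in a
    two-armed bandit, under Q-learning with learning rate alpha and
    softmax inverse temperature beta. RT term is illustrative: log-normal
    with mean rt_scale minus the absolute value difference."""
    q = [0.5, 0.5]  # initial action values
    ll = 0.0
    for c, rt, r in zip(choices, rts, rewards):
        # softmax probability of the chosen option
        p = 1.0 / (1.0 + math.exp(-beta * (q[c] - q[1 - c])))
        ll += math.log(p)
        # log-normal RT log-density; easier decisions (bigger |Q0 - Q1|) are faster
        mu = rt_scale - abs(q[0] - q[1])
        z = (math.log(rt) - mu) / rt_sigma
        ll += -math.log(rt * rt_sigma * math.sqrt(2 * math.pi)) - 0.5 * z * z
        # Q-learning update
        q[c] += alpha * (r - q[c])
    return ll
```

Because the RT term also depends on the evolving Q-values, it carries extra information about the learning rate that the choice term alone does not — which is the mechanism by which RTs constrain parameter estimates.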

RESULTS

We show using simulation and empirical data that this method substantially improves the ability to recover learning rates.
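The kind of parameter-recovery simulation referred to here can be sketched as: simulate a Q-learning agent with a known learning rate, then recover it by maximizing the choice likelihood over a grid. The reward probabilities, fixed inverse temperature, grid, and seed are all illustrative assumptions:

```python
import math
import random

def simulate(alpha, beta, n_trials, p_reward=(0.8, 0.2), seed=0):
    """Simulate a two-armed bandit learner: Q-learning + softmax choice."""
    rng = random.Random(seed)
    q = [0.5, 0.5]
    choices, rewards = [], []
    for _ in range(n_trials):
        p0 = 1.0 / (1.0 + math.exp(-beta * (q[0] - q[1])))
        c = 0 if rng.random() < p0 else 1
        r = 1.0 if rng.random() < p_reward[c] else 0.0
        choices.append(c)
        rewards.append(r)
        q[c] += alpha * (r - q[c])
    return choices, rewards

def choice_loglik(choices, rewards, alpha, beta):
    """Log-likelihood of the observed choices under candidate parameters."""
    q = [0.5, 0.5]
    ll = 0.0
    for c, r in zip(choices, rewards):
        p = 1.0 / (1.0 + math.exp(-beta * (q[c] - q[1 - c])))
        ll += math.log(p)
        q[c] += alpha * (r - q[c])
    return ll

# Recover the learning rate from choices alone by grid search
# (with beta fixed at its generating value); ideally `best` lands
# near the true alpha of 0.30, but choice-only fits can be noisy.
choices, rewards = simulate(alpha=0.30, beta=5.0, n_trials=500)
grid = [i / 100 for i in range(1, 100)]
best = max(grid, key=lambda a: choice_loglik(choices, rewards, a, 5.0))
```

Repeating this recovery with a joint choice + RT likelihood in place of `choice_loglik` is the comparison the Results section describes.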

COMPARISON WITH EXISTING METHODS

We compare this method against the use of Bayesian priors. We show in simulations that the combined use of Bayesian priors and reaction times confers the highest parameter identifiability. However, in real data where the priors may have been misspecified, the use of Bayesian priors interferes with the ability of reaction time data to improve parameter identifiability.

CONCLUSIONS

We present a simple technique that takes advantage of readily available data to substantially improve the quality of inferences that can be drawn from parameters of reinforcement learning models.


Figures (full-size images):
Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0322/8930195/ad7598655efc/nihms-1784112-f0001.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0322/8930195/6f2c47279586/nihms-1784112-f0002.jpg
Figure 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0322/8930195/30295b91946d/nihms-1784112-f0003.jpg
Figure 4: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0322/8930195/102bfd268dd0/nihms-1784112-f0004.jpg

Similar Articles

1
Joint modeling of reaction times and choice improves parameter identifiability in reinforcement learning models.
J Neurosci Methods. 2019 Apr 1;317:37-44. doi: 10.1016/j.jneumeth.2019.01.006. Epub 2019 Jan 18.
2
The relative merit of empirical priors in non-identifiable and sloppy models: Applications to models of learning and decision-making: Empirical priors.
Psychon Bull Rev. 2018 Dec;25(6):2047-2068. doi: 10.3758/s13423-018-1446-5.
3
ASAS-NANP symposium: Mathematical Modeling in Animal Nutrition: The power of identifiability analysis for dynamic modeling in animal science: a practitioner approach.
J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skad320.
4
Dynamic models of choice.
Behav Res Methods. 2019 Apr;51(2):961-985. doi: 10.3758/s13428-018-1067-y.
5
The drift diffusion model as the choice rule in reinforcement learning.
Psychon Bull Rev. 2017 Aug;24(4):1234-1251. doi: 10.3758/s13423-016-1199-y.
6
Dopaminergic Modulation of Human Intertemporal Choice: A Diffusion Model Analysis Using the D2-Receptor Antagonist Haloperidol.
J Neurosci. 2020 Oct 7;40(41):7936-7948. doi: 10.1523/JNEUROSCI.0592-20.2020. Epub 2020 Sep 18.
7
Simultaneous Hierarchical Bayesian Parameter Estimation for Reinforcement Learning and Drift Diffusion Models: a Tutorial and Links to Neural Data.
Comput Brain Behav. 2020 Dec;3(4):458-471. doi: 10.1007/s42113-020-00084-w. Epub 2020 May 26.
8
The drift diffusion model as the choice rule in inter-temporal and risky choice: A case study in medial orbitofrontal cortex lesion patients and controls.
PLoS Comput Biol. 2020 Apr 20;16(4):e1007615. doi: 10.1371/journal.pcbi.1007615. eCollection 2020 Apr.
9
A reinforcement learning diffusion decision model for value-based decisions.
Psychon Bull Rev. 2019 Aug;26(4):1099-1121. doi: 10.3758/s13423-018-1554-2.
10
Assessing the practical differences between model selection methods in inferences about choice response time tasks.
Psychon Bull Rev. 2019 Aug;26(4):1070-1098. doi: 10.3758/s13423-018-01563-9.

Cited By

1
Further examining how animals weigh conflicting information about reward sources over time.
Anim Cogn. 2025 Jul 30;28(1):74. doi: 10.1007/s10071-025-01982-x.
2
Bayesian Workflow for Generative Modeling in Computational Psychiatry.
Comput Psychiatr. 2025 Mar 25;9(1):76-99. doi: 10.5334/cpsy.116. eCollection 2025.
3
A dopaminergic basis of behavioral control.
4
Frontostriatal and Dopamine Markers of Individual Differences in Reinforcement Learning: A Multi-modal Investigation.
bioRxiv. 2024 Oct 2:2024.09.17.613524. doi: 10.1101/2024.09.17.613524.
5
High stakes slow responding, but do not help overcome Pavlovian biases in humans.
Learn Mem. 2024 Sep 16;31(8). doi: 10.1101/lm.054017.124. Print 2024 Aug.
6
Cardiac-Sympathetic Contractility and Neural Alpha-Band Power: Cross-Modal Collaboration during Approach-Avoidance Conflict.
J Neurosci. 2024 Oct 9;44(41):e2008232024. doi: 10.1523/JNEUROSCI.2008-23.2024.
7
Potential association between suicide risk, aggression, impulsivity, and the somatosensory system.
Soc Cogn Affect Neurosci. 2024 Jul 2;19(1). doi: 10.1093/scan/nsae041.
8
Decomposition of Reinforcement Learning Deficits in Disordered Gambling via Drift Diffusion Modeling and Functional Magnetic Resonance Imaging.
Comput Psychiatr. 2024 Mar 20;8(1):23-45. doi: 10.5334/cpsy.104. eCollection 2024.
9
Human decision making balances reward maximization and policy compression.
PLoS Comput Biol. 2024 Apr 26;20(4):e1012057. doi: 10.1371/journal.pcbi.1012057. eCollection 2024 Apr.
10
Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts.
PLoS Comput Biol. 2024 Mar 29;20(3):e1011950. doi: 10.1371/journal.pcbi.1011950. eCollection 2024 Mar.
11
Test-retest reliability of reinforcement learning parameters.
Behav Res Methods. 2024 Aug;56(5):4582-4599. doi: 10.3758/s13428-023-02203-4. Epub 2023 Sep 8.

References

1
The relative merit of empirical priors in non-identifiable and sloppy models: Applications to models of learning and decision-making: Empirical priors.
Psychon Bull Rev. 2018 Dec;25(6):2047-2068. doi: 10.3758/s13423-018-1446-5.
2
Dissociable effects of surprising rewards on learning and memory.
J Exp Psychol Learn Mem Cogn. 2018 Sep;44(9):1430-1443. doi: 10.1037/xlm0000518. Epub 2018 Mar 19.
3
Frontostriatal and Dopamine Markers of Individual Differences in Reinforcement Learning: A Multi-modal Investigation.
Cereb Cortex. 2018 Dec 1;28(12):4281-4290. doi: 10.1093/cercor/bhx281.
4
More Is Meaningful: The Magnitude Effect in Intertemporal Choice Depends on Self-Control.
Psychol Sci. 2017 Oct;28(10):1443-1454. doi: 10.1177/0956797617711455. Epub 2017 Aug 31.
5
Dynamic Interaction between Reinforcement Learning and Attention in Multidimensional Environments.
Neuron. 2017 Jan 18;93(2):451-463. doi: 10.1016/j.neuron.2016.12.040.
6
Amygdala and Ventral Striatum Make Distinct Contributions to Reinforcement Learning.
Neuron. 2016 Oct 19;92(2):505-517. doi: 10.1016/j.neuron.2016.09.025. Epub 2016 Oct 6.
7
Characterizing a psychiatric symptom dimension related to deficits in goal-directed control.
Elife. 2016 Mar 1;5:e11305. doi: 10.7554/eLife.11305.
8
Neural underpinnings of the evidence accumulator.
Curr Opin Neurobiol. 2016 Apr;37:149-157. doi: 10.1016/j.conb.2016.01.003. Epub 2016 Feb 12.
9
Model-based approaches to neuroimaging: combining reinforcement learning theory with fMRI data.
Wiley Interdiscip Rev Cogn Sci. 2010 Jul;1(4):501-510. doi: 10.1002/wcs.57. Epub 2010 Apr 2.
10
Big Correlations in Little Studies: Inflated fMRI Correlations Reflect Low Statistical Power - Commentary on Vul et al. (2009).
Perspect Psychol Sci. 2009 May;4(3):294-8. doi: 10.1111/j.1745-6924.2009.01127.x.