• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过强化学习在交互式滑坡模拟器工具中理解人类决策

Understanding Human Decision Making in an Interactive Landslide Simulator Tool via Reinforcement Learning.

作者信息

Chaturvedi Pratik, Dutt Varun

机构信息

Applied Cognitive Science Laboratory, Indian Institute of Technology Mandi, Mandi, India.

Defence Terrain Research Laboratory, Defence Research and Development Organization, New Delhi, India.

出版信息

Front Psychol. 2021 Feb 10;11:499422. doi: 10.3389/fpsyg.2020.499422. eCollection 2020.

DOI:10.3389/fpsyg.2020.499422
PMID:33643103
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7902924/
Abstract

Prior research has used an Interactive Landslide Simulator (ILS) tool to investigate human decision making against landslide risks. It has been found that repeated feedback in the ILS tool about damages due to landslides causes an improvement in human decisions against landslide risks. However, little is known on how theories of learning from feedback (e.g., reinforcement learning) would account for human decisions in the ILS tool. The primary goal of this paper is to account for human decisions in the ILS tool via computational models based upon reinforcement learning and to explore the model mechanisms involved when people make decisions in the ILS tool. Four different reinforcement-learning models were developed and evaluated in their ability to capture human decisions in an experiment involving two conditions in the ILS tool. The parameters of an Expectancy-Valence (EV) model, two Prospect-Valence-Learning models (PVL and PVL-2), a combination EV-PU model, and a random model were calibrated to human decisions in the ILS tool across the two conditions. Later, different models with their calibrated parameters were generalized to data collected in an experiment involving a new condition in ILS. When generalized to this new condition, the PVL-2 model's parameters of both damage-feedback conditions outperformed all other RL models (including the random model). We highlight the implications of our results for decision making against landslide risks.

摘要

先前的研究使用了交互式滑坡模拟器(ILS)工具来调查人们针对滑坡风险所做的决策。研究发现,ILS工具中关于滑坡造成的损害的反复反馈会使人们针对滑坡风险所做的决策得到改善。然而,对于从反馈中学习的理论(例如强化学习)如何解释人们在ILS工具中的决策,我们知之甚少。本文的主要目标是通过基于强化学习的计算模型来解释人们在ILS工具中的决策,并探索人们在ILS工具中做出决策时所涉及的模型机制。在一项涉及ILS工具两种情况的实验中,开发并评估了四种不同的强化学习模型捕捉人类决策的能力。将期望效价(EV)模型、两种前景效价学习模型(PVL和PVL-2)、组合的EV-PU模型以及随机模型的参数校准为ILS工具中两种情况下的人类决策。之后,将具有校准参数的不同模型推广到在涉及ILS新情况的实验中收集的数据。当推广到这种新情况时,PVL-2模型在两种损害反馈情况下的参数均优于所有其他强化学习模型(包括随机模型)。我们强调了我们的结果对针对滑坡风险进行决策的意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/4db53934db53/fpsyg-11-499422-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/1b41cc936150/fpsyg-11-499422-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/94999bc71cc6/fpsyg-11-499422-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/b19b7ce08cef/fpsyg-11-499422-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/1225087344c2/fpsyg-11-499422-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/dbaec10a954f/fpsyg-11-499422-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/4db53934db53/fpsyg-11-499422-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/1b41cc936150/fpsyg-11-499422-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/94999bc71cc6/fpsyg-11-499422-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/b19b7ce08cef/fpsyg-11-499422-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/1225087344c2/fpsyg-11-499422-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/dbaec10a954f/fpsyg-11-499422-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4f/7902924/4db53934db53/fpsyg-11-499422-g006.jpg

相似文献

1
Understanding Human Decision Making in an Interactive Landslide Simulator Tool via Reinforcement Learning.通过强化学习在交互式滑坡模拟器工具中理解人类决策
Front Psychol. 2021 Feb 10;11:499422. doi: 10.3389/fpsyg.2020.499422. eCollection 2020.
2
Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China.利用 SMOTE 优化机器学习方法在浙江省丽水市滑坡易发性制图中的预测能力。
Int J Environ Res Public Health. 2019 Jan 28;16(3):368. doi: 10.3390/ijerph16030368.
3
Rainfall-Induced Landslide Prediction Using Machine Learning Models: The Case of Ngororero District, Rwanda.基于机器学习模型的降雨诱发滑坡预测:以卢旺达恩戈罗恩戈罗区为例。
Int J Environ Res Public Health. 2020 Jun 10;17(11):4147. doi: 10.3390/ijerph17114147.
4
Validating the PVL-Delta model for the Iowa gambling task.验证用于爱荷华赌博任务的 PVL-Delta 模型。
Front Psychol. 2013 Dec 3;4:898. doi: 10.3389/fpsyg.2013.00898. eCollection 2013.
5
Comparison of Random Forest Model and Frequency Ratio Model for Landslide Susceptibility Mapping (LSM) in Yunyang County (Chongqing, China).随机森林模型与频率比模型在渝阳区(中国重庆)滑坡易发性制图(LSM)中的比较。
Int J Environ Res Public Health. 2020 Jun 12;17(12):4206. doi: 10.3390/ijerph17124206.
6
A new computational account of cognitive control over reinforcement-based decision-making: Modeling of a probabilistic learning task.一种关于对基于强化的决策进行认知控制的新计算解释:概率学习任务的建模
Neural Netw. 2015 Nov;71:112-23. doi: 10.1016/j.neunet.2015.08.006. Epub 2015 Aug 20.
7
Reinforcement-based decision making in corticostriatal circuits: mutual constraints by neurocomputational and diffusion models.基于强化的皮质纹状体回路决策:神经计算和扩散模型的相互约束。
Neural Comput. 2012 May;24(5):1186-229. doi: 10.1162/NECO_a_00270. Epub 2012 Feb 1.
8
Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-Oshima Volcanic Island, Japan.评估先进的随机森林和决策树算法在日本伊豆大岛火山岛降雨诱发滑坡敏感性建模中的应用。
Sci Total Environ. 2019 Apr 20;662:332-346. doi: 10.1016/j.scitotenv.2019.01.221. Epub 2019 Jan 21.
9
A multidimensional stability model for predicting shallow landslide size and shape across landscapes.一种用于预测不同地貌浅层滑坡大小和形状的多维稳定性模型。
J Geophys Res Earth Surf. 2014 Nov;119(11):2481-2504. doi: 10.1002/2014JF003135. Epub 2014 Nov 26.
10
Human and machine learning in non-Markovian decision making.非马尔可夫决策中的人类与机器学习
PLoS One. 2015 Apr 21;10(4):e0123105. doi: 10.1371/journal.pone.0123105. eCollection 2015.

本文引用的文献

1
How the threat of losses makes people explore more than the promise of gains.损失的威胁如何使人们探索得更多,而非收益的承诺。
Psychon Bull Rev. 2017 Jun;24(3):708-720. doi: 10.3758/s13423-016-1158-7.
2
An improved cognitive model of the Iowa and Soochow Gambling Tasks with regard to model fitting performance and tests of parameter consistency.考虑模型拟合性能和参数一致性检验的爱荷华和苏州赌博任务的改进认知模型。
Front Psychol. 2015 Mar 12;6:229. doi: 10.3389/fpsyg.2015.00229. eCollection 2015.
3
A cognitive model of dynamic cooperation with varied interdependency information.
一个具有不同相互依赖信息的动态合作认知模型。
Cogn Sci. 2015 Apr;39(3):457-95. doi: 10.1111/cogs.12170. Epub 2014 Sep 23.
4
Cyber situation awareness: modeling detection of cyber attacks with instance-based learning theory.网络情境感知:基于实例学习理论的网络攻击检测建模。
Hum Factors. 2013 Jun;55(3):605-18. doi: 10.1177/0018720812464045.
5
The role of inertia in modeling decisions from experience with instance-based learning.惯性在基于实例学习的经验决策建模中的作用。
Front Psychol. 2012 Jun 6;3:177. doi: 10.3389/fpsyg.2012.00177. eCollection 2012.
6
Instance-based learning: integrating sampling and repeated decisions from experience.基于实例的学习:从经验中整合采样和重复决策。
Psychol Rev. 2011 Oct;118(4):523-51. doi: 10.1037/a0024558.
7
Comparison of decision learning models using the generalization criterion method.基于泛化准则方法的决策学习模型比较。
Cogn Sci. 2008 Dec;32(8):1376-402. doi: 10.1080/03640210802352992.
8
Mental models of flash floods and landslides.山洪暴发和山体滑坡的心智模型。
Risk Anal. 2007 Jun;27(3):671-82. doi: 10.1111/j.1539-6924.2007.00916.x.
9
Cortical substrates for exploratory decisions in humans.人类探索性决策的皮质基础。
Nature. 2006 Jun 15;441(7095):876-9. doi: 10.1038/nature04766.
10
Using cognitive models to map relations between neuropsychological disorders and human decision-making deficits.运用认知模型来描绘神经心理障碍与人类决策缺陷之间的关系。
Psychol Sci. 2005 Dec;16(12):973-8. doi: 10.1111/j.1467-9280.2005.01646.x.