Bramley Neil R, Lagnado David A, Speekenbrink Maarten
Department of Experimental Psychology, University College London.
J Exp Psychol Learn Mem Cogn. 2015 May;41(3):708-31. doi: 10.1037/xlm0000061. Epub 2014 Oct 20.
Interacting with a system is key to uncovering its causal structure. A computational framework for interventional causal learning has been developed over the last decade, but how real causal learners might achieve or approximate the computations entailed by this framework is still poorly understood. Here we describe an interactive computer task in which participants were incentivized to learn the structure of probabilistic causal systems through free selection of multiple interventions. We develop models of participants' intervention choices and online structure judgments, using expected utility gain, probability gain, and information gain and introducing plausible memory and processing constraints. We find that successful participants are best described by a model that acts to maximize information (rather than expected score or probability of being correct); that forgets much of the evidence received in earlier trials; but that mitigates this by being conservative, preferring structures consistent with earlier stated beliefs. We explore 2 heuristics that partly explain how participants might be approximating these models without explicitly representing or updating a hypothesis space.
与系统交互是揭示其因果结构的关键。在过去十年中,已经开发了一个用于干预性因果学习的计算框架,但真实的因果学习者如何实现或近似该框架所涉及的计算,目前仍知之甚少。在此,我们描述了一项交互式计算机任务,在该任务中,参与者通过自由选择多种干预措施来学习概率因果系统的结构。我们使用预期效用增益、概率增益和信息增益,并引入合理的记忆和处理约束,开发了参与者干预选择和在线结构判断的模型。我们发现,成功的参与者最好用一个旨在最大化信息(而不是预期分数或正确概率)的模型来描述;该模型会遗忘早期试验中收到的许多证据;但通过保持保守来减轻这种情况,即倾向于与早期陈述的信念一致的结构。我们探索了两种启发式方法,它们部分解释了参与者如何在不明确表示或更新假设空间的情况下近似这些模型。