Department of Psychology, The Ohio State University, Columbus, OH 43210, USA.
School of Computer Science and Engineering, Seoul National University, Seoul 151-742, Republic of Korea.
Cogn Psychol. 2021 Mar;125:101360. doi: 10.1016/j.cogpsych.2020.101360. Epub 2021 Jan 17.
Interest in computational modeling of cognition and behavior continues to grow. To be most productive, modelers should be equipped with tools that ensure optimal efficiency in data collection and in the integrity of inference about the phenomenon of interest. Traditionally, models in cognitive science have been parametric, which are particularly susceptible to model misspecification because their strong assumptions (e.g. parameterization, functional form) may introduce unjustified biases in data collection and inference. To address this issue, we propose a data-driven nonparametric framework for model development, one that also includes optimal experimental design as a goal. It combines Gaussian Processes, a stochastic process often used for regression and classification, with active learning, from machine learning, to iteratively fit the model and use it to optimize the design selection throughout the experiment. The approach, dubbed Gaussian process with active learning (GPAL), is an extension of the parametric, adaptive design optimization (ADO) framework (Cavagnaro, Myung, Pitt, & Kujala, 2010). We demonstrate the application and features of GPAL in a delay discounting task and compare its performance to ADO in two experiments. The results show that GPAL is a viable modeling framework that is noteworthy for its high sensitivity to individual differences, identifying novel patterns in the data that were missed by the model-constrained ADO. This investigation represents a first step towards the development of a data-driven cognitive modeling framework that serves as a middle ground between raw data, which can be difficult to interpret, and parametric models, which rely on strong assumptions.
人们对认知和行为的计算建模的兴趣持续增长。为了提高效率,建模者应该配备能够确保数据收集效率和感兴趣现象推断完整性的工具。传统上,认知科学中的模型是参数化的,由于其强假设(例如参数化、函数形式)可能会在数据收集和推断中引入不合理的偏差,因此特别容易出现模型误设。为了解决这个问题,我们提出了一种数据驱动的非参数模型开发框架,该框架还将最优实验设计作为目标。它将高斯过程(一种常用于回归和分类的随机过程)与机器学习中的主动学习相结合,以迭代地拟合模型,并利用它在整个实验过程中优化设计选择。这种方法被称为具有主动学习的高斯过程(GPAL),是参数自适应设计优化(ADO)框架(Cavagnaro、Myung、Pitt 和 Kujala,2010)的扩展。我们在延迟折扣任务中展示了 GPAL 的应用和特点,并在两个实验中将其性能与 ADO 进行了比较。结果表明,GPAL 是一种可行的建模框架,它对个体差异非常敏感,能够识别出模型约束的 ADO 错过的新数据模式。这项研究是朝着开发数据驱动的认知建模框架迈出的第一步,该框架在原始数据(难以解释)和参数模型(依赖强假设)之间提供了一个中间地带。