School of Psychological Science, University of Bristol, Bristol, UK
Ernst Strüngmann Institute (ESI) for Neuroscience in Cooperation with Max Planck Society, Frankfurt am Main, Germany
Behav Brain Sci. 2022 Dec 1;46:e385. doi: 10.1017/S0140525X22002813.
Deep neural networks (DNNs) have had extraordinary successes in classifying photographic images of objects and are often described as the best models of biological vision. This conclusion is largely based on three sets of findings: (1) DNNs are more accurate than any other model in classifying images taken from various datasets, (2) DNNs do the best job in predicting the pattern of human errors in classifying objects taken from various behavioral datasets, and (3) DNNs do the best job in predicting brain signals in response to images taken from various brain datasets (e.g., single cell responses or fMRI data). However, these behavioral and brain datasets do not test hypotheses regarding what features are contributing to good predictions and we show that the predictions may be mediated by DNNs that share little overlap with biological vision. More problematically, we show that DNNs account for almost no results from psychological research. This contradicts the common claim that DNNs are good, let alone the best, models of human object recognition. We argue that theorists interested in developing biologically plausible models of human vision need to direct their attention to explaining psychological findings. More generally, theorists need to build models that explain the results of experiments that manipulate independent variables designed to test hypotheses rather than compete on making the best predictions. We conclude by briefly summarizing various promising modeling approaches that focus on psychological data.
深度神经网络 (DNNs) 在对物体的摄影图像进行分类方面取得了非凡的成功,常被描述为生物视觉的最佳模型。这一结论主要基于三组发现:(1) DNN 在对来自不同数据集的图像进行分类方面比任何其他模型都更准确,(2) DNN 在预测来自不同行为数据集的物体分类中人类错误模式方面表现最佳,以及 (3) DNN 在预测对来自不同大脑数据集(例如单细胞反应或 fMRI 数据)的图像的大脑信号方面表现最佳。然而,这些行为和大脑数据集并没有检验有助于良好预测的特征是什么的假设,我们表明这些预测可能是由与生物视觉重叠很少的 DNN 介导的。更成问题的是,我们表明 DNN 几乎无法解释心理学研究的结果。这与 DNN 是人类物体识别的良好模型,更不用说最佳模型的普遍说法相矛盾。我们认为,有兴趣开发人类视觉生物合理性模型的理论家需要将注意力集中在解释心理学发现上。更一般地说,理论家需要构建能够解释操纵旨在检验假设的自变量的实验结果的模型,而不是在做出最佳预测方面进行竞争。最后,我们简要总结了各种关注心理数据的有前途的建模方法。