Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA.
McGovern Institute, Massachusetts Institute of Technology, Cambridge, MA, USA.
Nat Neurosci. 2023 Nov;26(11):2017-2034. doi: 10.1038/s41593-023-01442-0. Epub 2023 Oct 16.
Deep neural network models of sensory systems are often proposed to learn representational transformations with invariances like those in the brain. To reveal these invariances, we generated 'model metamers', stimuli whose activations within a model stage are matched to those of a natural stimulus. Metamers for state-of-the-art supervised and unsupervised neural network models of vision and audition were often completely unrecognizable to humans when generated from late model stages, suggesting differences between model and human invariances. Targeted model changes improved human recognizability of model metamers but did not eliminate the overall human-model discrepancy. The human recognizability of a model's metamers was well predicted by their recognizability by other models, suggesting that models contain idiosyncratic invariances in addition to those required by the task. Metamer recognizability dissociated from both traditional brain-based benchmarks and adversarial vulnerability, revealing a distinct failure mode of existing sensory models and providing a complementary benchmark for model assessment.
用于学习具有大脑中类似不变性的表示变换的感官系统深度神经网络模型,通常会生成“模型同型物”,这些刺激物在模型阶段内的激活与自然刺激的激活相匹配。当从模型的后期阶段生成用于视觉和听觉的最先进的监督和无监督神经网络模型的同型物时,它们通常对人类来说是完全无法识别的,这表明模型和人类之间存在不变性差异。有针对性的模型改变提高了人类对模型同型物的可识别性,但并没有消除整体的人类-模型差异。模型同型物的人类可识别性可以很好地预测其他模型的可识别性,这表明模型除了任务要求的不变性之外还包含特定的不变性。同型物的可识别性与传统基于大脑的基准和对抗脆弱性分离,揭示了现有感官模型的独特失败模式,并为模型评估提供了一个补充基准。