人类与猴子物体识别行为的比较

Comparison of Object Recognition Behavior in Human and Monkey.

作者信息

Rajalingham Rishi, Schmidt Kailyn, DiCarlo James J

机构信息

Department of Brain and Cognitive Sciences and.

McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139.

出版信息

J Neurosci. 2015 Sep 2;35(35):12127-36. doi: 10.1523/JNEUROSCI.0573-15.2015.

DOI:10.1523/JNEUROSCI.0573-15.2015

PMID:26338324

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4556783/

Abstract

UNLABELLED

Although the rhesus monkey is used widely as an animal model of human visual processing, it is not known whether invariant visual object recognition behavior is quantitatively comparable across monkeys and humans. To address this question, we systematically compared the core object recognition behavior of two monkeys with that of human subjects. To test true object recognition behavior (rather than image matching), we generated several thousand naturalistic synthetic images of 24 basic-level objects with high variation in viewing parameters and image background. Monkeys were trained to perform binary object recognition tasks on a match-to-sample paradigm. Data from 605 human subjects performing the same tasks on Mechanical Turk were aggregated to characterize "pooled human" object recognition behavior, as well as 33 separate Mechanical Turk subjects to characterize individual human subject behavior. Our results show that monkeys learn each new object in a few days, after which they not only match mean human performance but show a pattern of object confusion that is highly correlated with pooled human confusion patterns and is statistically indistinguishable from individual human subjects. Importantly, this shared human and monkey pattern of 3D object confusion is not shared with low-level visual representations (pixels, V1+; models of the retina and primary visual cortex) but is shared with a state-of-the-art computer vision feature representation. Together, these results are consistent with the hypothesis that rhesus monkeys and humans share a common neural shape representation that directly supports object perception.

SIGNIFICANCE STATEMENT

To date, several mammalian species have shown promise as animal models for studying the neural mechanisms underlying high-level visual processing in humans. In light of this diversity, making tight comparisons between nonhuman and human primates is particularly critical in determining the best use of nonhuman primates to further the goal of the field of translating knowledge gained from animal models to humans. To the best of our knowledge, this study is the first systematic attempt at comparing a high-level visual behavior of humans and macaque monkeys.

摘要

未标注

尽管恒河猴被广泛用作人类视觉处理的动物模型，但尚不清楚不变的视觉对象识别行为在猴子和人类之间是否在数量上具有可比性。为了解决这个问题，我们系统地比较了两只猴子与人类受试者的核心对象识别行为。为了测试真正的对象识别行为（而不是图像匹配），我们生成了数千张24个基本层级对象的自然合成图像，这些图像在观察参数和图像背景方面具有高度变化。猴子被训练在匹配样本范式上执行二元对象识别任务。汇总了605名在亚马逊土耳其机器人平台上执行相同任务的人类受试者的数据，以表征“总体人类”对象识别行为，以及33名单独的亚马逊土耳其机器人平台受试者的数据，以表征个体人类受试者行为。我们的结果表明，猴子在几天内就能学会每个新对象，之后它们不仅能达到人类的平均表现，还表现出一种对象混淆模式，这种模式与总体人类混淆模式高度相关，并且在统计学上与个体人类受试者没有区别。重要的是，这种人类和猴子共有的3D对象混淆模式与低级视觉表征（像素、V1+；视网膜和初级视觉皮层模型）不同，但与一种先进的计算机视觉特征表征相同。总之，这些结果与恒河猴和人类共享一种直接支持对象感知的共同神经形状表征这一假设一致。

意义声明

迄今为止，几种哺乳动物物种已显示出有望作为研究人类高级视觉处理潜在神经机制的动物模型。鉴于这种多样性，在确定如何最好地利用非人类灵长类动物以推进将从动物模型中获得的知识转化为人类应用这一领域目标时，对非人类和人类灵长类动物进行严格比较尤为关键。据我们所知，本研究是首次系统尝试比较人类和猕猴的高级视觉行为。

相似文献

Comparison of Object Recognition Behavior in Human and Monkey.

J Neurosci. 2015 Sep 2;35(35):12127-36. doi: 10.1523/JNEUROSCI.0573-15.2015.

Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.

J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.

Unsupervised changes in core object recognition behavior are predicted by neural plasticity in inferior temporal cortex.

Elife. 2021 Jun 11;10:e60830. doi: 10.7554/eLife.60830.

Simple Learned Weighted Sums of Inferior Temporal Neuronal Firing Rates Accurately Predict Human Core Object Recognition Performance.

J Neurosci. 2015 Sep 30;35(39):13402-18. doi: 10.1523/JNEUROSCI.5181-14.2015.

How does the brain rapidly learn and reorganize view-invariant and position-invariant object representations in the inferotemporal cortex?

Neural Netw. 2011 Dec;24(10):1050-61. doi: 10.1016/j.neunet.2011.04.004. Epub 2011 Apr 22.

View-dependent object recognition by monkeys.

Curr Biol. 1994 May 1;4(5):401-14. doi: 10.1016/s0960-9822(00)00089-0.

Common Object Representations for Visual Production and Recognition.

Cogn Sci. 2018 Nov;42(8):2670-2698. doi: 10.1111/cogs.12676. Epub 2018 Aug 20.

Relating Visual Production and Recognition of Objects in Human Visual Cortex.

J Neurosci. 2020 Feb 19;40(8):1710-1721. doi: 10.1523/JNEUROSCI.1843-19.2019. Epub 2019 Dec 23.

The Dynamic Multisensory Engram: Neural Circuitry Underlying Crossmodal Object Recognition in Rats Changes with the Nature of Object Experience.

J Neurosci. 2016 Jan 27;36(4):1273-89. doi: 10.1523/JNEUROSCI.3043-15.2016.

Factorized visual representations in the primate visual system and deep neural networks.

Elife. 2024 Jul 5;13:RP91685. doi: 10.7554/eLife.91685.

引用本文的文献

Dimensions underlying the representational alignment of deep neural networks with humans.

Nat Mach Intell. 2025;7(6):848-859. doi: 10.1038/s42256-025-01041-7. Epub 2025 Jun 23.

Unraveling the complexity of rat object vision requires a full convolutional network and beyond.

Patterns (N Y). 2025 Jan 17;6(2):101149. doi: 10.1016/j.patter.2024.101149. eCollection 2025 Feb 14.

The Impact of Scene Context on Visual Object Recognition: Comparing Humans, Monkeys, and Computational Models.

bioRxiv. 2024 Jun 1:2024.05.27.596127. doi: 10.1101/2024.05.27.596127.

Morphine exposure modulates dimensional bias and set formation in anthropoids.

Addict Biol. 2024 Feb;29(2):e13380. doi: 10.1111/adb.13380.

How does the primate brain combine generative and discriminative computations in vision?

ArXiv. 2024 Jan 11:arXiv:2401.06005v1.

Unsupervised learning on spontaneous retinal activity leads to efficient neural representation geometry.

ArXiv. 2023 Dec 5:arXiv:2312.02791v1.

How well do rudimentary plasticity rules predict adult visual object learning?

PLoS Comput Biol. 2023 Dec 11;19(12):e1011713. doi: 10.1371/journal.pcbi.1011713. eCollection 2023 Dec.

Model metamers reveal divergent invariances between biological and artificial neural networks.

Nat Neurosci. 2023 Nov;26(11):2017-2034. doi: 10.1038/s41593-023-01442-0. Epub 2023 Oct 16.

Novel object recognition in Octopus maya.

Anim Cogn. 2023 Jun;26(3):1065-1072. doi: 10.1007/s10071-023-01753-6. Epub 2023 Feb 21.

Reevaluating the role of the hippocampus in memory: A meta-analysis of neurotoxic lesion studies in nonhuman primates.

Hippocampus. 2023 Jun;33(6):787-807. doi: 10.1002/hipo.23499. Epub 2023 Jan 17.

本文引用的文献

Deep neural networks rival the representation of primate IT cortex for core visual object recognition.

PLoS Comput Biol. 2014 Dec 18;10(12):e1003963. doi: 10.1371/journal.pcbi.1003963. eCollection 2014 Dec.

Neural substrates of view-invariant object recognition developed without experiencing rotations of the objects.

J Neurosci. 2014 Nov 5;34(45):15047-59. doi: 10.1523/JNEUROSCI.1898-14.2014.

Deep supervised, but not unsupervised, models may explain IT cortical representation.

PLoS Comput Biol. 2014 Nov 6;10(11):e1003915. doi: 10.1371/journal.pcbi.1003915. eCollection 2014 Nov.

Color-detection thresholds in rhesus macaque monkeys and humans.

J Vis. 2014 Jul 15;14(8):12. doi: 10.1167/14.8.12.

The functional architecture of the ventral temporal cortex and its role in categorization.

Nat Rev Neurosci. 2014 Aug;15(8):536-48. doi: 10.1038/nrn3747. Epub 2014 Jun 25.

Performance-optimized hierarchical models predict neural responses in higher visual cortex.

Proc Natl Acad Sci U S A. 2014 Jun 10;111(23):8619-24. doi: 10.1073/pnas.1403112111. Epub 2014 May 8.

Bridging the gap between the human and macaque connectome: a quantitative comparison of global interspecies structure-function relationships and network topology.

J Neurosci. 2014 Apr 16;34(16):5552-63. doi: 10.1523/JNEUROSCI.4229-13.2014.

Evaluating Amazon's Mechanical Turk as a tool for experimental behavioral research.

PLoS One. 2013;8(3):e57410. doi: 10.1371/journal.pone.0057410. Epub 2013 Mar 13.

How does the brain solve visual object recognition?

Neuron. 2012 Feb 9;73(3):415-34. doi: 10.1016/j.neuron.2012.01.010.

Interspecies activity correlations reveal functional correspondence between monkey and human brain areas.

Nat Methods. 2012 Feb 5;9(3):277-82. doi: 10.1038/nmeth.1868.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr
超能文献

人类与猴子物体识别行为的比较

Comparison of Object Recognition Behavior in Human and Monkey.

作者信息

机构信息

出版信息

UNLABELLED

SIGNIFICANCE STATEMENT

未标注

意义声明

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Suppr超能文献

人类与猴子物体识别行为的比较

Comparison of Object Recognition Behavior in Human and Monkey.

作者信息

机构信息

出版信息

UNLABELLED

SIGNIFICANCE STATEMENT

未标注

意义声明

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Suppr
超能文献