Grill-Spector Kalanit, Weiner Kevin S, Gomez Jesse, Stigliani Anthony, Natu Vaidehi S
Department of Psychology, School of Medicine, Stanford University, Stanford, CA 94305, USA.
Stanford Neurosciences Institute, School of Medicine, Stanford University, Stanford, CA 94305, USA.
Interface Focus. 2018 Aug 6;8(4):20180013. doi: 10.1098/rsfs.2018.0013. Epub 2018 Jun 15.
A central goal in neuroscience is to understand how processing within the ventral visual stream enables rapid and robust perception and recognition. Recent neuroscientific discoveries have significantly advanced understanding of the function, structure and computations along the ventral visual stream that serve as the infrastructure supporting this behaviour. In parallel, significant advances in computational models, such as hierarchical deep neural networks (DNNs), have brought machine performance to a level that is commensurate with human performance. Here, we propose a new framework using the ventral face network as a model system to illustrate how increasing the neural accuracy of present DNNs may allow researchers to test the computational benefits of the functional architecture of the human brain. Thus, the review (i) considers specific neural implementational features of the ventral face network, (ii) describes similarities and differences between the functional architecture of the brain and DNNs, and (iii) provides a hypothesis for the computational value of implementational features within the brain that may improve DNN performance. Importantly, this new framework promotes the incorporation of neuroscientific findings into DNNs in order to test the computational benefits of fundamental organizational features of the visual system.
神经科学的一个核心目标是了解腹侧视觉通路中的信息处理如何实现快速且可靠的感知与识别。近期的神经科学发现显著推动了对腹侧视觉通路的功能、结构及计算的理解,而腹侧视觉通路正是支持这一行为的基础架构。与此同时,诸如分层深度神经网络(DNN)等计算模型取得了重大进展,使机器性能达到了与人类性能相当的水平。在此,我们提出一个以腹侧面部网络为模型系统的新框架,以说明提高当前DNN的神经精确性如何能让研究人员测试人类大脑功能架构的计算优势。因此,本综述(i)考虑腹侧面部网络的特定神经实现特征,(ii)描述大脑和DNN功能架构之间的异同,(iii)为大脑内可能提高DNN性能的实现特征的计算价值提供一个假设。重要的是,这个新框架促进将神经科学研究结果纳入DNN,以便测试视觉系统基本组织特征的计算优势。