Wiesner Steven, Ghimire Bikalpa, Huang Xin
bioRxiv. 2025 Sep 11:2025.09.11.675659. doi: 10.1101/2025.09.11.675659.
Segregating objects from one another and the background is essential for scene understanding, object recognition, and visually guided action. In natural scenes, it is common to encounter spatially separated stimuli, such as distinct figure-ground regions, adjacent objects, and partial occlusions. Neurons in mid- and high-level visual cortex have large receptive fields (RFs) that often encompass multiple, spatially separated stimuli. It is unclear how neurons represent and segregate these stimuli within their RFs. We investigated this question by recording the neuronal responses in the middle temporal (MT) cortex from two male macaque monkeys to multiple moving stimuli. We placed a motion border between two spatially separated random-dot patches within the RFs that moved in two different directions. We varied the vector average direction of the stimuli to characterize the full direction tuning curves. Across motion directions, responses to multiple stimuli were systematically biased toward the stimulus located at the neuron's more-preferred RF subregion. The sign and magnitude of this spatial-location bias were correlated with the neuron's spatial preference measured with single patches presented in isolation. We demonstrated that neuronal responses to multiple stimuli can be captured by an extended divisive normalization model, as a sum of the responses elicited by individual stimuli, weighted by the neuron's spatial preference. We also proposed a circuit implementation for the extended normalization model. Our results indicate that MT leverages spatial selectivity within the RFs to represent spatially separated moving stimuli. The spatial-location bias in neuronal responses enables individual components of multiple stimuli to be represented by a population of neurons with heterogeneous spatial preferences, providing a neural substrate for segregating multiple visual stimuli.
将物体彼此之间以及与背景区分开来对于场景理解、物体识别和视觉引导行动至关重要。在自然场景中,经常会遇到空间上分离的刺激,例如不同的图形-背景区域、相邻物体和部分遮挡。中高级视觉皮层中的神经元具有大的感受野(RFs),通常包含多个空间上分离的刺激。目前尚不清楚神经元如何在其感受野内表征和分离这些刺激。我们通过记录两只雄性猕猴颞中(MT)皮层对多个移动刺激的神经元反应来研究这个问题。我们在感受野内两个空间上分离的随机点斑块之间放置了一个运动边界,这两个斑块向两个不同方向移动。我们改变刺激的矢量平均方向以表征完整的方向调谐曲线。在不同的运动方向上,对多个刺激的反应系统地偏向位于神经元更偏好的感受野子区域的刺激。这种空间位置偏差的符号和大小与用单独呈现的单个斑块测量的神经元空间偏好相关。我们证明,对多个刺激的神经元反应可以通过扩展的除法归一化模型来捕获,即作为单个刺激引发的反应之和,并由神经元的空间偏好加权。我们还为扩展归一化模型提出了一种电路实现方式。我们的结果表明,MT利用感受野内的空间选择性来表征空间上分离的移动刺激。神经元反应中的空间位置偏差使得多个刺激的各个成分能够由具有异质空间偏好的神经元群体来表征,为分离多个视觉刺激提供了神经基础。