Hou Kuinan, Zorzi Marco, Testolin Alberto
Department of General Psychology, University of Padova, Padua, Italy.
IRCCS San Camillo Hospital, Lido, VE, Italy.
Psychol Res. 2024 Dec 3;89(1):31. doi: 10.1007/s00426-024-02064-2.
Humans share with many animal species the ability to perceive and approximately represent the number of objects in visual scenes. This ability improves throughout childhood, suggesting that learning and development play a key role in shaping our number sense. This hypothesis is further supported by computational investigations based on deep learning, which have shown that numerosity perception can spontaneously emerge in neural networks that learn the statistical structure of images with a varying number of items. However, neural network models are usually trained using synthetic datasets that might not faithfully reflect the statistical structure of natural environments, and there is also growing interest in using more ecological visual stimuli to investigate numerosity perception in humans. In this work, we exploit recent advances in computer vision algorithms to design and implement an original pipeline that can be used to estimate the distribution of numerosity and non-numerical magnitudes in large-scale datasets containing thousands of real images depicting objects in daily life situations. We show that in natural visual scenes the frequency of appearance of different numerosities follows a power law distribution. Moreover, we show that the correlational structure for numerosity and continuous magnitudes is stable across datasets and scene types (homogeneous vs. heterogeneous object sets). We suggest that considering such "ecological" pattern of covariance is important to understand the influence of non-numerical visual cues on numerosity judgements.
人类与许多动物物种一样,具备感知并大致表征视觉场景中物体数量的能力。这种能力在整个童年时期不断提升,这表明学习和发展在塑造我们的数字感方面起着关键作用。基于深度学习的计算研究进一步支持了这一假设,这些研究表明,在学习具有不同数量物品的图像统计结构的神经网络中,数字感知能够自发出现。然而,神经网络模型通常使用可能无法如实反映自然环境统计结构的合成数据集进行训练,并且人们越来越有兴趣使用更具生态性的视觉刺激来研究人类的数字感知。在这项工作中,我们利用计算机视觉算法的最新进展,设计并实现了一个原创的流程,可用于估计大规模数据集中数字和非数字量的分布,这些数据集包含数千张描绘日常生活中物体的真实图像。我们表明,在自然视觉场景中,不同数字出现的频率遵循幂律分布。此外,我们表明,数字与连续量的相关结构在不同数据集和场景类型(同质与异质物体集)中是稳定的。我们认为,考虑这种“生态”协方差模式对于理解非数字视觉线索对数字判断的影响很重要。