de la Torre-Ortiz Carlos, Spapé Michiel, Ruotsalo Tuukka
Department of Computer Science, University of Helsinki, Helsinki, Finland.
Department of Psychology & Logopedics, University of Helsinki, Helsinki, Finland.
Psychophysiology. 2023 May;60(5):e14225. doi: 10.1111/psyp.14225. Epub 2023 Mar 3.
Visual recognition requires inferring the similarity between a perceived object and a mental target. However, a measure of similarity is difficult to determine when it comes to complex stimuli such as faces. Indeed, people may notice that someone "looks like" a familiar face, but find it hard to describe which features the comparison is based on. Previous work shows that the number of similar visual elements between a face pictogram and a memorized target correlates with the P300 amplitude in the visual evoked potential. Here, we redefine similarity as the distance inferred from a latent space learned using a state-of-the-art generative adversarial neural network (GAN). A rapid serial visual presentation experiment was conducted with oddball images generated at varying distances from the target to determine how P300 amplitude related to GAN-derived distances. The results showed that distance-to-target was monotonically related to the P300, indicating that perceptual identification was associated with smooth, drifting image similarity. Furthermore, regression modeling indicated that while the P3a and P3b sub-components had distinct responses in location, time, and amplitude, they were similarly related to target distance. The work demonstrates that the P300 indexes the distance between perceived and target image in smooth, natural, and complex visual stimuli and shows that GANs present a novel modeling methodology for studying the relationships between stimuli, perception, and recognition.
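As a rough illustration of what "oddball images generated at varying distances from the target" could mean in a latent space, the following minimal sketch places latent vectors at chosen distances from a target vector. It is not the authors' procedure: the Euclidean metric, the 512-dimensional latent space, the distance values, and the function names `latent_distance` and `sample_at_distance` are all assumptions made for illustration, and no GAN generator is invoked.

```python
import numpy as np

def latent_distance(z_a, z_b):
    """Euclidean distance between two latent vectors (assumed metric)."""
    return float(np.linalg.norm(np.asarray(z_a) - np.asarray(z_b)))

def sample_at_distance(z_target, distance, rng):
    """Return a latent vector exactly `distance` from z_target, in a random direction."""
    direction = rng.standard_normal(z_target.shape)
    direction /= np.linalg.norm(direction)  # unit-length direction
    return z_target + distance * direction

rng = np.random.default_rng(0)
z_target = rng.standard_normal(512)   # hypothetical target latent vector
distances = [0.5, 1.0, 2.0, 4.0]      # hypothetical distance levels
oddballs = [sample_at_distance(z_target, d, rng) for d in distances]

# Each oddball sits at its requested distance from the target.
recovered = [latent_distance(z_target, z) for z in oddballs]
```

In an actual experiment, each latent vector would be passed through a trained generator to produce an image, and the per-trial distance would serve as the predictor for P300 amplitude in the regression.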