早期的低通滤波图像经验有助于神经网络模型中的视觉类别学习。

Early experience with low-pass filtered images facilitates visual category learning in a neural network model.

机构信息

Department of Psychology, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America.

Neuroscience Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America.

出版信息

PLoS One. 2023 Jan 6;18(1):e0280145. doi: 10.1371/journal.pone.0280145. eCollection 2023.

DOI:10.1371/journal.pone.0280145

PMID:36608003

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9821476/

Abstract

Humans are born with very low contrast sensitivity, meaning that inputs to the infant visual system are both blurry and low contrast. Is this solely a byproduct of maturational processes or is there a functional advantage for beginning life with poor visual acuity? We addressed the impact of poor vision during early learning by exploring whether reduced visual acuity facilitated the acquisition of basic-level categories in a convolutional neural network model (CNN), as well as whether any such benefit transferred to subordinate-level category learning. Using the ecoset dataset to simulate basic-level category learning, we manipulated model training curricula along three dimensions: presence of blurred inputs early in training, rate of blur reduction over time, and grayscale versus color inputs. First, a training regime where blur was initially high and was gradually reduced over time-as in human development-improved basic-level categorization performance in a CNN relative to a regime in which non-blurred inputs were used throughout training. Second, when basic-level models were fine-tuned on a task including both basic-level and subordinate-level categories (using the ImageNet dataset), models initially trained with blurred inputs showed a greater performance benefit as compared to models trained exclusively on non-blurred inputs, suggesting that the benefit of blurring generalized from basic-level to subordinate-level categorization. Third, analogous to the low sensitivity to color that infants experience during the first 4-6 months of development, these advantages were observed only when grayscale images were used as inputs. We conclude that poor visual acuity in human newborns may confer functional advantages, including, as demonstrated here, more rapid and accurate acquisition of visual object categories at multiple levels.

摘要

人类天生的对比度敏感度很低，这意味着婴儿的视觉系统输入既模糊又低对比度。这仅仅是成熟过程的副产品，还是生命开始时视力不佳有功能优势？我们通过探索在卷积神经网络模型（CNN）中，较差的视力是否会促进基本类别学习的过程，来解决早期学习中视力不佳的问题，以及这种优势是否会转移到下属类别学习。我们使用 ecoset 数据集来模拟基本类别学习，通过三个维度来操纵模型训练课程：早期训练中模糊输入的存在、随时间减少模糊的速度，以及灰度与彩色输入。首先，一种训练模式，其中模糊度最初很高，并且随着时间的推移逐渐降低，就像人类发育一样，与整个训练过程中使用非模糊输入的模式相比，提高了 CNN 中的基本类别分类性能。其次，当基本级别模型在包括基本级别和下属级别类别的任务上进行微调时（使用 ImageNet 数据集），与仅在非模糊输入上进行训练的模型相比，最初在模糊输入上进行训练的模型表现出更大的性能优势，这表明模糊输入的优势从基本级别分类推广到下属级别分类。第三，类似于婴儿在发育的前 4-6 个月对颜色的低敏感度，只有当灰度图像用作输入时，才会观察到这些优势。我们得出结论，人类新生儿的低视力可能会带来功能优势，包括，正如这里所展示的，在多个级别上更快、更准确地获得视觉对象类别。