Flachot Alban, Gegenfurtner Karl R
J Opt Soc Am A Opt Image Sci Vis. 2018 Apr 1;35(4):B334-B346. doi: 10.1364/JOSAA.35.00B334.
Deep convolutional neural networks are a class of machine-learning algorithms capable of solving non-trivial tasks, such as object recognition, with human-like performance. Little is known about the exact computations that deep neural networks learn, and to what extent these computations are similar to the ones performed by the primate brain. Here, we investigate how color information is processed in the different layers of the AlexNet deep neural network, originally trained on object classification of over 1.2M images of objects in their natural contexts. We found that the color-responsive units in the first layer of AlexNet learned linear features and were broadly tuned to two directions in color space, analogously to what is known of color responsive cells in the primate thalamus. Moreover, these directions are decorrelated and lead to statistically efficient representations, similar to the cardinal directions of the second-stage color mechanisms in primates. We also found, in analogy to the early stages of the primate visual system, that chromatic and achromatic information were segregated in the early layers of the network. Units in the higher layers of AlexNet exhibit on average a lower responsivity for color than units at earlier stages.
深度卷积神经网络是一类能够以类似人类的性能解决诸如目标识别等重要任务的机器学习算法。对于深度神经网络所学习的精确计算以及这些计算与灵长类大脑执行的计算在多大程度上相似,我们知之甚少。在这里,我们研究了在最初针对超过120万张自然场景中物体图像进行目标分类训练的AlexNet深度神经网络的不同层中,颜色信息是如何被处理的。我们发现,AlexNet第一层中的颜色响应单元学习线性特征,并在颜色空间中大致调整到两个方向,这类似于灵长类丘脑已知的颜色响应细胞。此外,这些方向是不相关的,并导致统计上有效的表示,类似于灵长类动物第二阶段颜色机制的主要方向。我们还发现,类似于灵长类视觉系统的早期阶段,网络的早期层中颜色和非颜色信息是分离的。AlexNet较高层中的单元平均而言对颜色的响应性低于早期阶段的单元。