Ibáñez-Berganza Miguel, Lucibello Carlo, Mariani Luca, Pezzulo Giovanni
IMT School for Advanced Studies, Lucca, Italy.
Istituto Italiano di Tecnologia, Napoli, Italy.
PLoS One. 2024 Jan 26;19(1):e0295054. doi: 10.1371/journal.pone.0295054. eCollection 2024.
Processing faces accurately and efficiently is a key capability of humans and other animals that engage in sophisticated social tasks. Recent studies reported a decoupled coding for faces in the primate inferotemporal cortex, with two separate neural populations coding for the geometric position of (texture-free) facial landmarks and for the image texture at fixed landmark positions, respectively. Here, we formally assess the efficiency of this decoupled coding by appealing to the information-theoretic notion of description length, which quantifies the amount of information that is saved when encoding novel facial images, with a given precision. We show that despite decoupled coding describes the facial images in terms of two sets of principal components (of landmark shape and image texture), it is more efficient (i.e., yields more information compression) than the encoding in terms of the image principal components only, which corresponds to the widely used eigenface method. The advantage of decoupled coding over eigenface coding increases with image resolution and is especially prominent when coding variants of training set images that only differ in facial expressions. Moreover, we demonstrate that decoupled coding entails better performance in three different tasks: the representation of facial images, the (daydream) sampling of novel facial images, and the recognition of facial identities and gender. In summary, our study provides a first principle perspective on the efficiency and accuracy of the decoupled coding of facial stimuli reported in the primate inferotemporal cortex.
准确而高效地处理面孔是人类和其他从事复杂社交任务的动物的一项关键能力。最近的研究报告称,灵长类动物颞下皮质对面孔存在解耦编码,分别有两个独立的神经群体,一个编码(无纹理的)面部标志点的几何位置,另一个编码固定标志点位置处的图像纹理。在这里,我们通过引入描述长度的信息论概念来正式评估这种解耦编码的效率,该概念量化了在以给定精度编码新的面部图像时所节省的信息量。我们表明,尽管解耦编码是根据两组主成分(标志点形状和图像纹理)来描述面部图像的,但它比仅根据图像主成分进行的编码更有效(即产生更多的信息压缩),后者对应于广泛使用的特征脸方法。解耦编码相对于特征脸编码的优势随着图像分辨率的提高而增加,并且在编码仅面部表情不同的训练集图像变体时尤为突出。此外,我们证明解耦编码在三个不同任务中表现更好:面部图像的表示、新面部图像的(白日梦式)采样以及面部身份和性别的识别。总之,我们的研究为灵长类动物颞下皮质中报道的面部刺激解耦编码的效率和准确性提供了一个第一性原理视角。