Rupp Kyle M, Hect Jasmine L, Harford Emily E, Holt Lori L, Ghuman Avniel Singh, Abel Taylor J
Department of Neurological Surgery, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America.
Department of Psychology, The University of Texas at Austin, Austin, Texas, United States of America.
bioRxiv. 2024 May 26:2024.05.24.595822. doi: 10.1101/2024.05.24.595822.
Efficient behavior is supported by humans' ability to rapidly recognize acoustically distinct sounds as members of a common category. Within auditory cortex, there are critical unanswered questions regarding the organization and dynamics of sound categorization. Here, we performed intracerebral recordings in the context of epilepsy surgery as 20 patient-participants listened to natural sounds. We built encoding models to predict neural responses using features of these sounds extracted from different layers within a sound-categorization deep neural network (DNN). This approach yielded highly accurate models of neural responses throughout auditory cortex. The complexity of a cortical site's representation (measured by the depth of the DNN layer that produced the best model) was closely related to its anatomical location, with shallow, middle, and deep layers of the DNN associated with core (primary auditory cortex), lateral belt, and parabelt regions, respectively. Smoothly varying gradients of representational complexity also existed within these regions, with complexity increasing along a posteromedial-to-anterolateral direction in core and lateral belt, and along posterior-to-anterior and dorsal-to-ventral dimensions in parabelt. When we estimated the time window over which each recording site integrates information, we found shorter integration windows in core relative to lateral belt and parabelt. Lastly, we found a relationship between the length of the integration window and the complexity of information processing within core (but not lateral belt or parabelt). These findings suggest hierarchies of timescales and processing complexity, and their interrelationship, represent a functional organizational principle of the auditory stream that underlies our perception of complex, abstract auditory information.
人类能够迅速将声学上不同的声音识别为同一类别中的成员,这有助于高效行为的产生。在听觉皮层中,关于声音分类的组织和动态存在一些关键的未解决问题。在这里,我们在癫痫手术的背景下进行了脑内记录,20名患者参与者聆听自然声音。我们构建了编码模型,使用从声音分类深度神经网络(DNN)的不同层中提取的这些声音的特征来预测神经反应。这种方法产生了整个听觉皮层神经反应的高度准确模型。皮层位点表征的复杂性(通过产生最佳模型的DNN层的深度来衡量)与其解剖位置密切相关,DNN的浅层、中层和深层分别与核心(初级听觉皮层)、外侧带和旁带区域相关。在这些区域内也存在表征复杂性的平滑变化梯度,在核心和外侧带中,复杂性沿后内侧到前外侧方向增加,在旁带中沿后到前和背到腹的维度增加。当我们估计每个记录位点整合信息的时间窗口时,我们发现核心区域的整合窗口比外侧带和旁带更短。最后,我们发现整合窗口的长度与核心区域内信息处理的复杂性之间存在关系(但在外侧带或旁带中不存在)。这些发现表明,时间尺度和处理复杂性的层次结构及其相互关系代表了听觉信息流的一种功能组织原则,它是我们对复杂抽象听觉信息感知的基础。