Parida Satyabrata, Bharadwaj Hari, Heinz Michael G
Weldon School of Biomedical Engineering, Purdue University, West Lafayette, Indiana, United States of America.
Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana, United States of America.
PLoS Comput Biol. 2021 Feb 22;17(2):e1008155. doi: 10.1371/journal.pcbi.1008155. eCollection 2021 Feb.
Significant scientific and translational questions remain in auditory neuroscience surrounding the neural correlates of perception. Relating perceptual and neural data collected from humans can be useful; however, human-based neural data are typically limited to evoked far-field responses, which lack anatomical and physiological specificity. Laboratory-controlled preclinical animal models offer the advantage of comparing single-unit and evoked responses from the same animals. This ability provides opportunities to develop invaluable insight into proper interpretations of evoked responses, which benefits both basic-science studies of neural mechanisms and translational applications, e.g., diagnostic development. However, these comparisons have been limited by a disconnect between the types of spectrotemporal analyses used with single-unit spike trains and evoked responses, which results because these response types are fundamentally different (point-process versus continuous-valued signals) even though the responses themselves are related. Here, we describe a unifying framework to study temporal coding of complex sounds that allows spike-train and evoked-response data to be analyzed and compared using the same advanced signal-processing techniques. The framework uses a set of peristimulus-time histograms computed from single-unit spike trains in response to polarity-alternating stimuli to allow advanced spectral analyses of both slow (envelope) and rapid (temporal fine structure) response components. Demonstrated benefits include: (1) novel spectrally specific temporal-coding measures that are less confounded by distortions due to hair-cell transduction, synaptic rectification, and neural stochasticity compared to previous metrics, e.g., the correlogram peak-height, (2) spectrally specific analyses of spike-train modulation coding (magnitude and phase), which can be directly compared to modern perceptually based models of speech intelligibility (e.g., that depend on modulation filter banks), and (3) superior spectral resolution in analyzing the neural representation of nonstationary sounds, such as speech and music. This unifying framework significantly expands the potential of preclinical animal models to advance our understanding of the physiological correlates of perceptual deficits in real-world listening following sensorineural hearing loss.
在听觉神经科学领域,围绕感知的神经关联仍存在重大的科学和转化问题。将从人类收集的感知数据与神经数据相关联可能会有所帮助;然而,基于人类的神经数据通常仅限于诱发的远场反应,而这种反应缺乏解剖学和生理学特异性。实验室控制的临床前动物模型具有比较同一动物的单单元反应和诱发反应的优势。这种能力为深入了解诱发反应的正确解释提供了机会,这对神经机制的基础科学研究和转化应用(如诊断开发)都有益处。然而,这些比较受到单单元尖峰序列和诱发反应所使用的频谱时间分析类型之间脱节的限制,之所以出现这种脱节,是因为尽管这些反应本身相关,但这些反应类型本质上不同(点过程与连续值信号)。在这里,我们描述了一个统一的框架来研究复杂声音的时间编码,该框架允许使用相同的先进信号处理技术来分析和比较尖峰序列数据和诱发反应数据。该框架使用一组根据单单元尖峰序列对极性交替刺激的响应计算得出的刺激时间直方图,以便对缓慢(包络)和快速(时间精细结构)反应成分进行先进的频谱分析。已证明的优势包括:(1)与先前的指标(如互相关图峰值高度)相比,新的频谱特异性时间编码测量方法受毛细胞转导、突触整流和神经随机性引起的失真影响较小;(2)对尖峰序列调制编码(幅度和相位)的频谱特异性分析,可直接与基于现代感知的语音可懂度模型(如依赖调制滤波器组的模型)进行比较;(3)在分析非平稳声音(如语音和音乐)的神经表征时具有更高的频谱分辨率。这个统一的框架显著扩展了临床前动物模型的潜力,有助于我们进一步理解感音神经性听力损失后在现实世界听力中感知缺陷的生理关联。