Center for Neural Science at New York University, New York, New York 10003, USA.
J Neurosci. 2010 Jan 13;30(2):767-84. doi: 10.1523/JNEUROSCI.4170-09.2010.
The encoding of sound level is fundamental to auditory signal processing, and the temporal information present in amplitude modulation is crucial to the complex signals used for communication sounds, including human speech. The modulation transfer function, which measures the minimum detectable modulation depth across modulation frequency, has been shown to predict speech intelligibility performance in a range of adverse listening conditions and hearing impairments, and even for users of cochlear implants. We presented sinusoidal amplitude modulation (SAM) tones of varying modulation depths to awake macaque monkeys while measuring the responses of neurons in the auditory core. Using spike train classification methods, we found that thresholds for modulation depth detection and discrimination in the most sensitive units are comparable to psychophysical thresholds when precise temporal discharge patterns rather than average firing rates are considered. Moreover, spike timing information was also superior to average rate information when discriminating static pure tones varying in level but with similar envelopes. The limited utility of average firing rate information in many units also limited the utility of standard measures of sound level tuning, such as the rate level function (RLF), in predicting cortical responses to dynamic signals like SAM. Response modulation typically exceeded that predicted by the slope of the RLF by large factors. The decoupling of the cortical encoding of SAM and static tones indicates that enhancing the representation of acoustic contrast is a cardinal feature of the ascending auditory pathway.
声级编码是听觉信号处理的基础,幅度调制中存在的时间信息对于用于通信声音的复杂信号至关重要,包括人类语音。调制传递函数(modulation transfer function,MTF)可衡量调制频率下可检测到的最小调制深度,它已被证明可以预测各种不利听力条件和听力障碍下的言语可懂度表现,甚至对使用人工耳蜗的用户也有效。我们向清醒的猕猴呈现了不同调制深度的正弦幅度调制(sinusoidal amplitude modulation,SAM)音调,同时测量听觉核心中神经元的反应。使用尖峰序列分类方法,我们发现,在考虑精确的时间放电模式而不是平均放电率时,最敏感单元的调制深度检测和辨别阈值与心理物理阈值相当。此外,在辨别水平变化但包络相似的静态纯音时,尖峰时间信息也优于平均率信息。许多单元中平均放电率信息的有限效用也限制了标准的声级调谐测量(如率级函数)在预测像 SAM 这样的动态信号时对皮质反应的作用。响应调制通常超过 RLF 斜率预测的幅度大得多。SAM 和静态音调的皮质编码解耦表明,增强对声对比度的表示是听觉通路上行的一个主要特征。