Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL 60208, USA.
J Neurosci. 2012 May 9;32(19):6542-9. doi: 10.1523/JNEUROSCI.5732-11.2012.
Natural sounds are characterized by complex patterns of sound intensity distributed across both frequency (spectral modulation) and time (temporal modulation). Perception of these patterns has been proposed to depend on a bank of modulation filters, each tuned to a unique combination of a spectral and a temporal modulation frequency. There is considerable physiological evidence for such combined spectrotemporal tuning. However, direct behavioral evidence is lacking. Here we examined the processing of spectrotemporal modulation behaviorally using a perceptual-learning paradigm. We trained human listeners for ∼1 h/d for 7 d to discriminate the depth of spectral (0.5 cyc/oct; 0 Hz), temporal (0 cyc/oct; 32 Hz), or upward spectrotemporal (0.5 cyc/oct; 32 Hz) modulation. Each trained group learned more on their respective trained condition than did controls who received no training. Critically, this depth-discrimination learning did not generalize to the trained stimuli of the other groups or to downward spectrotemporal (0.5 cyc/oct; -32 Hz) modulation. Learning on discrimination also led to worsening on modulation detection, but only when the same spectrotemporal modulation was used for both tasks. Thus, these influences of training were specific to the trained combination of spectral and temporal modulation frequencies, even when the trained and untrained stimuli had one modulation frequency in common. This specificity indicates that training modified circuitry that had combined spectrotemporal tuning, and therefore that circuits with such tuning can influence perception. These results are consistent with the possibility that the auditory system analyzes sounds through filters tuned to combined spectrotemporal modulation.
自然声音的特点是声音强度在频率(频谱调制)和时间(时间调制)上分布复杂的模式。这些模式的感知被认为依赖于一组调制滤波器,每个滤波器调谐到独特的频谱和时间调制频率组合。有大量的生理证据表明存在这种联合的频谱时间调谐。然而,直接的行为证据是缺乏的。在这里,我们使用感知学习范式在行为上研究了频谱时间调制的处理。我们训练人类听众每天约 1 小时,持续 7 天,以区分频谱(0.5 个周/倍频程;0 Hz)、时间(0 个周/倍频程;32 Hz)或向上频谱时间调制(0.5 个周/倍频程;32 Hz)的深度。每个训练组在各自的训练条件下的学习都比没有接受训练的对照组多。关键是,这种深度辨别学习并没有推广到其他组的训练刺激或向下频谱时间调制(0.5 个周/倍频程;-32 Hz)。辨别学习也导致调制检测恶化,但仅当两个任务使用相同的频谱时间调制时才会出现这种情况。因此,这种训练的影响是针对特定的频谱和时间调制频率的组合,即使训练和未训练的刺激具有一个调制频率。这种特异性表明,训练改变了具有联合频谱时间调谐的电路,因此具有这种调谐的电路可以影响感知。这些结果与听觉系统通过调谐到联合频谱时间调制的滤波器来分析声音的可能性一致。