Miller M I, Sachs M B
Hear Res. 1984 Jun;14(3):257-79. doi: 10.1016/0378-5955(84)90054-6.
Responses of populations of auditory-nerve fibers were measured for synthesized consonant-vowel stimuli. This paper explores the encoding of fundamental frequency (pitch) in these responses. Post-stimulus time (PST) histograms were computed from 25 ms segments of the spike trains. Discrete Fourier transforms with a 40 Hz resolution were computed from the histograms. Two representations of pitch are considered. The first representation is based on the pitch-related temporal properties of the speech signal. Histograms for individual units can show envelope modulations directly related to the pitch period. These modulations reflect the responses of these fibers to a number of stimulus harmonics near fiber CF. Responses of fibers near formant frequencies are dominated by a single large harmonic component, and thus show small or no pitch-related enveloped modulations. Envelope modulations are reduced in the presence of background noise. The second representation uses both temporal properties of auditory-nerve responses and cochlear place to encode the pitch-related harmonic structure of speech. As a measure of the response of the population of fibers to each harmonic of 40 Hz the magnitude of the component of the Fourier transform at that frequency was averaged across all fibers whose characteristic frequencies were within one-fourth octave of that harmonic. We call this measure the average localized synchronized rate (ALSR). The ALSR provides a good representation of stimulus spectrum, even in the presence of background noise. From the harmonic structure of the ALSR, we are able to extract the stimulus pitch frequency. The relationship of these two representations to pitch perception in both acoustic and electrical stimulation (via cochlear implants) is discussed.
针对合成的辅音 - 元音刺激,测量了听觉神经纤维群体的反应。本文探讨了这些反应中基频(音高)的编码。从尖峰序列的25毫秒片段计算出刺激后时间(PST)直方图。从这些直方图计算出分辨率为40赫兹的离散傅里叶变换。考虑了两种音高表示。第一种表示基于语音信号与音高相关的时间特性。单个单元的直方图可以直接显示与音高周期相关的包络调制。这些调制反映了这些纤维对纤维CF附近多个刺激谐波的反应。共振峰频率附近纤维的反应由单个大谐波分量主导,因此显示出小的或没有与音高相关的包络调制。在存在背景噪声的情况下,包络调制会降低。第二种表示使用听觉神经反应的时间特性和耳蜗位置来编码语音与音高相关的谐波结构。作为纤维群体对40赫兹每个谐波反应的一种度量,在所有特征频率在该谐波的四分之一倍频程内的纤维上,对该频率处傅里叶变换分量的幅度进行平均。我们将此度量称为平均局部同步率(ALSR)。即使在存在背景噪声的情况下,ALSR也能很好地表示刺激频谱。从ALSR的谐波结构中,我们能够提取刺激音高频率。讨论了这两种表示与声学和电刺激(通过人工耳蜗)中音高感知的关系。