
Speech coding in the auditory nerve: I. Vowel-like sounds.

Author information

Delgutte B, Kiang N Y

Publication information

J Acoust Soc Am. 1984 Mar;75(3):866-78. doi: 10.1121/1.390596.

Abstract

Discharge patterns of auditory-nerve fibers in anesthetized cats were recorded in response to a set of nine steady-state, two-formant vowels presented at 60 and 75 dB SPL. The largest components in the discrete Fourier transforms of period histograms were almost always harmonics of the vowel fundamental frequency that were close to one of the formant frequencies, the fundamental frequency or the fiber characteristic frequency (CF). For any fiber, the position of its CF relative to the formant frequencies (F1 and F2) appears to determine which of these components dominates the response. Specifically, the response characteristics of the tonotopically arranged array of fibers can be described in terms of five CF regions: (1) a low-CF region below F1 in which the largest response components are the harmonics of the fundamental frequency closest to CF; (2) a region centered around CF = F1 in which the first formant and its harmonics are the largest components; (3) an intermediate region between F1 and F2 with prominent components at both the fiber CF and the fundamental frequency; (4) a region centered around CF = F2 in which harmonics close to the second formant are the largest for frequencies above the fundamental; and (5) a high-CF region in which response spectra tend to show broad, multiple peaks at the formant and fundamental frequencies. These CF regions are related to the phonetic descriptions of vowels. For example, the extent of the low-CF region is largest for "open" vowels (which have a high F1), and the intermediate region is distinct only for "spread" vowels for which F1 and F2 are more than 1.5-2 octaves apart. For all vowels, response activity for the majority of fibers is concentrated near the formant frequencies, in contrast to responses to broadband noise for which components near CF are dominant.
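The analysis named in the abstract, taking the discrete Fourier transform of a period histogram and looking for the largest harmonic of the fundamental, can be illustrated with a minimal sketch. This is not the authors' code. The fundamental frequency (125 Hz), the bin count, the synthetic spike train, and all function names are assumptions chosen only for illustration; the spike train is built to be phase-locked to the 4th harmonic so the result is easy to check.

```python
import cmath
import math

F0_HZ = 125.0   # hypothetical fundamental frequency, not taken from the paper
N_BINS = 64     # hypothetical histogram resolution (bins per period)


def period_histogram(spike_times, f0_hz, n_bins):
    """Fold spike times (seconds) onto one period of f0 and count spikes per bin."""
    period = 1.0 / f0_hz
    counts = [0] * n_bins
    for t in spike_times:
        phase = (t % period) / period  # position within the period, in [0, 1)
        counts[min(int(phase * n_bins), n_bins - 1)] += 1
    return counts


def harmonic_magnitudes(counts, n_harmonics):
    """DFT magnitudes for components 1..n_harmonics; component k lies at k * f0."""
    n = len(counts)
    mags = {}
    for k in range(1, n_harmonics + 1):
        coeff = sum(c * cmath.exp(-2j * math.pi * k * i / n)
                    for i, c in enumerate(counts))
        mags[k] = abs(coeff)
    return mags


def dominant_harmonic(counts, n_harmonics):
    """Harmonic number with the largest DFT magnitude (DC excluded)."""
    mags = harmonic_magnitudes(counts, n_harmonics)
    return max(mags, key=mags.get)


def synthetic_spikes(harmonic, f0_hz, n_bins, n_periods):
    """Deterministic toy spike train: spike count per bin follows a half-wave
    rectified sinusoid at the given harmonic of f0 (illustrative only)."""
    period = 1.0 / f0_hz
    spikes = []
    for p in range(n_periods):
        for i in range(n_bins):
            rate = max(0.0, math.sin(2 * math.pi * harmonic * (i + 0.5) / n_bins))
            # Place the spikes at the bin center to avoid bin-edge rounding.
            spikes.extend([(p + (i + 0.5) / n_bins) * period] * round(3 * rate))
    return spikes


spikes = synthetic_spikes(harmonic=4, f0_hz=F0_HZ, n_bins=N_BINS, n_periods=50)
counts = period_histogram(spikes, F0_HZ, N_BINS)
k = dominant_harmonic(counts, n_harmonics=10)
print(f"largest component: harmonic {k} at {k * F0_HZ:.0f} Hz")
```

In the terms of the abstract, the frequency of the dominant harmonic would then be compared with F1, F2, the fundamental, and the fiber CF to decide which of them the response follows. The sketch stops short of that step because the abstract does not give numerical boundaries for the five CF regions.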

