Suppr超能文献

作为频谱斜率函数的元音识别中,共振峰峰值的最小频谱对比度。

Minimal spectral contrast of formant peaks for vowel recognition as a function of spectral slope.

作者信息

Lea A P, Summerfield Q

机构信息

ATR Human Information Processing Laboratories, Kyoto, Japan.

出版信息

Percept Psychophys. 1994 Oct;56(4):379-91. doi: 10.3758/bf03206730.

Abstract

In four experiments we investigated whether listeners can locate the formants of vowels not only from peaks, but also from spectral "shoulders"--features that give rise to zero crossings in the third, but not the first, differential of the excitation pattern--as hypothesized by Assmann and Summerfield (1989). Stimuli were steady-state approximations to the vowels [a, i, e, u, o] created by summing the first 45 harmonics of a fundamental of 100 Hz. Thirty-nine harmonics had equal amplitudes; the other 6 formed three pairs that were raised in level to define three "formants." An adaptive psychophysical procedure determined the minimal difference in level between the 6 harmonics and the remaining 39 at which the vowels were identifiably different from one another. These thresholds were measured through simulated communication channels, giving overall slopes of the excitation patterns of the five vowels that ranged from -1 dB/erb to + 2 dB/erb. Excitation patterns of the threshold stimuli were computed, and the locations of formants were estimated from zero crossings in the first and third differentials. With the more steeply sloping communication channels, some formants of some vowels were represented as shoulders rather than peaks, confirming the predictions of Assmann and Summerfield's models. We discuss the limitations of the excitation pattern model and the related issue of whether the location of formants can be computed from spectral shoulders in auditory analysis.

摘要

在四项实验中,我们研究了听众是否不仅能够根据峰值,还能根据频谱“肩部”来定位元音的共振峰——正如阿斯曼和萨默菲尔德(1989年)所假设的那样,这些特征会在激励模式的三阶微分而非一阶微分中产生过零点。刺激信号是通过对100赫兹基频的前45个谐波求和而产生的对元音[a、i、e、u、o]的稳态近似。39个谐波具有相等的幅度;另外6个谐波形成三对,其幅度升高以定义三个“共振峰”。一种自适应心理物理学程序确定了这6个谐波与其余39个谐波之间的最小电平差异,在该差异下元音彼此能够被明确区分。这些阈值是通过模拟通信通道测量的,得出五个元音激励模式的总体斜率范围为-1分贝/等效矩形带宽至+2分贝/等效矩形带宽。计算了阈值刺激的激励模式,并根据一阶和三阶微分中的过零点估计共振峰的位置。在斜率更陡的通信通道中,一些元音的一些共振峰表现为肩部而非峰值,这证实了阿斯曼和萨默菲尔德模型的预测。我们讨论了激励模式模型的局限性以及在听觉分析中是否可以从频谱肩部计算共振峰位置的相关问题。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验