Perceptual compensation for speaker differences and for spectral-envelope distortion.

Authors

Watkins A J, Makin S J

Affiliation

Department of Psychology, Reading University, England.

Publication

J Acoust Soc Am. 1994 Sep;96(3):1263-82. doi: 10.1121/1.410275.

Abstract

This study asks whether perceptual mechanisms that compensate for the spectral-envelope distortion of transmission channels also contribute to compensation for speaker differences. Subjects identified test words that were played after a carrier sentence. In some conditions the carriers were synthesized with F1 in low- and high-frequency ranges and in others they were distorted by filters whose frequency response is the spectral envelope of one vowel minus the spectral envelope of another. The filter /ɪ/ minus /ɛ/ and its inverse were used. Test words were drawn from an /ɪtʃ/ to /ɛtʃ/ continuum. Carriers filtered by /ɪ/ minus /ɛ/ and its inverse give a phoneme boundary difference, indicating compensation for spectral envelope distortion. A phoneme boundary difference also occurs between carriers with F1 in low and high ranges, indicating compensation for speaker differences. Neither of these effects is reduced by playing the carrier backwards, even though a measurement of the perceived naturalness of carriers is sharply reduced by this manipulation. Analysis of carriers synthesized with low and high F1 showed that they have different long-term spectra, and subsequent experiments used time-stationary filters to alter this characteristic. The results showed that the long-term spectra of the carriers govern their influence on the identity of subsequent test sounds. However, measurements of perceptual confusions among the carriers and of perceived talker-differences between carriers revealed that other, time-varying factors are more important for voice identification.
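
The "one vowel's spectral envelope minus another's" filter described in the abstract can be illustrated with a small sketch. It is not the authors' procedure: the toy envelope model, the formant frequencies, the bandwidth, and all function names below are assumptions made for illustration. The sketch only shows that a difference filter expressed in dB reshapes one envelope into the other, and that its inverse undoes the change.

```python
import math

def toy_envelope_db(freqs_hz, formants_hz, bandwidth_hz=100.0):
    """Hypothetical vowel envelope in dB: a sum of simple resonance peaks.

    A stand-in for a measured spectral envelope; not taken from the paper.
    """
    env = []
    for f in freqs_hz:
        power = sum(1.0 / (1.0 + ((f - fc) / bandwidth_hz) ** 2)
                    for fc in formants_hz)
        env.append(10.0 * math.log10(power))
    return env

def difference_filter_db(env_a_db, env_b_db):
    """Filter gain in dB: envelope of vowel A minus envelope of vowel B."""
    return [a - b for a, b in zip(env_a_db, env_b_db)]

def invert_filter_db(filter_db):
    """The inverse filter simply negates the gain at every frequency."""
    return [-g for g in filter_db]

def apply_filter_db(spectrum_db, filter_db):
    """Filtering multiplies magnitudes, which is addition on a dB scale."""
    return [s + g for s, g in zip(spectrum_db, filter_db)]

# Frequency grid from 100 Hz to 4000 Hz in 100 Hz steps.
freqs = [100.0 * k for k in range(1, 41)]

# Illustrative formant values only (assumed, not from the paper).
env_a = toy_envelope_db(freqs, [400.0, 2000.0, 2550.0])  # vowel A
env_b = toy_envelope_db(freqs, [550.0, 1800.0, 2500.0])  # vowel B

filt = difference_filter_db(env_a, env_b)   # "A minus B"
inv = invert_filter_db(filt)                # "B minus A"

# Applying "A minus B" to B's envelope yields A's envelope.
shaped = apply_filter_db(env_b, filt)
# Applying the inverse afterwards restores the original envelope.
restored = apply_filter_db(shaped, inv)
```

Because such a filter is time-stationary, it shifts the long-term spectrum of whatever carrier it is applied to by the same gain curve, which is the property the abstract identifies as governing the carrier's influence on the following test word.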
