• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

言语产生与感知之间的协同作用。

The synergy between speech production and perception.

作者信息

Ru Powen, Chi Taishih, Shamma Shihab

机构信息

Center for Auditory and Acoustics Research, Institute for Systems Research, Electrical and Computer Engineering Department, University of Maryland, College Park, Maryland 20742, USA.

出版信息

J Acoust Soc Am. 2003 Jan;113(1):498-515. doi: 10.1121/1.1525288.

DOI:10.1121/1.1525288
PMID:12558287
Abstract

Speech intelligibility is known to be relatively unaffected by certain deformations of the acoustic spectrum. These include translations, stretching or contracting dilations, and shearing of the spectrum (represented along the logarithmic frequency axis). It is argued here that such robustness reflects a synergy between vocal production and auditory perception. Thus, on the one hand, it is shown that these spectral distortions are produced by common and unavoidable variations among different speakers pertaining to the length, cross-sectional profile, and losses of their vocal tracts. On the other hand, it is argued that these spectral changes leave the auditory cortical representation of the spectrum largely unchanged except for translations along one of its representational axes. These assertions are supported by analyses of production and perception models. On the production side, a simplified sinusoidal model of the vocal tract is developed which analytically relates a few "articulatory" parameters, such as the extent and location of the vocal tract constriction, to the spectral peaks of the acoustic spectra synthesized from it. The model is evaluated by comparing the identification of synthesized sustained vowels to labeled natural vowels extracted from the TIMIT corpus. On the perception side a "multiscale" model of sound processing is utilized to elucidate the effects of the deformations on the representation of the acoustic spectrum in the primary auditory cortex. Finally, the implications of these results for the perception of generally identifiable classes of sound sources beyond the specific case of speech and the vocal tract are discussed.

摘要

众所周知,语音清晰度相对不受声谱某些变形的影响。这些变形包括平移、拉伸或收缩扩张以及声谱的剪切(沿对数频率轴表示)。本文认为,这种鲁棒性反映了发声产生与听觉感知之间的协同作用。因此,一方面,研究表明这些频谱失真由不同说话者之间与声道长度、横截面轮廓及其损耗相关的常见且不可避免的变化所产生。另一方面,有人认为这些频谱变化除了沿其表示轴之一的平移外,在很大程度上不会改变频谱在听觉皮层中的表示。这些断言得到了对产生和感知模型的分析的支持。在发声产生方面,开发了一种简化的声道正弦模型,该模型分析性地将一些“发音”参数(如声道收缩的程度和位置)与由其合成的声谱的频谱峰值联系起来。通过将合成的持续元音的识别与从TIMIT语料库中提取的带标签的自然元音进行比较来评估该模型。在感知方面,利用一种“多尺度”声音处理模型来阐明变形对初级听觉皮层中声谱表示的影响。最后,讨论了这些结果对于除语音和声道特定情况之外的一般可识别声源类别的感知的影响。

相似文献

1
The synergy between speech production and perception.言语产生与感知之间的协同作用。
J Acoust Soc Am. 2003 Jan;113(1):498-515. doi: 10.1121/1.1525288.
2
Acoustic and perceptual effects of changes in vocal tract constrictions for vowels.元音声道收缩变化的声学和感知效应。
J Acoust Soc Am. 1992 Sep;92(3):1301-9. doi: 10.1121/1.403924.
3
Perturbation Measurements on the Degree of Naturalness of Synthesized Vowels.合成元音自然度程度的微扰测量
J Voice. 2017 May;31(3):389.e1-389.e8. doi: 10.1016/j.jvoice.2016.09.020. Epub 2016 Oct 21.
4
Representation of the vocal roughness of aperiodic speech sounds in the auditory cortex.非周期性语音声音的嗓音粗糙度在听觉皮层中的表征。
J Acoust Soc Am. 2009 May;125(5):3177-85. doi: 10.1121/1.3097471.
5
Analysis of Measured and Simulated Supraglottal Acoustic Waves.测量与模拟的声门上声波分析
J Voice. 2016 Sep;30(5):518-28. doi: 10.1016/j.jvoice.2015.08.006. Epub 2015 Sep 14.
6
Sensorimotor adaptation to feedback perturbations of vowel acoustics and its relation to perception.感觉运动对元音声学反馈扰动的适应及其与感知的关系。
J Acoust Soc Am. 2007 Oct;122(4):2306-19. doi: 10.1121/1.2773966.
7
A role for the second subglottal resonance in lexical access.声门下第二共振峰在词汇提取中的作用。
J Acoust Soc Am. 2007 Oct;122(4):2320-7. doi: 10.1121/1.2772227.
8
Exploring the anatomical encoding of voice with a mathematical model of the vocal system.用语音系统的数学模型探索语音的解剖学编码。
Neuroimage. 2016 Nov 1;141:31-39. doi: 10.1016/j.neuroimage.2016.07.033. Epub 2016 Jul 17.
9
Learning to produce speech with an altered vocal tract: the role of auditory feedback.学习通过改变声道来产生语音:听觉反馈的作用。
J Acoust Soc Am. 2003 Jan;113(1):532-43. doi: 10.1121/1.1529670.
10
Cross-channel amplitude sweeps are crucial to speech intelligibility.跨通道幅度扫描对于语音可懂度至关重要。
Brain Lang. 2012 Mar;120(3):406-11. doi: 10.1016/j.bandl.2011.11.001. Epub 2011 Dec 3.

引用本文的文献

1
A comparison of vocal tract perturbation patterns based on statistical and acoustic considerations.基于统计和声学考量的声道扰动模式比较。
J Acoust Soc Am. 2007 Oct;122(4):EL107-14. doi: 10.1121/1.2771369.