• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

同时语音识别中的位置和声学尺度线索

Location and acoustic scale cues in concurrent speech recognition.

作者信息

Ives D Timothy, Vestergaard Martin D, Kistler Doris J, Patterson Roy D

机构信息

Department of Physiology, Centre for the Neural Basis of Hearing, University of Cambridge, Downing Street, Cambridge CB2 3EG, United Kingdom.

出版信息

J Acoust Soc Am. 2010 Jun;127(6):3729-37. doi: 10.1121/1.3377051.

DOI:10.1121/1.3377051
PMID:20550271
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3041806/
Abstract

Location and acoustic scale cues have both been shown to have an effect on the recognition of speech in multi-speaker environments. This study examines the interaction of these variables. Subjects were presented with concurrent triplets of syllables from a target voice and a distracting voice, and asked to recognize a specific target syllable. The task was made more or less difficult by changing (a) the location of the distracting speaker, (b) the scale difference between the two speakers, and/or (c) the relative level of the two speakers. Scale differences were produced by changing the vocal tract length and glottal pulse rate during syllable synthesis: 32 acoustic scale differences were used. Location cues were produced by convolving head-related transfer functions with the stimulus. The angle between the target speaker and the distracter was 0 degrees, 4 degrees, 8 degrees, 16 degrees, or 32 degrees on the 0 degrees horizontal plane. The relative level of the target to the distracter was 0 or -6 dB. The results show that location and scale difference interact, and the interaction is greatest when one of these cues is small. Increasing either the acoustic scale or the angle between target and distracter speakers quickly elevates performance to ceiling levels.

摘要

位置和声学尺度线索已被证明在多说话者环境中对语音识别都有影响。本研究考察了这些变量之间的相互作用。向受试者呈现来自目标声音和干扰声音的同时出现的三音节组,并要求他们识别特定的目标音节。通过改变(a)干扰说话者的位置、(b)两个说话者之间的尺度差异和/或(c)两个说话者的相对音量,使任务变得或多或少更具难度。尺度差异是通过在音节合成过程中改变声道长度和声门脉冲率产生的:使用了32种声学尺度差异。位置线索是通过将头部相关传递函数与刺激进行卷积产生的。在0度水平面上,目标说话者与干扰者之间的角度为0度、4度、8度、16度或32度。目标相对于干扰者的相对音量为0或 -6分贝。结果表明,位置和尺度差异相互作用,当这些线索之一较小时,这种相互作用最大。增加声学尺度或目标与干扰说话者之间的角度会迅速将性能提升到上限水平。

相似文献

1
Location and acoustic scale cues in concurrent speech recognition.同时语音识别中的位置和声学尺度线索
J Acoust Soc Am. 2010 Jun;127(6):3729-37. doi: 10.1121/1.3377051.
2
The interaction of vocal characteristics and audibility in the recognition of concurrent syllables.在同时出现的音节识别中声音特征与可听度的相互作用。
J Acoust Soc Am. 2009 Feb;125(2):1114-24. doi: 10.1121/1.3050321.
3
Effects of voicing in the recognition of concurrent syllables.协同音节识别中的浊音效应。
J Acoust Soc Am. 2009 Dec;126(6):2860-3. doi: 10.1121/1.3257582.
4
The mutual roles of temporal glimpsing and vocal characteristics in cocktail-party listening.时间窥视和声音特征在鸡尾酒会听力中的相互作用。
J Acoust Soc Am. 2011 Jul;130(1):429-39. doi: 10.1121/1.3596462.
5
Discrimination of speaker size from syllable phrases.从音节短语中辨别说话者的体型。
J Acoust Soc Am. 2005 Dec;118(6):3816-22. doi: 10.1121/1.2118427.
6
A neural mechanism for recognizing speech spoken by different speakers.一种识别不同说话者语音的神经机制。
Neuroimage. 2014 May 1;91:375-85. doi: 10.1016/j.neuroimage.2014.01.005. Epub 2014 Jan 13.
7
The Use of Voice Cues for Speaker Gender Recognition in Cochlear Implant Recipients.人工耳蜗植入者中语音线索用于说话者性别识别的研究
J Speech Lang Hear Res. 2016 Jun 1;59(3):546-56. doi: 10.1044/2015_JSLHR-H-15-0128.
8
Discrimination of speaker sex and size when glottal-pulse rate and vocal-tract length are controlled.在声门脉冲率和声道长度得到控制的情况下对说话者性别和体型的辨别。
J Acoust Soc Am. 2007 Dec;122(6):3628-39. doi: 10.1121/1.2799507.
9
Divided listening in the free field becomes asymmetric when acoustic cues are limited.自由场中的分离听力在声学线索有限时变得不对称。
Hear Res. 2022 Mar 15;416:108444. doi: 10.1016/j.heares.2022.108444. Epub 2022 Jan 17.
10
The acoustic bases of human voice identity processing in dogs.人类声音身份处理在犬类中的声学基础。
Anim Cogn. 2022 Aug;25(4):905-916. doi: 10.1007/s10071-022-01601-z. Epub 2022 Feb 10.

引用本文的文献

1
Behavioral Account of Attended Stream Enhances Neural Tracking.被关注信息流的行为描述增强了神经追踪。
Front Neurosci. 2021 Dec 13;15:674112. doi: 10.3389/fnins.2021.674112. eCollection 2021.
2
Interaural level differences do not suffice for restoring spatial release from masking in simulated cochlear implant listening.双侧声级差不足以恢复模拟人工耳蜗听力中的掩蔽释放的空间辨别力。
PLoS One. 2012;7(9):e45296. doi: 10.1371/journal.pone.0045296. Epub 2012 Sep 20.
3
Effects of fundamental frequency and vocal-tract length cues on sentence segregation by listeners with hearing loss.频率基音和声道长度线索对听力损失者句子切分的影响。
J Acoust Soc Am. 2011 Aug;130(2):1006-19. doi: 10.1121/1.3605548.

本文引用的文献

1
The difference between monaural and binaural thresholds.
J Exp Psychol. 1947 Jun;37(3):229-42. doi: 10.1037/h0055386.
2
A statistical, formant-pattern model for segregating vowel type and vocal-tract length in developmental formant data.一种用于在发育共振峰数据中分离元音类型和声道长度的统计共振峰模式模型。
J Acoust Soc Am. 2009 Apr;125(4):2374-86. doi: 10.1121/1.3079772.
3
The interaction of vocal characteristics and audibility in the recognition of concurrent syllables.在同时出现的音节识别中声音特征与可听度的相互作用。
J Acoust Soc Am. 2009 Feb;125(2):1114-24. doi: 10.1121/1.3050321.
4
Speech segregation in rooms: effects of reverberation on both target and interferer.房间内的语音分离:混响对目标语音和干扰语音的影响。
J Acoust Soc Am. 2007 Sep;122(3):1713. doi: 10.1121/1.2764469.
5
Neural representation of auditory size in the human voice and in sounds from other resonant sources.人类声音及其他共鸣源声音中听觉大小的神经表征。
Curr Biol. 2007 Jul 3;17(13):1123-8. doi: 10.1016/j.cub.2007.05.061.
6
A glimpsing model of speech perception in noise.一种噪声中语音感知的一瞥模型。
J Acoust Soc Am. 2006 Mar;119(3):1562-73. doi: 10.1121/1.2166600.
7
Discrimination of speaker size from syllable phrases.从音节短语中辨别说话者的体型。
J Acoust Soc Am. 2005 Dec;118(6):3816-22. doi: 10.1121/1.2118427.
8
The interaction of glottal-pulse rate and vocal-tract length in judgements of speaker size, sex, and age.声门脉冲率与声道长度在说话者体型、性别和年龄判断中的相互作用。
J Acoust Soc Am. 2005 Nov;118(5):3177-86. doi: 10.1121/1.2047107.
9
The processing and perception of size information in speech sounds.语音中大小信息的处理与感知。
J Acoust Soc Am. 2005 Jan;117(1):305-18. doi: 10.1121/1.1828637.
10
Informational and energetic masking effects in the perception of two simultaneous talkers.同时感知两个说话者时的信息性和能量性掩蔽效应。
J Acoust Soc Am. 2001 Mar;109(3):1101-9. doi: 10.1121/1.1345696.