• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用合成语音信号的单耳语音分离

Monaural speech segregation using synthetic speech signals.

作者信息

Brungart Douglas S, Iyer Nandini, Simpson Brian D

机构信息

Air Force Research Laboratory, Wright-Patterson Air Force Base, Ohio 45433-7901, USA.

出版信息

J Acoust Soc Am. 2006 Apr;119(4):2327-33. doi: 10.1121/1.2170030.

DOI:10.1121/1.2170030
PMID:16642846
Abstract

When listening to natural speech, listeners are fairly adept at using cues such as pitch, vocal tract length, prosody, and level differences to extract a target speech signal from an interfering speech masker. However, little is known about the cues that listeners might use to segregate synthetic speech signals that retain the intelligibility characteristics of speech but lack many of the features that listeners normally use to segregate competing talkers. In this experiment, intelligibility was measured in a diotic listening task that required the segregation of two simultaneously presented synthetic sentences. Three types of synthetic signals were created: (1) sine-wave speech (SWS); (2) modulated noise-band speech (MNB); and (3) modulated sine-band speech (MSB). The listeners performed worse for all three types of synthetic signals than they did with natural speech signals, particularly at low signal-to-noise ratio (SNR) values. Of the three synthetic signals, the results indicate that SWS signals preserve more of the voice characteristics used for speech segregation than MNB and MSB signals. These findings have implications for cochlear implant users, who rely on signals very similar to MNB speech and thus are likely to have difficulty understanding speech in cocktail-party listening environments.

摘要

在聆听自然语音时,听众相当擅长利用诸如音高、声道长度、韵律和电平差异等线索,从干扰性语音掩蔽中提取目标语音信号。然而,对于听众可能用于分离合成语音信号的线索却知之甚少,这些合成语音信号保留了语音的可懂度特征,但缺乏许多听众通常用于分离竞争说话者的特征。在本实验中,通过双耳聆听任务测量可懂度,该任务要求分离两个同时呈现的合成句子。创建了三种类型的合成信号:(1) 正弦波语音 (SWS);(2) 调制噪声带语音 (MNB);以及 (3) 调制正弦带语音 (MSB)。与自然语音信号相比,听众对所有三种类型的合成信号的表现都更差,尤其是在低信噪比 (SNR) 值时。在这三种合成信号中,结果表明,与MNB和MSB信号相比,SWS信号保留了更多用于语音分离的语音特征。这些发现对人工耳蜗使用者具有启示意义,他们依赖与MNB语音非常相似的信号,因此在鸡尾酒会聆听环境中理解语音可能会有困难。

相似文献

1
Monaural speech segregation using synthetic speech signals.使用合成语音信号的单耳语音分离
J Acoust Soc Am. 2006 Apr;119(4):2327-33. doi: 10.1121/1.2170030.
2
Informational and energetic masking effects in the perception of multiple simultaneous talkers.多个同时说话者感知中的信息性和能量掩蔽效应。
J Acoust Soc Am. 2001 Nov;110(5 Pt 1):2527-38. doi: 10.1121/1.1408946.
3
Sine-wave and noise-vocoded sine-wave speech in a tone language: Acoustic details matter.声调语言中的正弦波语音和噪声编码正弦波语音:声学细节很重要。
J Acoust Soc Am. 2015 Dec;138(6):3698-702. doi: 10.1121/1.4937605.
4
Across-ear interference from parametrically degraded synthetic speech signals in a dichotic cocktail-party listening task.在双耳分听鸡尾酒会式聆听任务中,参数降质合成语音信号产生的跨耳干扰。
J Acoust Soc Am. 2005 Jan;117(1):292-304. doi: 10.1121/1.1835509.
5
Within-ear and across-ear interference in a cocktail-party listening task.鸡尾酒会式听力任务中的耳内和耳间干扰
J Acoust Soc Am. 2002 Dec;112(6):2985-95. doi: 10.1121/1.1512703.
6
Energetic and Informational Components of Speech-on-Speech Masking in Binaural Speech Intelligibility and Perceived Listening Effort.双耳语音可懂度和感知聆听努力中语音掩蔽的能量和信息成分。
Trends Hear. 2019 Jan-Dec;23:2331216519854597. doi: 10.1177/2331216519854597.
7
The effects of spatial separation in distance on the informational and energetic masking of a nearby speech signal.距离上的空间分离对附近语音信号的信息掩蔽和能量掩蔽的影响。
J Acoust Soc Am. 2002 Aug;112(2):664-76. doi: 10.1121/1.1490592.
8
Speech perception, localization, and lateralization with bilateral cochlear implants.双侧人工耳蜗植入后的言语感知、定位和侧向化
J Acoust Soc Am. 2003 Mar;113(3):1617-30. doi: 10.1121/1.1539520.
9
Noise and pitch interact during the cortical segregation of concurrent speech.在同时出现的语音的皮质分离过程中,噪声和音高相互作用。
Hear Res. 2017 Aug;351:34-44. doi: 10.1016/j.heares.2017.05.008. Epub 2017 May 25.
10
Pure linguistic interference during comprehension of competing speech signals.在理解竞争性言语信号过程中的纯粹语言干扰。
J Acoust Soc Am. 2017 Mar;141(3):EL249. doi: 10.1121/1.4977590.

引用本文的文献

1
Speech intelligibility and talker gender classification with noise-vocoded and tone-vocoded speech.基于噪声声码和音调声码语音的语音清晰度及说话者性别分类
JASA Express Lett. 2021 Sep;1(9):094401. doi: 10.1121/10.0006285. Epub 2021 Sep 20.
2
Variations in the slope of the psychometric functions for speech intelligibility: a systematic survey.言语可懂度心理物理函数斜率的变化:系统调查。
Trends Hear. 2014 Jun 6;18:2331216514537722. doi: 10.1177/2331216514537722.
3
Effect of fundamental-frequency and sentence-onset differences on speech-identification performance of young and older adults in a competing-talker background.
在有竞争说话者背景下,基频和句子起始差异对年轻和老年成年人言语识别表现的影响。
J Acoust Soc Am. 2012 Sep;132(3):1700-17. doi: 10.1121/1.4740482.
4
Estimating speech spectra for copy synthesis by linear prediction and by hand.通过线性预测和手工方法估计复制合成的语音频谱。
J Acoust Soc Am. 2011 Oct;130(4):2173-8. doi: 10.1121/1.3631667.
5
AUDITORY-PHONETIC PROJECTION AND LEXICAL STRUCTURE IN THE RECOGNITION OF SINE-WAVE WORDS.正弦波词识别中的听觉语音投射与词汇结构
J Exp Psychol Hum Percept Perform. 2009 Apr 1;125(4):2656.
6
[Examining informational masking in cochlear implant users].[研究人工耳蜗使用者中的信息掩蔽]
HNO. 2009 Jul;57(7):671-7. doi: 10.1007/s00106-008-1747-5.
7
Lexical and indexical cues in masking by competing speech.竞争性言语掩蔽中的词汇和索引线索。
J Acoust Soc Am. 2009 Jan;125(1):447-56. doi: 10.1121/1.3035837.