• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

目标/掩蔽语音基频轮廓相似性对掩蔽语音识别的影响。

The effect of target/masker fundamental frequency contour similarity on masked-speech recognition.

机构信息

Department of Psychological Sciences, Case Western Reserve University, Cleveland, Ohio 44106, USA.

Department of Otolaryngology/Head and Neck Surgery, University of North Carolina, Chapel Hill, North Carolina 27599, USA.

出版信息

J Acoust Soc Am. 2019 Aug;146(2):1065. doi: 10.1121/1.5121314.

DOI:10.1121/1.5121314
PMID:31472562
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6690832/
Abstract

Greater informational masking is observed when the target and masker speech are more perceptually similar. Fundamental frequency (f0) contour, or the dynamic movement of f0, is thought to provide cues for segregating target speech presented in a speech masker. Most of the data demonstrating this effect have been collected using digitally modified stimuli. Less work has been done exploring the role of f0 contour for speech-in-speech recognition when all of the stimuli have been produced naturally. The goal of this project was to explore the importance of target and masker f0 contour similarity by manipulating the speaking style of talkers producing the target and masker speech streams. Sentence recognition thresholds were evaluated for target and masker speech that was produced with either flat, normal, or exaggerated speaking styles; performance was also measured in speech spectrum shaped noise and for conditions in which the stimuli were processed through an ideal-binary mask. Results confirmed that similarities in f0 contour depth elevated speech-in-speech recognition thresholds; however, when the target and masker had similar contour depths, targets with normal f0 contours were more resistant to masking than targets with flat or exaggerated contours. Differences in energetic masking across stimuli cannot account for these results.

摘要

当目标语音和掩蔽语音在感知上更相似时,会观察到更大的信息掩蔽。基频 (f0) 轮廓,或 f0 的动态运动,被认为为在语音掩蔽中呈现的目标语音提供了分离的线索。大多数证明这种效果的数据都是使用数字修改的刺激收集的。当所有刺激都是自然产生时,探索 f0 轮廓对语音内语音识别的作用的工作较少。该项目的目标是通过操纵产生目标和掩蔽语音流的说话者的说话风格来探索目标和掩蔽 f0 轮廓相似性的重要性。评估了使用平坦、正常或夸张说话风格产生的目标和掩蔽语音的句子识别阈值;还在语音频谱成形噪声中以及在通过理想二进制掩蔽处理刺激的情况下测量了性能。结果证实,f0 轮廓深度的相似性提高了语音内语音识别的阈值;然而,当目标和掩蔽具有相似的轮廓深度时,具有正常 f0 轮廓的目标比具有平坦或夸张轮廓的目标更能抵抗掩蔽。刺激之间的能量掩蔽差异不能解释这些结果。

相似文献

1
The effect of target/masker fundamental frequency contour similarity on masked-speech recognition.目标/掩蔽语音基频轮廓相似性对掩蔽语音识别的影响。
J Acoust Soc Am. 2019 Aug;146(2):1065. doi: 10.1121/1.5121314.
2
Effects of Target and Masker Fundamental Frequency Contour Depth on School-Age Children's Speech Recognition in a Two-Talker Masker.目标和掩蔽声基频轮廓深度对双说话人掩蔽中学生言语识别的影响。
J Speech Lang Hear Res. 2023 Jan 12;66(1):400-414. doi: 10.1044/2022_JSLHR-22-00207. Epub 2022 Dec 29.
3
The effect of fundamental frequency contour similarity on multi-talker listening in older and younger adults.基频轮廓相似性对老年和年轻成年人多说话人聆听的影响。
J Acoust Soc Am. 2020 Dec;148(6):3527. doi: 10.1121/10.0002661.
4
Informational masking and the effects of differences in fundamental frequency and fundamental-frequency contour on phonetic integration in a formant ensemble.信息掩蔽以及共振峰组合中基频和基频轮廓差异对语音整合的影响。
Hear Res. 2017 Feb;344:295-303. doi: 10.1016/j.heares.2016.10.026. Epub 2016 Nov 1.
5
The intelligibility of speech in a harmonic masker varying in fundamental frequency contour, broadband temporal envelope, and spatial location.在基频轮廓、宽带时间包络和空间位置变化的谐波掩蔽器中语音的可懂度。
Hear Res. 2017 Jul;350:1-10. doi: 10.1016/j.heares.2017.03.012. Epub 2017 Mar 29.
6
Effectiveness of Two-Talker Maskers That Differ in Talker Congruity and Perceptual Similarity to the Target Speech.双说话人掩蔽在与目标语音的说话人一致性和感知相似性方面存在差异的有效性。
Trends Hear. 2017 Jan-Dec;21:2331216517709385. doi: 10.1177/2331216517709385.
7
The effects of fundamental frequency contour manipulations on speech intelligibility in background noise.基频轮廓处理对背景噪声中语音可懂度的影响。
J Acoust Soc Am. 2010 Jul;128(1):435-43. doi: 10.1121/1.3397384.
8
Masking release with changing fundamental frequency: Electric acoustic stimulation resembles normal hearing subjects.随着基频变化的掩蔽释放:电声刺激类似于正常听力受试者。
Hear Res. 2017 Jul;350:226-234. doi: 10.1016/j.heares.2017.05.004. Epub 2017 May 11.
9
Roles of the target and masker fundamental frequencies in voice segregation.目标音和掩蔽音基频在语音分离中的作用。
J Acoust Soc Am. 2014 Sep;136(3):1225. doi: 10.1121/1.4890649.
10
Developmental Effects in Children's Ability to Benefit From F0 Differences Between Target and Masker Speech.儿童从目标语音和掩蔽语音的 F0 差异中获益的能力的发展影响。
Ear Hear. 2019 Jul/Aug;40(4):927-937. doi: 10.1097/AUD.0000000000000673.

引用本文的文献

1
The Interaction of Target and Masker Speech in Competing Speech Perception.竞争言语感知中目标言语与掩蔽言语的相互作用
Brain Sci. 2025 Aug 4;15(8):834. doi: 10.3390/brainsci15080834.
2
Breathy Vocal Quality, Background Noise, and Hearing Loss: How Do These Adverse Conditions Affect Speech Perception by Older Adults?呼吸音质、背景噪音与听力损失:这些不利状况如何影响老年人的言语感知?
Ear Hear. 2025;46(2):474-482. doi: 10.1097/AUD.0000000000001599. Epub 2024 Nov 4.
3
Recognition of Speech With Dynamic Pitch Manipulation in Noise: Effects of Manipulation Methods.噪声环境下具有动态音高操纵的语音识别:操纵方法的影响。
J Speech Lang Hear Res. 2024 Jan 8;67(1):269-281. doi: 10.1044/2023_JSLHR-23-00142. Epub 2023 Nov 20.
4
Interactions between acoustic challenges and processing depth in speech perception as measured by task-evoked pupil response.通过任务诱发瞳孔反应测量的语音感知中声学挑战与加工深度之间的相互作用。
Front Psychol. 2022 Oct 25;13:959638. doi: 10.3389/fpsyg.2022.959638. eCollection 2022.
5
Maturation of Speech-in-Speech Recognition for Whispered and Voiced Speech.语音识别中语音和耳语语音的成熟度。
J Speech Lang Hear Res. 2022 Aug 17;65(8):3117-3128. doi: 10.1044/2022_JSLHR-21-00620. Epub 2022 Jul 22.
6
Revisiting the target-masker linguistic similarity hypothesis.重新审视目标-掩蔽语料库语言相似性假说。
Atten Percept Psychophys. 2022 Jul;84(5):1772-1787. doi: 10.3758/s13414-022-02486-3. Epub 2022 Apr 26.
7
Analysis Model of Spoken English Evaluation Algorithm Based on Intelligent Algorithm of Internet of Things.基于物联网智能算法的英语口语评估算法分析模型。
Comput Intell Neurosci. 2022 Mar 27;2022:8469945. doi: 10.1155/2022/8469945. eCollection 2022.
8
The Effects of Uncertainty in Level on Speech-on-Speech Masking.水平不确定性对言语掩蔽的影响。
Trends Hear. 2022 Jan-Dec;26:23312165221077555. doi: 10.1177/23312165221077555.
9
Pupillary response to dynamic pitch alteration during speech perception in noise.噪声环境下言语感知过程中对动态音调变化的瞳孔反应。
JASA Express Lett. 2021 Nov;1(11):115202. doi: 10.1121/10.0007056. Epub 2021 Nov 5.
10
Band importance for speech-in-speech recognition.语音中语音识别的频段重要性。
JASA Express Lett. 2021 Aug;1(8):084402. doi: 10.1121/10.0005762. Epub 2021 Aug 2.

本文引用的文献

1
Developmental Effects in Children's Ability to Benefit From F0 Differences Between Target and Masker Speech.儿童从目标语音和掩蔽语音的 F0 差异中获益的能力的发展影响。
Ear Hear. 2019 Jul/Aug;40(4):927-937. doi: 10.1097/AUD.0000000000000673.
2
Developmental Effects in Masking Release for Speech-in-Speech Perception Due to a Target/Masker Sex Mismatch.由于目标/掩蔽者性别不匹配导致语音感知掩蔽释放中的发育效应。
Ear Hear. 2018 Sep/Oct;39(5):935-945. doi: 10.1097/AUD.0000000000000554.
3
Making predictable unpredictable with style - Behavioral and electrophysiological evidence for the critical role of prosodic expectations in the perception of prominence in speech.用风格使不可预测变得可预测——关于在言语感知中重音感知的韵律预期的关键作用的行为和电生理证据。
Neuropsychologia. 2018 Jan 31;109:181-199. doi: 10.1016/j.neuropsychologia.2017.12.011. Epub 2017 Dec 14.
4
Effectiveness of Two-Talker Maskers That Differ in Talker Congruity and Perceptual Similarity to the Target Speech.双说话人掩蔽在与目标语音的说话人一致性和感知相似性方面存在差异的有效性。
Trends Hear. 2017 Jan-Dec;21:2331216517709385. doi: 10.1177/2331216517709385.
5
Effect of F0 contours on top-down repair of interrupted speech.基频轮廓对中断言语自上而下修复的影响。
J Acoust Soc Am. 2017 Jul;142(1):EL7. doi: 10.1121/1.4990398.
6
Perception of Sentence Stress in Speech Correlates With the Temporal Unpredictability of Prosodic Features.言语中句子重音的感知与韵律特征的时间不可预测性相关。
Cogn Sci. 2016 Sep;40(7):1739-1774. doi: 10.1111/cogs.12306. Epub 2015 Oct 20.
7
Listening to speech in a background of other talkers: effects of talker number and noise vocoding.在其他说话者背景下听语音:说话者数量和噪声编码的影响。
J Acoust Soc Am. 2013 Apr;133(4):2431-43. doi: 10.1121/1.4794379.
8
Notionally steady background noise acts primarily as a modulation masker of speech.从概念上讲,稳定的背景噪声主要充当言语的调制掩蔽器。
J Acoust Soc Am. 2012 Jul;132(1):317-26. doi: 10.1121/1.4725766.
9
Effects of fundamental frequency and vocal-tract length cues on sentence segregation by listeners with hearing loss.频率基音和声道长度线索对听力损失者句子切分的影响。
J Acoust Soc Am. 2011 Aug;130(2):1006-19. doi: 10.1121/1.3605548.
10
Effects of target-masker contextual similarity on the multimasker penalty in a three-talker diotic listening task.在三说话者双耳聆听任务中,目标-掩蔽声上下文相似性对多掩蔽声惩罚的影响。
J Acoust Soc Am. 2010 Nov;128(5):2998-10. doi: 10.1121/1.3479547.