El Boghdady Nawal, Langner Florian, Gaudrain Etienne, Başkent Deniz, Nogueira Waldo
Department of Otorhinolaryngology, University Medical Center Groningen, Groningen, the Netherlands.
Graduate School of Medical Sciences, Research School of Behavioral and Cognitive Neurosciences, University of Groningen, Groningen, the Netherlands.
Ear Hear. 2021 Mar/Apr;42(2):271-289. doi: 10.1097/AUD.0000000000000936.
Understanding speech in the presence of a competing talker (speech-on-speech; SoS) is more difficult for cochlear implant (CI) users than for normal-hearing listeners. A recent study suggested that these difficulties may be related to CI users' low sensitivity to two fundamental voice cues, namely, the fundamental frequency (F0) and the vocal tract length (VTL) of the speaker. Because of the limited spectral resolution in the implant, important spectral cues carrying F0 and VTL information are expected to be distorted. This study aims to address two questions: (1) whether spectral contrast enhancement (SCE), previously shown to enhance CI users' speech intelligibility in the presence of steady-state background noise, could also improve CI users' SoS intelligibility, and (2) whether such improvements in SoS from SCE processing are due to enhancements in CI users' sensitivity to F0 and VTL differences between the competing talkers.
The effect of SCE on SoS intelligibility and comprehension was measured in two separate tasks in a sample of 14 CI users with Cochlear devices. In the first task, the CI users were asked to repeat the sentence spoken by the target speaker in the presence of a single competing talker. The competing talker was the target speaker's own voice, with F0 and VTL parametrically manipulated to obtain the different experimental conditions. SoS intelligibility, in terms of the percentage of correctly repeated words from the target sentence, was assessed using the standard advanced combination encoder (ACE) strategy and SCE for each voice condition. In the second task, SoS comprehension accuracy and response times were measured using the same experimental setup as in the first task, but with a different corpus. In the final task, CI users' sensitivity to F0 and VTL differences was measured for the ACE and SCE strategies. The benefit in F0 and VTL discrimination from SCE processing was evaluated with respect to the improvement in SoS perception from SCE.
While SCE demonstrated the potential to improve SoS intelligibility in CI users, this effect appeared to stem from SCE improving the overall signal-to-noise ratio in SoS rather than improving the sensitivity to the underlying F0 and VTL differences. A second key finding of this study was that, contrary to what has been observed in a previous study for childlike voice manipulations, F0 and VTL manipulations of a reference female speaker (target speaker) toward male-like voices provided a small but significant release from masking for the CI users tested.
The present findings, together with those previously reported in the literature, indicate that SCE could serve as a possible background-noise-reduction strategy in commercial CI speech processors that could enhance speech intelligibility, especially in the presence of background talkers that have longer VTLs than the target speaker.