University of Groningen, University Medical Center Groningen, Department of Otorhinolaryngology-Head and Neck Surgery, Groningen, The Netherlands.
Ear Hear. 2018 Mar/Apr;39(2):226-237. doi: 10.1097/AUD.0000000000000480.
When listening to two competing speakers, normal-hearing (NH) listeners can take advantage of voice differences between the speakers. Users of cochlear implants (CIs) have difficulty with such speech-on-speech perception. Previous literature has indicated that sensitivity to voice pitch (related to fundamental frequency, F0) is poor among implant users, while sensitivity to vocal-tract length (VTL; related to the height of the speaker and to formant frequencies), the other principal voice characteristic, has not been directly investigated in CI users. A few recent studies evaluated F0 and VTL perception indirectly, through voice gender categorization, which relies on perception of both voice cues. These studies revealed that, contrary to prior literature, CI users seem to rely exclusively on F0 and do not use VTL to perform this task. The objective of the present study was to directly and systematically assess raw sensitivity to F0 and VTL differences in CI users in order to define the extent of the deficit in voice perception.
The just-noticeable differences (JNDs) for F0 and VTL were measured in 11 CI listeners using triplets of consonant-vowel syllables in an adaptive three-alternative forced-choice method.
The results showed that while NH listeners had average JNDs of 1.95 and 1.73 semitones (st) for F0 and VTL, respectively, CI listeners showed JNDs of 9.19 and 7.19 st. These CI JNDs correspond to differences of 70% in F0 and 52% in VTL. Compared with the natural range of voices in the population, the F0 JND in CI listeners remains smaller than the typical male-female F0 difference. However, the average VTL JND in CI listeners is about twice as large as the typical male-female VTL difference.
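The link between semitones and percentages follows from the standard definition of a semitone as a frequency ratio of 2^(1/12). The short sketch below applies that definition to the reported JNDs; the function name `semitones_to_percent` is illustrative and not taken from the study. It reproduces the reported percentages to within about one percentage point, with the small remainder presumably due to rounding of the published JND values.

```python
def semitones_to_percent(st):
    """Convert a difference in semitones to a percentage change in the ratio.

    One semitone is a ratio of 2 ** (1 / 12), so `st` semitones correspond
    to a ratio of 2 ** (st / 12).
    """
    return (2.0 ** (st / 12.0) - 1.0) * 100.0


# Average JNDs reported in the abstract, in semitones.
reported_jnds = {
    "NH F0": 1.95,
    "NH VTL": 1.73,
    "CI F0": 9.19,
    "CI VTL": 7.19,
}

for label, st in reported_jnds.items():
    print(f"{label}: {st:.2f} st = {semitones_to_percent(st):.1f}% difference")
```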
These findings thus directly confirm that CI listeners do not seem to have sufficient access to VTL cues, likely as a result of limited spectral resolution, and hence that CI listeners' voice perception deficit goes beyond poor perception of F0. These results provide a potential common explanation not only for a number of deficits observed in CI listeners, such as in voice identification and gender categorization, but also for their difficulty with competing speech perception.