Salagovic Cailey A, Stevenson Ryan A, Butler Blake E
Graduate Program in Psychology, University of Western Ontario, London, Ontario, Canada.
Department of Psychology, University of Western Ontario, London, Ontario, Canada.
Ear Hear. 2025;46(3):596-606. doi: 10.1097/AUD.0000000000001607. Epub 2024 Dec 11.
Speech intelligibility is supported by the sound of a talker's voice and visual cues related to articulatory movements. The relative contribution of auditory and visual cues to an integrated audiovisual percept varies depending on a listener's environment and sensory acuity. Cochlear implant users rely more on visual cues than those with acoustic hearing, which helps compensate for the fact that the auditory signal produced by their implant is poorly resolved relative to that of the typically developed cochlea. The relative weight placed on auditory and visual speech cues can be measured by presenting discordant cues across the two modalities and assessing the resulting percept (the McGurk effect). The current literature is mixed with regard to how cochlear implant users respond to McGurk stimuli; some studies suggest they report hearing syllables that represent a fusion of the auditory and visual cues more frequently than typical-hearing controls, while others report less frequent fusion. However, several of these studies compared implant users to younger control samples despite evidence that the likelihood and strength of audiovisual integration increase with age. Thus, the present study sought to clarify the impacts of hearing status and age on multisensory speech integration using a combination of behavioral analyses and response modeling.
Cochlear implant users (mean age = 58.9 years), age-matched controls (mean age = 61.5 years), and younger controls (mean age = 25.9 years) completed an online audiovisual speech task. Participants saw and/or heard four different talkers producing syllables in auditory-alone, visual-alone, and incongruent audiovisual conditions. After each trial, participants reported the syllable they heard or saw by choosing from a list of four possible options.
The younger and older control groups performed similarly in both unisensory conditions. The cochlear implant users performed significantly better than either control group in the visual-alone condition. When responding to the incongruent audiovisual trials, cochlear implant users and age-matched controls experienced significantly more fusion than younger controls. When fusion was not experienced, younger controls were more likely to report the auditorily presented syllable than either implant users or age-matched controls. Conversely, implant users were more likely to report the visually presented syllable than either age-matched controls or younger controls. Modeling of the relationship between stimuli and behavioral responses revealed that younger controls had lower disparity thresholds (i.e., were less likely to experience a fused audiovisual percept) than either the implant users or older controls, while implant users had higher levels of sensory noise (i.e., more variability in the way a given stimulus pair is perceived across multiple presentations) than age-matched controls.
Our findings suggest that age and cochlear implantation may have independent effects on McGurk effect perception. Noisy encoding of disparity modeling confirms that age is a strong predictor of an individual's prior likelihood of experiencing audiovisual integration but suggests that hearing status modulates this relationship due to differences in sensory noise during speech encoding. Together, these findings demonstrate that different groups of listeners can arrive at similar levels of performance in different ways, and highlight the need for careful consideration of stimulus- and group-related effects on multisensory speech perception.
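The two model parameters named above (disparity threshold and sensory noise) can be illustrated with a minimal sketch. It assumes the formulation commonly associated with the noisy encoding of disparity model, in which the perceived audiovisual disparity on a trial is a Gaussian-noisy copy of the stimulus disparity and a fused percept is reported when the perceived disparity falls below the listener's threshold. The abstract does not give these equations, so the functional form, the function names, and all numeric values below are assumptions for illustration only, not the authors' implementation or fitted estimates.

```python
from math import erf, sqrt


def normal_cdf(z: float) -> float:
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))


def fusion_probability(stimulus_disparity: float,
                       disparity_threshold: float,
                       sensory_noise: float) -> float:
    """Probability of a fused percept under the assumed model.

    Assumption: perceived disparity ~ Normal(stimulus_disparity, sensory_noise),
    and fusion occurs when perceived disparity < disparity_threshold.
    """
    if sensory_noise <= 0:
        raise ValueError("sensory_noise must be positive")
    z = (disparity_threshold - stimulus_disparity) / sensory_noise
    return normal_cdf(z)


if __name__ == "__main__":
    # Hypothetical parameter values, chosen only to show the direction of effects.
    stimulus = 1.5  # hypothetical disparity of one McGurk stimulus pair
    listeners = {
        "lower threshold, lower noise": (1.0, 0.5),
        "higher threshold, lower noise": (2.0, 0.5),
        "higher threshold, higher noise": (2.0, 1.0),
    }
    for label, (threshold, noise) in listeners.items():
        p = fusion_probability(stimulus, threshold, noise)
        print(f"{label}: P(fusion) = {p:.3f}")
```

Under this assumed form, a lower threshold reduces the probability of fusion for a given stimulus, while higher sensory noise pulls the probability toward 0.5 and makes responses to the same stimulus pair less consistent across presentations. This matches the qualitative descriptions of the two parameters given in the text.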