Jerger Susan, Damian Markus F, McAlpine Rachel P, Abdi Hervé
School of Behavioral and Brain Sciences, GR4.1, University of Texas at Dallas, 800 W. Campbell Rd, Richardson, TX, 75080, USA; Callier Center for Communication Disorders, 811 Synergy Park Blvd., Richardson, TX, 75080, USA.
University of Bristol, School of Experimental Psychology, 12a Priory Road, Room 1D20, Bristol, BS8 1TU, United Kingdom.
Int J Pediatr Otorhinolaryngol. 2017 Mar;94:127-137. doi: 10.1016/j.ijporl.2017.01.009. Epub 2017 Jan 9.
Understanding spoken language is an audiovisual event that depends critically on the ability to discriminate and identify phonemes yet we have little evidence about the role of early auditory experience and visual speech on the development of these fundamental perceptual skills. Objectives of this research were to determine 1) how visual speech influences phoneme discrimination and identification; 2) whether visual speech influences these two processes in a like manner, such that discrimination predicts identification; and 3) how the degree of hearing loss affects this relationship. Such evidence is crucial for developing effective intervention strategies to mitigate the effects of hearing loss on language development.
Participants were 58 children with early-onset sensorineural hearing loss (CHL, 53% girls, M = 9;4 yrs) and 58 children with normal hearing (CNH, 53% girls, M = 9;4 yrs). Test items were consonant-vowel (CV) syllables and nonwords with intact visual speech coupled to non-intact auditory speech (excised onsets) as, for example, an intact consonant/rhyme in the visual track (Baa or Baz) coupled to non-intact onset/rhyme in the auditory track (/-B/aa or/-B/az). The items started with an easy-to-speechread/B/or difficult-to-speechread/G/onset and were presented in the auditory (static face) vs. audiovisual (dynamic face) modes. We assessed discrimination for intact vs. non-intact different pairs (e.g., Baa:/-B/aa). We predicted that visual speech would cause the non-intact onset to be perceived as intact and would therefore generate more same-as opposed to different-responses in the audiovisual than auditory mode. We assessed identification by repetition of nonwords with non-intact onsets (e.g.,/-B/az). We predicted that visual speech would cause the non-intact onset to be perceived as intact and would therefore generate more Baz-as opposed to az- responses in the audiovisual than auditory mode.
Performance in the audiovisual mode showed more same responses for the intact vs. non-intact different pairs (e.g., Baa:/-B/aa) and more intact onset responses for nonword repetition (Baz for/-B/az). Thus visual speech altered both discrimination and identification in the CHL-to a large extent for the/B/onsets but only minimally for the/G/onsets. The CHL identified the stimuli similarly to the CNH but did not discriminate the stimuli similarly. A bias-free measure of the children's discrimination skills (i.e., d' analysis) revealed that the CHL had greater difficulty discriminating intact from non-intact speech in both modes. As the degree of HL worsened, the ability to discriminate the intact vs. non-intact onsets in the auditory mode worsened. Discrimination ability in CHL significantly predicted their identification of the onsets-even after variation due to the other variables was controlled.
These results clearly established that visual speech can fill in non-intact auditory speech, and this effect, in turn, made the non-intact onsets more difficult to discriminate from intact speech and more likely to be perceived as intact. Such results 1) demonstrate the value of visual speech at multiple levels of linguistic processing and 2) support intervention programs that view visual speech as a powerful asset for developing spoken language in CHL.
理解口语是一个视听过程,严重依赖于辨别和识别音素的能力,但我们几乎没有证据表明早期听觉经验和视觉语音在这些基本感知技能发展中的作用。本研究的目的是确定:1)视觉语音如何影响音素辨别和识别;2)视觉语音是否以类似的方式影响这两个过程,即辨别是否能预测识别;3)听力损失程度如何影响这种关系。这些证据对于制定有效的干预策略以减轻听力损失对语言发展的影响至关重要。
参与者包括58名早发性感音神经性听力损失儿童(CHL,53%为女孩,平均年龄9岁4个月)和58名听力正常儿童(CNH,53%为女孩,平均年龄9岁4个月)。测试项目为辅音-元音(CV)音节和非单词,其视觉语音完整但听觉语音不完整(起始音切除),例如,视觉轨道中完整的辅音/韵(Baa或Baz)与听觉轨道中不完整的起始音/韵(/-B/aa或/-B/az)配对。项目以易于语音读取的/B/或难以语音读取的/G/起始音开始,并以听觉(静态面部)与视听(动态面部)模式呈现。我们评估了完整与不完整不同对(如Baa:/-B/aa)的辨别能力。我们预测,视觉语音会使不完整的起始音被感知为完整,因此在视听模式下比听觉模式下会产生更多相同而非不同的反应。我们通过重复不完整起始音的非单词(如/-B/az)来评估识别能力。我们预测,视觉语音会使不完整的起始音被感知为完整,因此在视听模式下比听觉模式下会产生更多Baz而非az的反应。
在视听模式下,对于完整与不完整不同对(如Baa:/-B/aa),表现出更多相同的反应,对于非单词重复(Baz对应/-B/az),表现出更多完整起始音的反应。因此,视觉语音在很大程度上改变了CHL儿童的辨别和识别能力,对于/B/起始音改变程度较大,而对于/G/起始音改变程度最小。CHL儿童对刺激的识别与CNH儿童相似,但对刺激的辨别不同。对儿童辨别技能的无偏差测量(即d'分析)表明,CHL儿童在两种模式下辨别完整与不完整语音都有更大困难。随着听力损失程度加重,在听觉模式下辨别完整与不完整起始音的能力下降。CHL儿童的辨别能力显著预测了他们对起始音的识别——即使在控制了其他变量的影响之后。
这些结果清楚地表明,视觉语音可以填补不完整的听觉语音,反过来,这种效果使得不完整的起始音更难与完整语音区分开来,并且更有可能被感知为完整。这些结果1)证明了视觉语音在语言处理多个层面的价值,2)支持将视觉语音视为CHL儿童发展口语的有力资产的干预项目。