Department of Speech, Language and Hearing Sciences, Indiana University, Bloomington, IN, USA; Program in Neuroscience, Indiana University, Bloomington, IN, USA.
Department of Speech, Language and Hearing Sciences, Indiana University, Bloomington, IN, USA; Program in Neuroscience, Indiana University, Bloomington, IN, USA; Cognitive Science Program, Indiana University, Bloomington, IN, USA.
Brain Res. 2024 Dec 1;1844:149166. doi: 10.1016/j.brainres.2024.149166. Epub 2024 Aug 14.
Acoustic information in speech changes continuously, yet listeners form discrete perceptual categories to ease the demands of perception. Being a more continuous/gradient as opposed to a more discrete/categorical listener may be further advantageous for understanding speech in noise by increasing perceptual flexibility and resolving ambiguity. The degree to which a listener's responses to a continuum of speech sounds are categorical versus continuous can be quantified using visual analog scaling (VAS) during speech labeling tasks. Here, we recorded event-related brain potentials (ERPs) to vowels along an acoustic-phonetic continuum (/u/ to /a/) while listeners categorized phonemes in both clean and noise conditions. Behavior was assessed using standard two alternative forced choice (2AFC) and VAS paradigms to evaluate categorization under task structures that promote discrete vs. continuous hearing, respectively. Behaviorally, identification curves were steeper under 2AFC vs. VAS categorization but were relatively immune to noise, suggesting robust access to abstract, phonetic categories even under signal degradation. Behavioral slopes were correlated with listeners' QuickSIN scores; shallower slopes corresponded with better speech in noise performance, suggesting a perceptual advantage to noise degraded speech comprehension conferred by a more gradient listening strategy. At the neural level, P2 amplitudes and latencies of the ERPs were modulated by task and noise; VAS responses were larger and showed greater noise-related latency delays than 2AFC responses. More gradient responders had smaller shifts in ERP latency with noise, suggesting their neural encoding of speech was more resilient to noise degradation. Interestingly, source-resolved ERPs showed that more gradient listening was also correlated with stronger neural responses in left superior temporal gyrus. Our results demonstrate that listening strategy modulates the categorical organization of speech and behavioral success, with more continuous/gradient listening being advantageous to sentential speech in noise perception.
语音中的声学信息是连续变化的,但听众会形成离散的感知类别,以减轻感知的负担。与更离散/分类的听众相比,作为一个更连续/渐变的听众可能通过增加感知灵活性和解决歧义,对理解噪声中的语音更有利。在语音标注任务中,可以使用视觉模拟评分 (VAS) 来量化听众对语音声音连续体的反应是分类的还是连续的程度。在这里,我们记录了事件相关脑电位 (ERP),在干净和噪声条件下,听众对沿声学语音连续体 (/u/ 到 /a/) 的元音进行分类。使用标准的二选一强制选择 (2AFC) 和 VAS 范式来评估分别在促进离散和连续听觉的任务结构下的分类,以此评估行为。从行为上看,2AFC 分类下的识别曲线比 VAS 分类更陡峭,但对噪声相对免疫,这表明即使在信号退化的情况下,也能很好地获得抽象的、语音类别。行为斜率与听众的快速 SIN 分数相关;斜率越浅,噪声下的语音表现越好,这表明更渐变的听力策略对噪声下语音理解的感知优势。在神经水平上,ERP 的 P2 振幅和潜伏期受到任务和噪声的调制;VAS 反应比 2AFC 反应更大,并且具有更大的与噪声相关的潜伏期延迟。更渐变的反应者在噪声下的 ERP 潜伏期变化较小,这表明他们的语音神经编码对噪声退化更具弹性。有趣的是,源分辨 ERP 显示,更渐变的听力也与左颞上回更强的神经反应相关。我们的研究结果表明,听力策略调节了言语的分类组织和行为成功,而更连续/渐变的听力对噪声感知中的句子语音更有利。