Department of Otorhinolaryngology-Head and Neck Surgery, Nowon Eulji Medical Center, Eulji University School of Medicine, Seoul 01830, Republic of Korea.
Eulji Tinnitus and Hearing Research Institute, Nowon Eulji Medical Center, Seoul 01830, Republic of Korea.
eNeuro. 2024 May 29;11(5). doi: 10.1523/ENEURO.0528-23.2024. Print 2024 May.
This study compared the impact of spectral and temporal degradation on vocoded speech recognition between early-blind and sighted subjects. The participants included 25 early-blind subjects (30.32 ± 4.88 years; male:female, 14:11) and 25 age- and sex-matched sighted subjects. Tests included monosyllable recognition in noise at various signal-to-noise ratios (-18 to -4 dB), matrix sentence-in-noise recognition, and vocoded speech recognition with different numbers of channels (4, 8, 16, and 32) and temporal envelope cutoff frequencies (50 vs 500 Hz). Cortical-evoked potentials (N2 and P3b) were measured in response to spectrally and temporally degraded stimuli. The early-blind subjects displayed superior monosyllable and sentence recognition than sighted subjects (all < 0.01). In the vocoded speech recognition test, a three-way repeated-measure analysis of variance (two groups × four channels × two cutoff frequencies) revealed significant main effects of group, channel, and cutoff frequency (all < 0.001). Early-blind subjects showed increased sensitivity to spectral degradation for speech recognition, evident in the significant interaction between group and channel ( = 0.007). N2 responses in early-blind subjects exhibited shorter latency and greater amplitude in the 8-channel ( = 0.022 and 0.034, respectively) and shorter latency in the 16-channel ( = 0.049) compared with sighted subjects. In conclusion, early-blind subjects demonstrated speech recognition advantages over sighted subjects, even in the presence of spectral and temporal degradation. Spectral degradation had a greater impact on speech recognition in early-blind subjects, while the effect of temporal degradation was similar in both groups.
本研究比较了光谱和时间退化对早期失明和正常视力受试者语音识别的影响。参与者包括 25 名早期失明受试者(30.32±4.88 岁;男:女,14:11)和 25 名年龄和性别匹配的正常视力受试者。测试包括在不同信噪比(-18 至-4dB)下的噪声中单音节识别、矩阵句子在噪声中的识别以及不同声道数(4、8、16 和 32)和时间包络截止频率(50 与 500Hz)的语音识别。皮质诱发电位(N2 和 P3b)用于测量对光谱和时间退化刺激的反应。早期失明受试者的单音节和句子识别能力优于正常视力受试者(均<0.01)。在语音识别测试中,三因素重复测量方差分析(两组×四声道×两个截止频率)显示组、声道和截止频率均有显著主效应(均<0.001)。早期失明受试者对语音识别的光谱退化更为敏感,组和声道之间的显著交互作用表明了这一点( = 0.007)。与正常视力受试者相比,早期失明受试者的 N2 反应潜伏期更短,8 声道( = 0.022 和 0.034)和 16 声道( = 0.049)的振幅更大。总之,即使在存在光谱和时间退化的情况下,早期失明受试者的语音识别能力也优于正常视力受试者。光谱退化对早期失明受试者的语音识别影响更大,而时间退化的影响在两组之间相似。