Department of Communication Sciences and Disorders & Waisman Center, University of Wisconsin-Madison, Madison, WI, USA.
National Acoustic Laboratories, Sydney, Australia.
J Assoc Res Otolaryngol. 2022 Dec;23(6):759-769. doi: 10.1007/s10162-022-00855-1. Epub 2022 Aug 24.
Vowel-evoked envelope following responses (EFRs) reflect neural encoding of the fundamental frequency of voice (f). Accurate analysis of EFRs elicited by natural vowels requires the use of methods like the Fourier analyzer (FA) to consider the production-related f changes. The FA's accuracy in estimating EFRs is, however, dependent on the assumed neurophysiological processing delay needed to time-align the f time course and the recorded electroencephalogram (EEG). For male-spoken vowels (f ~ 100 Hz), a constant 10-ms delay correction is often assumed. Since processing delays vary with stimulus and physiological factors, we quantified (i) the delay-related variability that would occur in EFR estimation, and (ii) the influence of stimulus frequency, non-f related neural activity, and the listener's age on such variability. EFRs were elicited by the low-frequency first formant, and mid-frequency second and higher formants of /u/, /a/, and /i/ in young adults and 6- to 17-year-old children. To time-align with the f time course, EEG was shifted by delays between 5 and 25 ms to encompass plausible response latencies. The delay-dependent range in EFR amplitude did not vary by stimulus frequency or age and was significantly smaller when interference from low-frequency activity was reduced. On average, the delay-dependent range was < 22% of the maximum variability in EFR amplitude that could be expected by noise. Results suggest that using a constant EEG delay correction in FA analysis does not substantially alter EFR amplitude estimation. In the present study, the lack of substantial variability was likely facilitated by using vowels with small f ranges.
元音诱发包络跟随反应(EFRs)反映了语音基频(f)的神经编码。准确分析自然元音诱发的 EFR 需要使用傅里叶分析器(FA)等方法来考虑与产生相关的 f 变化。然而,FA 在估计 EFR 时的准确性取决于假设的神经生理处理延迟,以便时间对齐 f 时程和记录的脑电图(EEG)。对于男性发音的元音(f≈100 Hz),通常假设使用 10 毫秒的恒定延迟校正。由于处理延迟随刺激和生理因素而变化,我们量化了(i)在 EFR 估计中会发生的延迟相关可变性,以及(ii)刺激频率、非 f 相关神经活动和听众年龄对这种可变性的影响。EFR 是由低频第一共振峰和/u/、/a/和/i/的中频第二和更高共振峰诱发的。为了与 f 时程时间对齐,EEG 通过 5 到 25 毫秒的延迟进行移位,以包含可能的反应潜伏期。EFR 幅度的延迟依赖性范围不因刺激频率或年龄而变化,当减少低频活动的干扰时,该范围显著减小。平均而言,延迟依赖性范围<22%的 EFR 幅度最大可变性,这可能是由噪声引起的。结果表明,在 FA 分析中使用恒定的 EEG 延迟校正不会显著改变 EFR 幅度估计。在本研究中,缺乏实质性的可变性可能是由于使用了 f 范围较小的元音而得以促进。