School of Rehabilitation and Communication Sciences, Ohio University, Athens, OH 45701, USA.
Int J Audiol. 2011 Jan;50(1):14-26. doi: 10.3109/14992027.2010.515620. Epub 2010 Nov 4.
Voice pitch carries important cues for speech perception in humans. Recent studies have shown the feasibility of recording the frequency-following response (FFR) to voice pitch in normal-hearing listeners. The presence of such a response, however, has been dependent on subjective interpretation of experimenters. The purpose of this study was to develop and test an automated procedure including a control-experimental protocol and response-threshold criteria suitable for extracting FFRs to voice pitch, and compare the results to human judgments.
A set of four Mandarin tones (Tone 1 flat; Tone 2 rising; Tone 3 dipping; and Tone 4 falling) were prepared to reflect the four contrastive pitch contours. Two distinctive algorithms, short-term autocorrelation in the time domain and narrow-band spectrogram in the frequency domain, were used to estimate the Frequency Error, Slope Error, Tracking Accuracy, Pitch Strength and Pitch-Noise Ratio of the recordings from individual listeners as well as the power and false-positive rates of each algorithm.
Eleven native speakers (five males; age: mean ± SD = 31.4 ± 4.7 years) of Mandarin Chinese were recruited.
The results demonstrated that both algorithms were suitable for extracting FFRs and the objective measures showed comparable results to human judgments.
The automated procedure used in this study, including the use of the control-experimental protocol and response thresholds used for each of the five objective indices, can be used for difficult-to-test patients and may prove to be useful as an assessment and diagnostic method in both clinical and basic research efforts.
人类的语音感知中,音高携带重要线索。近期研究显示,正常听力人群中可记录到频率跟随反应(FFR)对语音音高的响应。然而,这种响应的存在取决于实验者的主观解释。本研究旨在开发和测试一种自动化程序,包括控制实验方案和响应阈值标准,以提取语音音高的 FFR,并将结果与人类判断进行比较。
我们准备了一组四个普通话声调(平调 1、升调 2、降调 3 和降调 4),以反映四种对比的音高轮廓。我们使用了两种独特的算法,即时域中的短期自相关和频域中的窄带频谱图,来估计每位听众录音的频率误差、斜率误差、跟踪精度、音强和音噪比,以及每种算法的功率和假阳性率。
我们招募了 11 名母语为普通话的汉语者(男性 5 名;年龄:均值 ± 标准差=31.4 ± 4.7 岁)。
结果表明,两种算法均适用于提取 FFR,客观测量结果与人类判断相当。
本研究中使用的自动化程序,包括控制实验方案和每个五个客观指标的响应阈值,可用于难以测试的患者,并且可能在临床和基础研究中作为评估和诊断方法非常有用。