Li Bei, Wang Hui, Yang Guang, Hou Limin, Su Kaiming, Feng Yanmei, Yin Shankai
1Department of Otolaryngology, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; and 2School of Communication and Information Engineering, Shanghai University, Shanghai, China.
Ear Hear. 2016 Jan-Feb;37(1):e52-6. doi: 10.1097/AUD.0000000000000216.
To study the relative contribution of acoustic temporal fine structure (TFS) cues in low-, mid-, and high-frequency regions to Mandarin sentence recognition.
Twenty-one subjects with normal hearing were involved in a study of Mandarin sentence recognition using acoustic TFS. The acoustic TFS information was extracted from 10 3-equivalent rectangular bandwidth-wide bands within the range 80 to 8858 Hz using the Hilbert transform and was assigned to low-, mid-, and high-frequency regions. Percent-correct recognition scores were obtained with acoustic TFS information presented using one, two, or three frequency regions. The relative weights of the three frequency regions were calculated using the least-squares approach.
Results indicated that the mean percent-correct scores for sentence recognition using acoustic TFS were nearly perfect for stimuli with all three frequency regions together. Recognition was approximately 50 to 60% correct with only the low- or mid-frequency region but decreased to approximately 5% correct with only the high-frequency region of acoustic TFS. The mean weights of the low-, mid-, and high-frequency regions were 0.39, 0.48, and 0.13, respectively, and the difference between each pair of frequency regions was statistically significant.
The acoustic TFS cues in low- and mid-frequency regions convey greater information for Mandarin sentence recognition, whereas those in the high-frequency region have little effect.
研究低频、中频和高频区域的声学时间精细结构(TFS)线索对汉语句子识别的相对贡献。
21名听力正常的受试者参与了一项使用声学TFS进行汉语句子识别的研究。声学TFS信息通过希尔伯特变换从80至8858Hz范围内的10个等效矩形带宽宽带中提取,并分配到低频、中频和高频区域。使用一个、两个或三个频率区域呈现声学TFS信息,获得正确识别百分比分数。使用最小二乘法计算三个频率区域的相对权重。
结果表明,使用声学TFS进行句子识别时,对于所有三个频率区域一起呈现的刺激,平均正确识别百分比分数几乎是完美的。仅使用低频或中频区域时,识别正确率约为50%至60%,但仅使用声学TFS的高频区域时,正确率降至约5%。低频、中频和高频区域的平均权重分别为0.39、0.48和0.13,每对频率区域之间的差异具有统计学意义。
低频和中频区域的声学TFS线索对汉语句子识别传达了更多信息,而高频区域的线索影响很小。