Suppr超能文献

不同频谱区域中声学时间精细结构线索对汉语句子识别的重要性

The Importance of Acoustic Temporal Fine Structure Cues in Different Spectral Regions for Mandarin Sentence Recognition.

作者信息

Li Bei, Wang Hui, Yang Guang, Hou Limin, Su Kaiming, Feng Yanmei, Yin Shankai

机构信息

1Department of Otolaryngology, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China; and 2School of Communication and Information Engineering, Shanghai University, Shanghai, China.

出版信息

Ear Hear. 2016 Jan-Feb;37(1):e52-6. doi: 10.1097/AUD.0000000000000216.

Abstract

OBJECTIVES

To study the relative contribution of acoustic temporal fine structure (TFS) cues in low-, mid-, and high-frequency regions to Mandarin sentence recognition.

DESIGN

Twenty-one subjects with normal hearing were involved in a study of Mandarin sentence recognition using acoustic TFS. The acoustic TFS information was extracted from 10 3-equivalent rectangular bandwidth-wide bands within the range 80 to 8858 Hz using the Hilbert transform and was assigned to low-, mid-, and high-frequency regions. Percent-correct recognition scores were obtained with acoustic TFS information presented using one, two, or three frequency regions. The relative weights of the three frequency regions were calculated using the least-squares approach.

RESULTS

Results indicated that the mean percent-correct scores for sentence recognition using acoustic TFS were nearly perfect for stimuli with all three frequency regions together. Recognition was approximately 50 to 60% correct with only the low- or mid-frequency region but decreased to approximately 5% correct with only the high-frequency region of acoustic TFS. The mean weights of the low-, mid-, and high-frequency regions were 0.39, 0.48, and 0.13, respectively, and the difference between each pair of frequency regions was statistically significant.

CONCLUSION

The acoustic TFS cues in low- and mid-frequency regions convey greater information for Mandarin sentence recognition, whereas those in the high-frequency region have little effect.

摘要

目的

研究低频、中频和高频区域的声学时间精细结构(TFS)线索对汉语句子识别的相对贡献。

设计

21名听力正常的受试者参与了一项使用声学TFS进行汉语句子识别的研究。声学TFS信息通过希尔伯特变换从80至8858Hz范围内的10个等效矩形带宽宽带中提取,并分配到低频、中频和高频区域。使用一个、两个或三个频率区域呈现声学TFS信息,获得正确识别百分比分数。使用最小二乘法计算三个频率区域的相对权重。

结果

结果表明,使用声学TFS进行句子识别时,对于所有三个频率区域一起呈现的刺激,平均正确识别百分比分数几乎是完美的。仅使用低频或中频区域时,识别正确率约为50%至60%,但仅使用声学TFS的高频区域时,正确率降至约5%。低频、中频和高频区域的平均权重分别为0.39、0.48和0.13,每对频率区域之间的差异具有统计学意义。

结论

低频和中频区域的声学TFS线索对汉语句子识别传达了更多信息,而高频区域的线索影响很小。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验