Suppr超能文献

说话者变异性中的结构:有多少以及能有多大帮助?

Structure in talker variability: How much is there and how much can it help?

作者信息

Kleinschmidt Dave F

机构信息

Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA.

Department of Brain and Cognitive Sciences, University of Rochester, New York, NY, USA.

出版信息

Lang Cogn Neurosci. 2018;34(1):43-68. doi: 10.1080/23273798.2018.1500698. Epub 2018 Jul 30.

Abstract

One of the persistent puzzles in understanding human speech perception is how listeners cope with talker variability. One thing that might help listeners is structure in talker variability: rather than varying randomly, talkers of the same gender, dialect, age, etc. tend to produce language in similar ways. Listeners are sensitive to this covariation between linguistic variation and socio-indexical variables. In this paper I present new techniques based on ideal observer models to quantify (1) the amount and type of structure in talker variation ( of a grouping variable), and (2) how useful such structure can be for robust speech recognition in the face of talker variability (the of a grouping variable). I demonstrate these techniques in two phonetic domains-word-initial stop voicing and vowel identity-and show that these domains have different amounts and types of talker variability, consistent with previous, impressionistic findings. An R package (phondisttools) accompanies this paper, and the source and data are available from osf.io/zv6e3.

摘要

理解人类言语感知过程中一个长期存在的谜题是听众如何应对说话者的变异性。可能有助于听众的一点是说话者变异性中的结构:相同性别、方言、年龄等的说话者往往不会随机变化,而是倾向于以相似的方式产生语言。听众对语言变异和社会索引变量之间的这种协变很敏感。在本文中,我提出了基于理想观察者模型的新技术,以量化:(1)说话者变异(分组变量的)结构的数量和类型,以及(2)面对说话者变异性时,这种结构对稳健语音识别有多大用处(分组变量的)。我在两个语音领域——词首塞音清浊和元音识别——中展示了这些技术,并表明这些领域具有不同数量和类型的说话者变异性,这与之前的印象主义研究结果一致。本文附带了一个R包(phondisttools),源代码和数据可从osf.io/zv6e3获取。

相似文献

3
Talker familiarity and the accommodation of talker variability.说话人熟悉度与说话人变异性的顺应。
Atten Percept Psychophys. 2021 May;83(4):1842-1860. doi: 10.3758/s13414-020-02203-y. Epub 2021 Jan 4.

引用本文的文献

本文引用的文献

1
Sociolinguistic Perception as Inference Under Uncertainty.社会语言学感知作为不确定性下的推理。
Top Cogn Sci. 2018 Oct;10(4):818-834. doi: 10.1111/tops.12331. Epub 2018 Mar 15.
2
Audiovisual perceptual learning with multiple speakers.多说话者的视听感知学习
J Phon. 2016 May;56:66-74. doi: 10.1016/j.wocn.2016.02.003. Epub 2016 Mar 14.
6
Variability in Vowel Production within and between Days.不同日期内及不同日期间元音发音的变异性。
PLoS One. 2015 Sep 2;10(9):e0136791. doi: 10.1371/journal.pone.0136791. eCollection 2015.
9
Lexically guided phonetic retuning of foreign-accented speech and its generalization.词汇引导的外国口音语音调整及其泛化。
J Exp Psychol Hum Percept Perform. 2014 Apr;40(2):539-55. doi: 10.1037/a0034409. Epub 2013 Sep 23.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验