Methods for minimizing the confounding effects of word length in the analysis of phonotactic probability and neighborhood density.

Author information

Storkel Holly L

Affiliation

Department of Speech-Language-Hearing: Sciences and Disorders, 3001 Dole Human Development Center, University of Kansas, Lawrence 66045-7555, USA.

Publication information

J Speech Lang Hear Res. 2004 Dec;47(6):1454-68. doi: 10.1044/1092-4388(2004/108).

Abstract

Recent research suggests that phonotactic probability (the likelihood of occurrence of a sound sequence) and neighborhood density (the number of words phonologically similar to a given word) influence spoken language processing and acquisition across the lifespan in both normal and clinical populations. The majority of research in this area has tended to focus on controlled laboratory studies rather than naturalistic data such as spontaneous speech samples or elicited probes. One difficulty in applying current measures of phonotactic probability and neighborhood density to more naturalistic samples is the significant correlation between these variables and word length. This study examines several alternative transformations of phonotactic probability and neighborhood density as a means of reducing or eliminating this correlation with word length. Computational analyses of the words in a large database and reanalysis of archival data supported the use of z scores for the analysis of phonotactic probability as a continuous variable and the use of median transformation scores for the analysis of phonotactic probability as a dichotomous variable. Neighborhood density results were less clear with the conclusion that analysis of neighborhood density as a continuous variable warrants further investigation to differentiate the utility of z scores in comparison to median transformation scores. Furthermore, balanced dichotomous coding of neighborhood density was difficult to achieve, suggesting that analysis of neighborhood density as a dichotomous variable should be approached with caution. Recommendations for future application and analyses are discussed.
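The two transformations named in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the formulas, so the following rests on assumptions: the z score is taken relative to the mean and standard deviation of all words of the same length, the median transformation score is the raw value minus the median for words of the same length, and the dichotomous code is "high" when the raw value exceeds that length-specific median. The function name `length_adjusted_scores`, the input layout, and the sample values are hypothetical and are not data from the study.

```python
from collections import defaultdict
from statistics import mean, median, pstdev


def length_adjusted_scores(words):
    """Compute length-adjusted scores for (word, length, raw_value) tuples.

    Assumption: each word is compared only against words of the same
    length (e.g. number of phonemes), which is what is meant to remove
    the correlation between the raw measure and word length.
    """
    # Group raw values by word length.
    by_length = defaultdict(list)
    for _, length, value in words:
        by_length[length].append(value)

    # Per-length reference statistics (population SD, an assumption).
    stats = {
        length: (mean(values), pstdev(values), median(values))
        for length, values in by_length.items()
    }

    scores = {}
    for word, length, value in words:
        group_mean, group_sd, group_median = stats[length]
        # Guard against a zero SD when a length group has no spread.
        z = (value - group_mean) / group_sd if group_sd > 0 else 0.0
        scores[word] = {
            "z": z,                                # continuous analysis
            "median_diff": value - group_median,   # median transformation
            "high": value > group_median,          # dichotomous coding
        }
    return scores


# Hypothetical raw phonotactic probability values, for illustration only.
sample = [
    ("cat", 3, 0.012),
    ("dog", 3, 0.008),
    ("sun", 3, 0.010),
    ("table", 5, 0.020),
    ("paper", 5, 0.030),
]
adjusted = length_adjusted_scores(sample)
```

Because every score is expressed relative to words of the same length, a long word and a short word with the same adjusted score sit at comparable positions within their own length groups. Note that the strict `>` comparison leaves words at the median coded as "low", which is one way a dichotomous split can become unbalanced.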

