• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Zipf 频率分布有助于上下文分词。

Zipfian frequency distributions facilitate word segmentation in context.

机构信息

Department of Linguistics, Stanford University, United States.

出版信息

Cognition. 2013 Jun;127(3):439-53. doi: 10.1016/j.cognition.2013.02.002. Epub 2013 Apr 2.

DOI:10.1016/j.cognition.2013.02.002
PMID:23558340
Abstract

Word frequencies in natural language follow a highly skewed Zipfian distribution, but the consequences of this distribution for language acquisition are only beginning to be understood. Typically, learning experiments that are meant to simulate language acquisition use uniform word frequency distributions. We examine the effects of Zipfian distributions using two artificial language paradigms-a standard forced-choice task and a new orthographic segmentation task in which participants click on the boundaries between words in contexts. Our data show that learners can identify word forms robustly across widely varying frequency distributions. In addition, although performance in recognizing individual words is predicted best by their frequency, a Zipfian distribution facilitates word segmentation in context: the presence of high-frequency words creates more chances for learners to apply their knowledge in processing new sentences. We find that computational models that implement "chunking" are more effective than "transition finding" models at reproducing this pattern of performance.

摘要

自然语言中的词汇频率遵循高度倾斜的齐夫分布,但这种分布对语言习得的影响才刚刚开始被理解。通常,旨在模拟语言习得的学习实验使用均匀的词汇频率分布。我们使用两种人工语言范例——标准的强制选择任务和新的正字法分割任务,来检验齐夫分布的影响,在正字法分割任务中,参与者在上下文点击单词之间的边界。我们的数据表明,学习者可以在广泛变化的频率分布中稳健地识别单词形式。此外,尽管识别单个单词的性能最好由其频率预测,但齐夫分布在上下文中有助于分词:高频词的出现为学习者在处理新句子时应用知识创造了更多机会。我们发现,实现“组块”的计算模型比“转换发现”模型更有效地再现这种性能模式。

相似文献

1
Zipfian frequency distributions facilitate word segmentation in context.Zipf 频率分布有助于上下文分词。
Cognition. 2013 Jun;127(3):439-53. doi: 10.1016/j.cognition.2013.02.002. Epub 2013 Apr 2.
2
The learnability consequences of Zipfian distributions in language.语言中齐夫分布的可学性后果。
Cognition. 2022 Jun;223:105038. doi: 10.1016/j.cognition.2022.105038. Epub 2022 Feb 2.
3
Zipfian distributions facilitate children's learning of novel word-referent mappings.Zipf 分布有助于儿童学习新的单词-指称映射。
Cognition. 2024 Dec;253:105932. doi: 10.1016/j.cognition.2024.105932. Epub 2024 Aug 31.
4
Modeling human performance in statistical word segmentation.统计分词中人类表现的建模。
Cognition. 2010 Nov;117(2):107-25. doi: 10.1016/j.cognition.2010.07.005. Epub 2010 Sep 15.
5
Zipfian Distributions in Child-Directed Speech.儿童指向性言语中的齐普夫分布。
Open Mind (Camb). 2023 Jan 24;7:1-30. doi: 10.1162/opmi_a_00070. eCollection 2023.
6
Word segmentation with universal prosodic cues.基于通用韵律线索的分词。
Cogn Psychol. 2010 Sep;61(2):177-99. doi: 10.1016/j.cogpsych.2010.05.001. Epub 2010 Jun 22.
7
Visual statistical learning is facilitated in Zipfian distributions.视觉统计学习在 Zipf 分布中得到促进。
Cognition. 2021 Jan;206:104492. doi: 10.1016/j.cognition.2020.104492. Epub 2020 Nov 3.
8
Since when or how often? Dissociating the roles of age of acquisition (AoA) and lexical frequency in early visual word processing.何时或多久一次?在早期视觉词汇处理中分离习得年龄(AoA)和词汇频率的作用。
Brain Lang. 2013 Jan;124(1):132-41. doi: 10.1016/j.bandl.2012.11.005. Epub 2013 Jan 11.
9
Cross-situational learning in a Zipfian environment.在 Zipf 环境下的跨情境学习。
Cognition. 2019 Aug;189:11-22. doi: 10.1016/j.cognition.2019.03.005. Epub 2019 Mar 20.
10
The statistical signature of morphosyntax: a study of Hungarian and Italian infant-directed speech.形态句法的统计特征:对匈牙利语和意大利语婴儿导向语的研究。
Cognition. 2012 Nov;125(2):263-87. doi: 10.1016/j.cognition.2012.06.010. Epub 2012 Aug 6.

引用本文的文献

1
Individual differences in distributional statistical learning: Better frequency "discriminators" are better "estimators".分布统计学习中的个体差异:更好的频率“辨别者”是更好的“估计者”。
Q J Exp Psychol (Hove). 2024 Nov 14;78(9):17470218241293235. doi: 10.1177/17470218241293235.
2
Cultural evolution creates the statistical structure of language.文化进化创造了语言的统计结构。
Sci Rep. 2024 Mar 4;14(1):5255. doi: 10.1038/s41598-024-56152-9.
3
A changing role for transitional probabilities in word learning during the transition to toddlerhood?
在向幼儿期过渡期间,过渡概率在单词学习中的作用是否发生了变化?
Dev Psychol. 2024 Mar;60(3):567-581. doi: 10.1037/dev0001641. Epub 2024 Jan 25.
4
Discourse with Few Words: Coherence Statistics, Parent-Infant Actions on Objects, and Object Names.少言话语:连贯统计、母婴对物体的动作及物体名称
Lang Acquis. 2023;30(3-4):211-229. doi: 10.1080/10489223.2022.2054342. Epub 2022 Jul 4.
5
Zipfian Distributions in Child-Directed Speech.儿童指向性言语中的齐普夫分布。
Open Mind (Camb). 2023 Jan 24;7:1-30. doi: 10.1162/opmi_a_00070. eCollection 2023.
6
The infant's view redefines the problem of referential uncertainty in early word learning.婴儿的视角重新定义了早期词汇学习中所指不确定性的问题。
Proc Natl Acad Sci U S A. 2021 Dec 28;118(52). doi: 10.1073/pnas.2107019118.
7
Everyday music in infancy.婴儿期的日常音乐。
Dev Sci. 2021 Nov;24(6):e13122. doi: 10.1111/desc.13122. Epub 2021 Jun 25.
8
When statistics collide: The use of transitional and phonotactic probability cues to word boundaries.当统计学碰撞时:过渡和音位概率线索在词边界上的应用。
Mem Cognit. 2021 Oct;49(7):1300-1310. doi: 10.3758/s13421-021-01163-4. Epub 2021 Mar 9.
9
How do infants start learning object names in a sea of clutter?婴儿如何在纷繁复杂的环境中开始学习物体名称?
Cogsci. 2019 Jul;2019:521-526.
10
Word Segmentation Cues in German Child-Directed Speech: A Corpus Analysis.德语儿童导向言语中的词分割线索:语料库分析。
Lang Speech. 2022 Mar;65(1):3-27. doi: 10.1177/0023830920979016. Epub 2021 Jan 30.