Department of Brain & Cognitive Sciences, University of Rochester, Rochester, NY 14627, USA.
Cogn Psychol. 2013 Feb;66(1):30-54. doi: 10.1016/j.cogpsych.2012.09.001. Epub 2012 Oct 23.
A fundamental component of language acquisition involves organizing words into grammatical categories. Previous literature has suggested a number of ways in which this categorization task might be accomplished. Here we ask whether the patterning of the words in a corpus of linguistic input (distributional information) is sufficient, along with a small set of learning biases, to extract these underlying structural categories. In a series of experiments, we show that learners can acquire linguistic form-classes, generalizing from instances of the distributional contexts of individual words in the exposure set to the full range of contexts for all the words in the set. Crucially, we explore how several specific distributional variables enable learners to form a category of lexical items and generalize to novel words, yet also allow for exceptions that maintain lexical specificity. We suggest that learners are sensitive to the contexts of individual words, the overlaps among contexts across words, the non-overlap of contexts (or systematic gaps in information), and the size of the exposure set. We also ask how learners determine the category membership of a new word for which there is very sparse contextual information. We find that, when there are strong category cues and robust category learning of other words, adults readily generalize the distributional properties of the learned category to a new word that shares just one context with the other category members. However, as the distributional cues regarding the category become sparser and contain more consistent gaps, learners show more conservatism in generalizing distributional properties to the novel word. Taken together, these results show that learners are highly systematic in their use of the distributional properties of the input corpus, using them in a principled way to determine when to generalize and when to preserve lexical specificity.