
Neural Networks as Cognitive Models of the Processing of Syntactic Constraints.

Authors

Arehalli Suhas, Linzen Tal

Affiliations

Department of Mathematics, Statistics, and Computer Science, Macalester College, Saint Paul, MN, USA.

Department of Linguistics and Center for Data Science, New York University, New York, NY, USA.

Publication

Open Mind (Camb). 2024 May 6;8:558-614. doi: 10.1162/opmi_a_00137. eCollection 2024.

Abstract

Languages are governed by syntactic constraints: structural rules that determine which sentences are grammatical in the language. In English, one such constraint is subject-verb agreement, which dictates that the number of a verb must match the number of its corresponding subject: "the dog runs", but "the dogs run". While this constraint appears to be simple, in practice speakers make agreement errors, particularly when a noun phrase near the verb differs in number from the subject (for example, a speaker might produce the ungrammatical sentence "the key to the cabinets are rusty"). This phenomenon, referred to as agreement attraction, is sensitive to a wide range of properties of the sentence; no single existing model is able to generate predictions for the wide variety of materials studied in the human experimental literature. We explore the viability of neural network language models (broad-coverage systems trained to predict the next word in a corpus) as a framework for addressing this limitation. We analyze the agreement errors made by Long Short-Term Memory (LSTM) networks and compare them to those of humans. The models successfully simulated certain results, such as the so-called number asymmetry and the difference in attraction strength between grammatical and ungrammatical sentences, but failed to simulate others, such as the effect of syntactic distance or notional (conceptual) number. We further evaluate networks trained with explicit syntactic supervision, and find that this form of supervision does not always lead to more human-like syntactic behavior. Finally, we show that the corpus used to train a network significantly affects the pattern of agreement errors produced by the network, and we discuss the strengths and limitations of neural networks as a tool for understanding human syntactic processing.
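The evaluation paradigm the abstract describes (checking whether a language model prefers the correctly agreeing verb form after a preamble such as "the key to the cabinets") can be sketched in a few lines. This is an illustrative toy, not the paper's code: the probability table below is a hand-set stand-in for a trained LSTM's next-word distribution, and the function name is invented for this example.

```python
# Illustrative sketch of how agreement-attraction error rates are
# typically measured with a next-word language model. A real study
# would query a trained LSTM; here a hand-set probability table
# (hypothetical numbers) plays the role of the model's next-word
# distribution after each preamble.
TOY_LM = {
    "the key to the cabinet":  {"is": 0.90, "are": 0.10},  # no attractor
    "the key to the cabinets": {"is": 0.60, "are": 0.40},  # plural attractor
}

def agreement_error_rate(preamble: str, correct: str, incorrect: str) -> float:
    """Probability of the incorrectly agreeing verb, normalized over
    the two candidate verb forms. A higher value means the model is
    more likely to produce an agreement error after this preamble."""
    probs = TOY_LM[preamble]
    return probs[incorrect] / (probs[correct] + probs[incorrect])

no_attractor = agreement_error_rate("the key to the cabinet", "is", "are")
attractor = agreement_error_rate("the key to the cabinets", "is", "are")
# Agreement attraction: more errors when a nearby noun mismatches the subject.
assert attractor > no_attractor
print(f"error rate without attractor: {no_attractor:.2f}")
print(f"error rate with attractor:    {attractor:.2f}")
```

In the human experimental literature the analogous measure is the rate at which speakers produce the wrong verb form when completing such preambles, which is what makes model and human error patterns directly comparable.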


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e644/11093404/f17afc230526/opmi-08-558-g001.jpg
