R. Thomas McCoy, Thomas L. Griffiths
Department of Linguistics, Yale University, 370 Temple St, New Haven, CT 06511, USA.
Wu Tsai Institute, Yale University, 100 College St, New Haven, CT 06510, USA.
Nat Commun. 2025 May 20;16(1):4676. doi: 10.1038/s41467-025-59957-y.
Humans can learn languages from remarkably little experience. Developing computational models that explain this ability has been a major challenge in cognitive science. Existing approaches have been successful at explaining how humans generalize rapidly in controlled settings but are usually too restrictive to tractably handle naturalistic data. We show that learning from limited naturalistic data is possible with an approach that bridges the divide between two popular modeling traditions: Bayesian models and neural networks. This approach distills a Bayesian model's inductive biases (the factors that guide generalization) into a neural network that has flexible representations. Like a Bayesian model, the resulting system can learn formal linguistic patterns from limited data. Like a neural network, it can also learn aspects of English syntax from naturally occurring sentences. The model thus provides a single system that can both learn rapidly and handle naturalistic data.
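The abstract's central idea, distilling a Bayesian model's inductive biases into a neural network, is commonly realized by meta-training the network on many small datasets sampled from the Bayesian prior, so that the network's learning comes to reflect that prior. The sketch below is an assumption-laden illustration of this general recipe, not the paper's implementation: it assumes a toy prior over two formal languages, a tiny character-level LSTM, and a Reptile-style meta-update, and all names (sample_language, CharLM, episode_loss) and hyperparameters are hypothetical.

```python
"""Illustrative sketch of distilling a prior into a neural network via
meta-learning. Assumptions (not from the paper): a toy prior over two
formal languages, a small character-level LSTM, a Reptile-style update."""

import copy
import random

import torch
import torch.nn as nn

random.seed(0)
torch.manual_seed(0)

# Character inventory: 'a', 'b', plus an end-of-string marker '$'.
CHARS = ["a", "b", "$"]
CHAR_TO_ID = {c: i for i, c in enumerate(CHARS)}

def sample_language(max_n=4):
    """Toy 'prior' over formal languages: with probability 0.5 the language
    is {a^n b^n}, otherwise it is {(ab)^n}. A stand-in for a real Bayesian
    prior over grammars."""
    if random.random() < 0.5:
        return ["a" * n + "b" * n for n in range(1, max_n + 1)]
    return ["ab" * n for n in range(1, max_n + 1)]

def encode(s):
    """Map a string plus its end marker to a tensor of character ids."""
    return torch.tensor([CHAR_TO_ID[c] for c in s + "$"])

class CharLM(nn.Module):
    """Tiny character-level language model: embedding -> LSTM -> logits."""
    def __init__(self, hidden=32):
        super().__init__()
        self.emb = nn.Embedding(len(CHARS), hidden)
        self.rnn = nn.LSTM(hidden, hidden, batch_first=True)
        self.out = nn.Linear(hidden, len(CHARS))

    def forward(self, ids):
        h, _ = self.rnn(self.emb(ids.unsqueeze(0)))
        return self.out(h.squeeze(0))

def episode_loss(model, strings):
    """Next-character cross-entropy averaged over an episode's strings."""
    loss = torch.tensor(0.0)
    for s in strings:
        ids = encode(s)
        logits = model(ids[:-1])
        loss = loss + nn.functional.cross_entropy(logits, ids[1:])
    return loss / len(strings)

meta_model = CharLM()
META_LR, INNER_LR, INNER_STEPS = 0.1, 0.05, 5

# Reptile-style meta-training: sample a language from the prior, adapt a
# copy of the network to it, then nudge the meta-parameters toward the
# adapted parameters. Over many episodes, the prior is distilled into the
# network's initial weights.
for step in range(200):
    strings = sample_language()
    adapted = copy.deepcopy(meta_model)
    opt = torch.optim.SGD(adapted.parameters(), lr=INNER_LR)
    for _ in range(INNER_STEPS):
        opt.zero_grad()
        episode_loss(adapted, strings).backward()
        opt.step()
    with torch.no_grad():
        for p_meta, p_task in zip(meta_model.parameters(),
                                  adapted.parameters()):
            p_meta += META_LR * (p_task - p_meta)

# A new language sampled from the same prior should now require only a few
# gradient steps to learn, mirroring the rapid learning described above.
test_strings = sample_language()
print("loss before adaptation:", episode_loss(meta_model, test_strings).item())
```

Under these assumptions, adapting the meta-trained network to a handful of strings from a newly sampled language takes only a few gradient steps, which is the sense in which the network behaves like the Bayesian model whose prior it was trained on.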