• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯程序归纳综合人类语言理论。

Synthesizing theories of human language with Bayesian program induction.

机构信息

Department of Computer Science, Cornell University, Ithaca, NY, 14850, USA.

Department of Linguistics, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.

出版信息

Nat Commun. 2022 Aug 30;13(1):5024. doi: 10.1038/s41467-022-32012-w.

DOI:10.1038/s41467-022-32012-w
PMID:36042196
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9427767/
Abstract

Automated, data-driven construction and evaluation of scientific models and theories is a long-standing challenge in artificial intelligence. We present a framework for algorithmically synthesizing models of a basic part of human language: morpho-phonology, the system that builds word forms from sounds. We integrate Bayesian inference with program synthesis and representations inspired by linguistic theory and cognitive models of learning and discovery. Across 70 datasets from 58 diverse languages, our system synthesizes human-interpretable models for core aspects of each language's morpho-phonology, sometimes approaching models posited by human linguists. Joint inference across all 70 data sets automatically synthesizes a meta-model encoding interpretable cross-language typological tendencies. Finally, the same algorithm captures few-shot learning dynamics, acquiring new morphophonological rules from just one or a few examples. These results suggest routes to more powerful machine-enabled discovery of interpretable models in linguistics and other scientific domains.

摘要

自动化、数据驱动的科学模型和理论构建与评估是人工智能领域的一个长期挑战。我们提出了一个框架,用于从算法上合成人类语言基本部分的模型:形态音系学,即从声音构建单词形式的系统。我们将贝叶斯推理与程序综合以及受语言学理论和学习与发现的认知模型启发的表示相结合。在来自 58 种不同语言的 70 个数据集上,我们的系统为每种语言的形态音系学的核心方面综合了人类可解释的模型,有时接近人类语言学家提出的模型。对所有 70 个数据集的联合推断自动综合了一个元模型,该模型编码了可解释的跨语言类型学倾向。最后,同一个算法捕获了少样本学习动态,仅从一个或几个示例中获取新的形态音系规则。这些结果表明,在语言学和其他科学领域,通过更强大的机器能够发现可解释的模型的途径。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/e60607e4a09c/41467_2022_32012_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/36b03e4a0eba/41467_2022_32012_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/92c27a666353/41467_2022_32012_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/b1db0d397941/41467_2022_32012_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/0c84719f8609/41467_2022_32012_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/d4bf4014d74b/41467_2022_32012_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/8c988eabc3a1/41467_2022_32012_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/3573ede327d5/41467_2022_32012_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/e60607e4a09c/41467_2022_32012_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/36b03e4a0eba/41467_2022_32012_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/92c27a666353/41467_2022_32012_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/b1db0d397941/41467_2022_32012_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/0c84719f8609/41467_2022_32012_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/d4bf4014d74b/41467_2022_32012_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/8c988eabc3a1/41467_2022_32012_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/3573ede327d5/41467_2022_32012_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8d6/9427767/e60607e4a09c/41467_2022_32012_Fig8_HTML.jpg

相似文献

1
Synthesizing theories of human language with Bayesian program induction.贝叶斯程序归纳综合人类语言理论。
Nat Commun. 2022 Aug 30;13(1):5024. doi: 10.1038/s41467-022-32012-w.
2
Cognitive biases, linguistic universals, and constraint-based grammar learning.认知偏差、语言共性与基于约束的语法学习。
Top Cogn Sci. 2013 Jul;5(3):392-424. doi: 10.1111/tops.12027. Epub 2013 May 23.
3
A Bayesian model of biases in artificial language learning: the case of a word-order universal.人工语言学习中的偏差的贝叶斯模型:词序普遍性的案例。
Cogn Sci. 2012 Nov-Dec;36(8):1468-98. doi: 10.1111/j.1551-6709.2012.01264.x. Epub 2012 Sep 10.
4
Cumulative cultural evolution, population structure and the origin of combinatoriality in human language.累积文化进化、人口结构与人类语言组合性的起源。
Philos Trans R Soc Lond B Biol Sci. 2022 Jan 31;377(1843):20200319. doi: 10.1098/rstb.2020.0319. Epub 2021 Dec 13.
5
Quantitative standards for absolute linguistic universals.绝对语言共性的量化标准。
Cogn Sci. 2014 May-Jun;38(4):736-56. doi: 10.1111/cogs.12088. Epub 2013 Oct 3.
6
Learning biases predict a word order universal.学习偏见预测了一种语序普遍性。
Cognition. 2012 Mar;122(3):306-29. doi: 10.1016/j.cognition.2011.10.017. Epub 2011 Dec 28.
7
The Emotion Probe: On the Universality of Cross-Linguistic and Cross-Gender Speech Emotion Recognition via Machine Learning.情感探针:通过机器学习实现跨语言和跨性别言语情感识别的普遍性研究。
Sensors (Basel). 2022 Mar 23;22(7):2461. doi: 10.3390/s22072461.
8
Linking norms, ratings, and relations of words and concepts across multiple language varieties.跨多种语言变体连接词和概念的规范、评级和关系。
Behav Res Methods. 2022 Apr;54(2):864-884. doi: 10.3758/s13428-021-01650-1. Epub 2021 Aug 6.
9
Large Language Models: A Historical and Sociocultural Perspective.大型语言模型:历史与社会文化视角。
Cogn Sci. 2024 Mar;48(3):e13430. doi: 10.1111/cogs.13430.
10
Analyzing the rate at which languages lose the influence of a common ancestor.分析语言失去共同祖先影响力的速度。
Cogn Sci. 2014 Sep-Oct;38(7):1406-31. doi: 10.1111/cogs.12112. Epub 2014 Mar 19.

引用本文的文献

1
Symbolic metaprogram search improves learning efficiency and explains rule learning in humans.符号元编程搜索提高了学习效率,并解释了人类的规则学习。
Nat Commun. 2024 Aug 10;15(1):6847. doi: 10.1038/s41467-024-50966-x.
2
Systematic testing of three Language Models reveals low language accuracy, absence of response stability, and a yes-response bias.系统测试三种语言模型发现,它们语言准确性低,缺乏响应稳定性,并且存在肯定回答偏见。
Proc Natl Acad Sci U S A. 2023 Dec 19;120(51):e2309583120. doi: 10.1073/pnas.2309583120. Epub 2023 Dec 13.
3
Emotion prediction as computation over a generative theory of mind.

本文引用的文献

1
Cognitive science in the era of artificial intelligence: A roadmap for reverse-engineering the infant language-learner.人工智能时代的认知科学:逆向工程婴儿语言学习者的路线图。
Cognition. 2018 Apr;173:43-59. doi: 10.1016/j.cognition.2017.11.008. Epub 2018 Jan 8.
2
Human-level concept learning through probabilistic program induction.通过概率编程归纳实现人类水平的概念学习。
Science. 2015 Dec 11;350(6266):1332-8. doi: 10.1126/science.aab3050.
3
Statistical learning: From acquiring specific items to forming general rules.统计学习:从获取特定项目到形成一般规则。
情绪预测作为一种生成心智理论的计算。
Philos Trans A Math Phys Eng Sci. 2023 Jul 24;381(2251):20220047. doi: 10.1098/rsta.2022.0047. Epub 2023 Jun 5.
Curr Dir Psychol Sci. 2012 Jun 1;21(3):170-176. doi: 10.1177/0963721412436806.
4
Learning and long-term retention of large-scale artificial languages.大规模人工语言的学习和长期保持。
PLoS One. 2013;8(1):e52500. doi: 10.1371/journal.pone.0052500. Epub 2013 Jan 2.
5
The origins of inquiry: inductive inference and exploration in early childhood.探究的起源:儿童早期的归纳推理和探索。
Trends Cogn Sci. 2012 Jul;16(7):382-9. doi: 10.1016/j.tics.2012.06.004. Epub 2012 Jun 19.
6
Kinship categories across languages reflect general communicative principles.语言中的亲属类别反映了一般的交际原则。
Science. 2012 May 25;336(6084):1049-54. doi: 10.1126/science.1218811.
7
How to grow a mind: statistics, structure, and abstraction.如何培养思维:统计、结构与抽象。
Science. 2011 Mar 11;331(6022):1279-85. doi: 10.1126/science.1192788.
8
The learnability of abstract syntactic principles.抽象句法原则的可学性。
Cognition. 2011 Mar;118(3):306-38. doi: 10.1016/j.cognition.2010.11.001. Epub 2010 Dec 24.
9
Three ideal observer models for rule learning in simple languages.三种用于简单语言规则学习的理想观察者模型。
Cognition. 2011 Sep;120(3):360-71. doi: 10.1016/j.cognition.2010.10.005. Epub 2010 Dec 4.
10
Infants use rational decision criteria for choosing among models of their input.婴儿在选择输入的模型时会使用理性决策标准。
Cognition. 2010 May;115(2):362-6. doi: 10.1016/j.cognition.2010.01.006. Epub 2010 Feb 9.