• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

形态句法的统计特征:对匈牙利语和意大利语婴儿导向语的研究。

The statistical signature of morphosyntax: a study of Hungarian and Italian infant-directed speech.

机构信息

CNRS, Paris, France.

出版信息

Cognition. 2012 Nov;125(2):263-87. doi: 10.1016/j.cognition.2012.06.010. Epub 2012 Aug 6.

DOI:10.1016/j.cognition.2012.06.010
PMID:22874070
Abstract

Does statistical learning (Saffran, Aslin, & Newport, 1996) offer a universal segmentation strategy for young language learners? Previous studies on large corpora of English and structurally similar languages have shown that statistical segmentation can be an effective strategy. However, many of the world's languages have richer morphological systems, with sometimes several affixes attached to a stem (e.g. Hungarian: iskoláinkban: iskolá-i-nk-ban school.pl.poss1pl.inessive 'in our schools'). In these languages, word boundaries and morpheme boundaries do not coincide. Does the internal structure of words affect segmentation? What word forms does segmentation yield in morphologically rich languages: complex word forms or separate stems and affixes? The present paper answers these questions by exploring different segmentation algorithms in infant-directed speech corpora from two typologically and structurally different languages, Hungarian and Italian. The results suggest that the morphological and syntactic type of a language has an impact on statistical segmentation, with different strategies working best in different languages. Specifically, the direction of segmentation seems to be sensitive to the affixation order of a language. Thus, backward probabilities are more effective in Hungarian, a heavily suffixing language, whereas forward probabilities are more informative in Italian, which has fewer suffixes and a large number of phrase-initial function words. The consequences of these findings for potential segmentation and word learning strategies are discussed.

摘要

统计学习(Saffran、Aslin 和 Newport,1996)是否为年轻的语言学习者提供了一种通用的分割策略?之前对大量英语和结构相似的语言的研究表明,统计分割可以是一种有效的策略。然而,世界上许多语言的形态系统更加丰富,有时一个词干上会有几个词缀(例如,匈牙利语:iskoláinkban:iskolá-i-nk-ban,意为“在我们的学校里”)。在这些语言中,词界和语素界并不重合。词的内部结构是否会影响分割?在形态丰富的语言中,分割会产生什么样的词形:复杂的词形还是独立的词干和词缀?本文通过探索来自两种类型学和结构不同的语言(匈牙利语和意大利语)的婴儿导向语音语料库中的不同分割算法,回答了这些问题。结果表明,语言的形态和句法类型对统计分割有影响,不同的策略在不同的语言中效果最佳。具体来说,分割的方向似乎对语言的词缀顺序敏感。因此,在后缀丰富的匈牙利语中,后向概率更有效,而在后缀较少且有大量短语起始功能词的意大利语中,前向概率更具信息量。这些发现对潜在的分割和单词学习策略的影响将在讨论中进行探讨。

相似文献

1
The statistical signature of morphosyntax: a study of Hungarian and Italian infant-directed speech.形态句法的统计特征:对匈牙利语和意大利语婴儿导向语的研究。
Cognition. 2012 Nov;125(2):263-87. doi: 10.1016/j.cognition.2012.06.010. Epub 2012 Aug 6.
2
Word segmentation with universal prosodic cues.基于通用韵律线索的分词。
Cogn Psychol. 2010 Sep;61(2):177-99. doi: 10.1016/j.cogpsych.2010.05.001. Epub 2010 Jun 22.
3
Does morphological complexity affect word segmentation? Evidence from computational modeling.形态复杂度是否影响分词?来自计算建模的证据。
Cognition. 2022 Mar;220:104960. doi: 10.1016/j.cognition.2021.104960. Epub 2021 Dec 14.
4
A Bayesian framework for word segmentation: exploring the effects of context.一种用于分词的贝叶斯框架:探索上下文的影响。
Cognition. 2009 Jul;112(1):21-54. doi: 10.1016/j.cognition.2009.03.008. Epub 2009 May 5.
5
Harmonic cues for speech segmentation: a cross-linguistic corpus study on child-directed speech.言语分段的和声线索:针对儿童言语的跨语言语料库研究。
J Child Lang. 2014 Mar;41(2):439-61. doi: 10.1017/S0305000912000724. Epub 2013 Feb 21.
6
Infant word segmentation revisited: edge alignment facilitates target extraction.再探婴儿单词分割:边缘对齐有助于目标提取。
Dev Sci. 2006 Nov;9(6):565-73. doi: 10.1111/j.1467-7687.2006.00534.x.
7
Words and syllables in fluent speech segmentation by French-learning infants: an ERP study.法语学习婴儿流畅言语分段中的词和音节:一项 ERP 研究。
Brain Res. 2010 May 21;1332:75-89. doi: 10.1016/j.brainres.2010.03.047. Epub 2010 Mar 21.
8
Co-occurrence statistics as a language-dependent cue for speech segmentation.作为语音分割的语言相关线索的共现统计
Dev Sci. 2017 May;20(3). doi: 10.1111/desc.12390. Epub 2016 May 4.
9
Relationships between language structure and language learning: the suffixing preference and grammatical categorization.语言结构与语言学习的关系:后缀偏好与语法分类。
Cogn Sci. 2009 Sep;33(7):1317-29. doi: 10.1111/j.1551-6709.2009.01065.x.
10
British English infants segment words only with exaggerated infant-directed speech stimuli.英国英语环境中的婴儿仅在面对夸张的儿向言语刺激时才会对单词进行切分。
Cognition. 2016 Mar;148:1-9. doi: 10.1016/j.cognition.2015.12.004. Epub 2015 Dec 18.

引用本文的文献

1
Consequences of phonological variation for algorithmic word segmentation.语音变异对算法分词的影响。
Cognition. 2023 Jun;235:105401. doi: 10.1016/j.cognition.2023.105401. Epub 2023 Feb 12.
2
When forgetting fosters learning: A neural network model for statistical learning.遗忘促进学习:统计学习的神经网络模型。
Cognition. 2021 Aug;213:104621. doi: 10.1016/j.cognition.2021.104621. Epub 2021 Feb 17.
3
Evolutionarily conserved neural signatures involved in sequencing predictions and their relevance for language.参与序列预测的进化保守神经特征及其与语言的相关性。
Curr Opin Behav Sci. 2018 Jun;21:145-153. doi: 10.1016/j.cobeha.2018.05.002.