从面向婴儿的语音中进行元音类别的无监督学习。

Unsupervised learning of vowel categories from infant-directed speech.

作者信息

Vallabha Gautam K, McClelland James L, Pons Ferran, Werker Janet F, Amano Shigeaki

机构信息

Department of Psychology, Stanford University, Jordan Hall Building 420, Stanford, CA 94305, USA.

出版信息

Proc Natl Acad Sci U S A. 2007 Aug 14;104(33):13273-8. doi: 10.1073/pnas.0705369104. Epub 2007 Jul 30.

DOI:10.1073/pnas.0705369104

PMID:17664424

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1934922/

Abstract

Infants rapidly learn the sound categories of their native language, even though they do not receive explicit or focused training. Recent research suggests that this learning is due to infants' sensitivity to the distribution of speech sounds and that infant-directed speech contains the distributional information needed to form native-language vowel categories. An algorithm, based on Expectation-Maximization, is presented here for learning the categories from a sequence of vowel tokens without (i) receiving any category information with each vowel token, (ii) knowing in advance the number of categories to learn, or (iii) having access to the entire data ensemble. When exposed to vowel tokens drawn from either English or Japanese infant-directed speech, the algorithm successfully discovered the language-specific vowel categories (/I, i, epsilon, e/ for English, /I, i, e, e/ for Japanese). A nonparametric version of the algorithm, closely related to neural network models based on topographic representation and competitive Hebbian learning, also was able to discover the vowel categories, albeit somewhat less reliably. These results reinforce the proposal that native-language speech categories are acquired through distributional learning and that such learning may be instantiated in a biologically plausible manner.

摘要

婴儿能迅速学习其母语的语音类别，即便他们并未接受明确或有针对性的训练。近期研究表明，这种学习归因于婴儿对语音分布的敏感性，且面向婴儿的言语包含形成母语元音类别的分布信息。本文提出一种基于期望最大化的算法，用于从元音样本序列中学习类别，而无需（i）每个元音样本都附带任何类别信息，（ii）预先知道要学习的类别数量，或（iii）获取整个数据集。当该算法接触从英语或日语面向婴儿的言语中提取的元音样本时，它成功发现了特定语言的元音类别（英语为 /I, i, epsilon, e/，日语为 /I, i, e, e/）。该算法的非参数版本与基于拓扑表示和竞争性赫布学习的神经网络模型密切相关，也能够发现元音类别，尽管可靠性稍低。这些结果强化了以下观点：母语语音类别是通过分布学习获得的，且这种学习可能以生物学上合理的方式实现。

相似文献

Unsupervised learning of vowel categories from infant-directed speech.

Proc Natl Acad Sci U S A. 2007 Aug 14;104(33):13273-8. doi: 10.1073/pnas.0705369104. Epub 2007 Jul 30.

Prosodic exaggeration within infant-directed speech: Consequences for vowel learnability.

J Acoust Soc Am. 2017 May;141(5):3070. doi: 10.1121/1.4982246.

Semantics guide infants' vowel learning: Computational and experimental evidence.

Infant Behav Dev. 2016 May;43:44-57. doi: 10.1016/j.infbeh.2016.01.002. Epub 2016 Apr 28.

Lexical Learning May Contribute to Phonetic Learning in Infants: A Corpus Analysis of Maternal Spanish.

Cogn Sci. 2018 May 21. doi: 10.1111/cogs.12620.

Indexical and linguistic processing by 12-month-olds: Discrimination of speaker, accent and vowel differences.

PLoS One. 2017 May 17;12(5):e0176762. doi: 10.1371/journal.pone.0176762. eCollection 2017.

A cross-language comparison of vowel perception in English-learning and German-learning infants.

J Acoust Soc Am. 1996 Jul;100(1):577-92. doi: 10.1121/1.415884.

Discriminating Non-native Vowels on the Basis of Multimodal, Auditory or Visual Information: Effects on Infants' Looking Patterns and Discrimination.

Front Psychol. 2016 Apr 19;7:525. doi: 10.3389/fpsyg.2016.00525. eCollection 2016.

Early phonetic learning without phonetic categories: Insights from large-scale simulations on realistic input.

Proc Natl Acad Sci U S A. 2021 Feb 9;118(7). doi: 10.1073/pnas.2001844118.

Speech perception in early infancy: perceptual constancy for spectrally dissimilar vowel categories.

J Acoust Soc Am. 1979 Dec;66(6):1668-79. doi: 10.1121/1.383639.

Learning English vowels with different first-language vowel systems II: Auditory training for native Spanish and German speakers.

J Acoust Soc Am. 2009 Aug;126(2):866-77. doi: 10.1121/1.3148196.

引用本文的文献

Prosodic Cues Support Inferences About the Question's Pedagogical Intent.

Open Mind (Camb). 2025 Feb 16;9:340-363. doi: 10.1162/opmi_a_00192. eCollection 2025.

The acquisition of speech categories: Beyond perceptual narrowing, beyond unsupervised learning and beyond infancy.

Lang Cogn Neurosci. 2023;38(4):419-445. doi: 10.1080/23273798.2022.2105367. Epub 2022 Aug 8.

The nature of non-native speech sound representations.

J Acoust Soc Am. 2022 Nov;152(5):3025. doi: 10.1121/10.0015230.

Motor constellation theory: A model of infants' phonological development.

Front Psychol. 2022 Nov 3;13:996894. doi: 10.3389/fpsyg.2022.996894. eCollection 2022.

Naturalistic speech supports distributional learning across contexts.

Proc Natl Acad Sci U S A. 2022 Sep 20;119(38):e2123230119. doi: 10.1073/pnas.2123230119. Epub 2022 Sep 12.

Non-sensory Influences on Auditory Learning and Plasticity.

J Assoc Res Otolaryngol. 2022 Apr;23(2):151-166. doi: 10.1007/s10162-022-00837-3. Epub 2022 Mar 2.

Do Infants Really Learn Phonetic Categories?

Open Mind (Camb). 2021 Nov 1;5:113-131. doi: 10.1162/opmi_a_00046. eCollection 2021.

Vocal communication across cultures: theoretical and methodological issues.

Philos Trans R Soc Lond B Biol Sci. 2022 Jan 3;377(1841):20200387. doi: 10.1098/rstb.2020.0387. Epub 2021 Nov 15.

Learning nonnative speech sounds changes local encoding in the adult human cortex.

Proc Natl Acad Sci U S A. 2021 Sep 7;118(36). doi: 10.1073/pnas.2101777118.

Emerging native-similar neural representations underlie non-native speech category learning success.

Neurobiol Lang (Camb). 2021;2(2):280-307. doi: 10.1162/nol_a_00035. Epub 2021 Jun 9.

本文引用的文献

Statistical learning of phonetic categories: insights from a computational approach.

Dev Sci. 2009 Apr;12(3):369-78. doi: 10.1111/j.1467-7687.2009.00822.x.

Statistical phonetic learning in infants: facilitation and feature generalization.

Dev Sci. 2008 Jan;11(1):122-34. doi: 10.1111/j.1467-7687.2007.00653.x.

Success and failure of new speech category learning in adulthood: consequences of learned Hebbian attractors in topographic maps.

Cogn Affect Behav Neurosci. 2007 Mar;7(1):53-73. doi: 10.3758/cabn.7.1.53.

Infant-directed speech supports phonetic category learning in English and Japanese.

Cognition. 2007 Apr;103(1):147-62. doi: 10.1016/j.cognition.2006.03.006. Epub 2006 May 16.

Learning phonetic categories by tracking movements.

Cognition. 2007 Apr;103(1):80-106. doi: 10.1016/j.cognition.2006.03.002. Epub 2006 May 2.

Infants show a facilitation effect for native language phonetic perception between 6 and 12 months.

Dev Sci. 2006 Mar;9(2):F13-F21. doi: 10.1111/j.1467-7687.2006.00468.x.

The dynamics of perceptual learning: an incremental reweighting model.

Psychol Rev. 2005 Oct;112(4):715-43. doi: 10.1037/0033-295X.112.4.715.

Perceptuomotor bias in the imitation of steady-state vowels.

J Acoust Soc Am. 2004 Aug;116(2):1184-97. doi: 10.1121/1.1764832.

Representation of sound categories in auditory cortical maps.

J Speech Lang Hear Res. 2004 Feb;47(1):46-57. doi: 10.1044/1092-4388(2004/005).

A new model of sensorimotor coupling in the development of speech.

Brain Lang. 2004 May;89(2):393-400. doi: 10.1016/S0093-934X(03)00345-6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从面向婴儿的语音中进行元音类别的无监督学习。

Unsupervised learning of vowel categories from infant-directed speech.

作者信息

Vallabha Gautam K, McClelland James L, Pons Ferran, Werker Janet F, Amano Shigeaki

机构信息

Department of Psychology, Stanford University, Jordan Hall Building 420, Stanford, CA 94305, USA.

出版信息

Proc Natl Acad Sci U S A. 2007 Aug 14;104(33):13273-8. doi: 10.1073/pnas.0705369104. Epub 2007 Jul 30.

DOI:10.1073/pnas.0705369104

PMID:17664424

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1934922/

Abstract

摘要

从面向婴儿的语音中进行元音类别的无监督学习。

Unsupervised learning of vowel categories from infant-directed speech.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

从面向婴儿的语音中进行元音类别的无监督学习。

Unsupervised learning of vowel categories from infant-directed speech.

作者信息

机构信息

出版信息