现在你听到我，之后你听不到：语言计算的即时性和言语的表示。

Now You Hear Me, Later You Don't: The Immediacy of Linguistic Computation and the Representation of Speech.

机构信息

Department of Linguistics, University of Pennsylvania.

Department of Cognitive Science, Johns Hopkins University.

出版信息

Psychol Sci. 2021 Mar;32(3):410-423. doi: 10.1177/0956797620968787. Epub 2021 Feb 22.

DOI:10.1177/0956797620968787

PMID:33617735

Abstract

What happens to an acoustic signal after it enters the mind of a listener? Previous work has demonstrated that listeners maintain intermediate representations over time. However, the internal structure of such representations-be they the acoustic-phonetic signal or more general information about the probability of possible categories-remains underspecified. We present two experiments using a novel speaker-adaptation paradigm aimed at uncovering the format of speech representations. We exposed adult listeners ( = 297) to a speaker whose utterances contained acoustically ambiguous information concerning phones (and thus words), and we manipulated the temporal availability of disambiguating cues via visually presented text (presented before or after each utterance). Results from a traditional phoneme-categorization task showed that listeners adapted to a modified acoustic distribution when disambiguating text was provided before but not after the audio. These results support the position that speech representations consist of activation over categories and are inconsistent with direct maintenance of the acoustic-phonetic signal.

摘要

当一个声学信号进入听众的大脑后会发生什么？先前的研究已经证明，听众会随着时间的推移保持中间表示。然而，这种表示的内部结构——无论是声学语音信号还是关于可能类别概率的更一般信息——仍然没有具体说明。我们提出了两个使用新的说话人适应范式的实验，旨在揭示语音表示的格式。我们让成年听众（n=297）接触一个说话人，其话语包含有关电话（也就是单词）的声学歧义信息，并且我们通过视觉呈现的文本（在每个话语之前或之后呈现）来操纵可消除歧义线索的时间可用性。来自传统的音素分类任务的结果表明，当提供消除歧义的文本时，听众会适应修改后的声学分布，但在音频之后则不会。这些结果支持了这样一种观点，即语音表示由类别上的激活组成，并且与对声学语音信号的直接维持不一致。

相似文献

Now You Hear Me, Later You Don't: The Immediacy of Linguistic Computation and the Representation of Speech.

Psychol Sci. 2021 Mar;32(3):410-423. doi: 10.1177/0956797620968787. Epub 2021 Feb 22.

Distributional learning for speech reflects cumulative exposure to a talker's phonetic distributions.

Psychon Bull Rev. 2019 Jun;26(3):985-992. doi: 10.3758/s13423-018-1551-5.

When speaker identity is unavoidable: Neural processing of speaker identity cues in natural speech.

Brain Lang. 2017 Nov;174:42-49. doi: 10.1016/j.bandl.2017.07.001. Epub 2017 Jul 15.

Discovering phonetic coherence in acoustic patterns.

Percept Psychophys. 1989 Mar;45(3):237-50. doi: 10.3758/bf03210703.

Assessment of Spectral and Temporal Resolution in Cochlear Implant Users Using Psychoacoustic Discrimination and Speech Cue Categorization.

Ear Hear. 2016 Nov/Dec;37(6):e377-e390. doi: 10.1097/AUD.0000000000000328.

Interactions between distal speech rate, linguistic knowledge, and speech environment.

Psychon Bull Rev. 2015 Oct;22(5):1451-7. doi: 10.3758/s13423-015-0820-9.

Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech.

Curr Biol. 2018 Dec 17;28(24):3976-3983.e5. doi: 10.1016/j.cub.2018.10.042. Epub 2018 Nov 29.

Phonetic category activation predicts the direction and magnitude of perceptual adaptation to accented speech.

J Exp Psychol Hum Percept Perform. 2022 Sep;48(9):913-925. doi: 10.1037/xhp0001037. Epub 2022 Jul 18.

What Are You Waiting For? Real-Time Integration of Cues for Fricatives Suggests Encapsulated Auditory Memory.

Cogn Sci. 2019 Jan;43(1). doi: 10.1111/cogs.12700.

Cognitive load does not increase reliance on speaker information in phonetic categorization.

JASA Express Lett. 2022 May;2(5):055203. doi: 10.1121/10.0009895.

引用本文的文献

Word learning as category formation.

PLoS One. 2025 Jul 3;20(7):e0327615. doi: 10.1371/journal.pone.0327615. eCollection 2025.

Maintenance of subcategorical information during speech perception: revisiting misunderstood limitations.

J Mem Lang. 2025 Feb;140. doi: 10.1016/j.jml.2024.104565. Epub 2024 Sep 20.

The acquisition of speech categories: Beyond perceptual narrowing, beyond unsupervised learning and beyond infancy.

Lang Cogn Neurosci. 2023;38(4):419-445. doi: 10.1080/23273798.2022.2105367. Epub 2022 Aug 8.

Probabilistic social learning improves the public's judgments of news veracity.

PLoS One. 2021 Mar 9;16(3):e0247487. doi: 10.1371/journal.pone.0247487. eCollection 2021.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

现在你听到我，之后你听不到：语言计算的即时性和言语的表示。

Now You Hear Me, Later You Don't: The Immediacy of Linguistic Computation and the Representation of Speech.

机构信息

Department of Linguistics, University of Pennsylvania.

Department of Cognitive Science, Johns Hopkins University.

出版信息

Psychol Sci. 2021 Mar;32(3):410-423. doi: 10.1177/0956797620968787. Epub 2021 Feb 22.

DOI:10.1177/0956797620968787

PMID:33617735

Abstract

摘要

现在你听到我，之后你听不到：语言计算的即时性和言语的表示。

Now You Hear Me, Later You Don't: The Immediacy of Linguistic Computation and the Representation of Speech.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

现在你听到我，之后你听不到：语言计算的即时性和言语的表示。

Now You Hear Me, Later You Don't: The Immediacy of Linguistic Computation and the Representation of Speech.

机构信息

出版信息

相似文献

引用本文的文献