基于160万个美式英语单词的口语词汇频率统计。

Spoken word frequency counts based on 1.6 million words in American English.

作者信息

Pastizzo Matrhew J, Carbone Robert F

机构信息

Psychology Department, State University of New York, Geneseo 14454, USA.

出版信息

Behav Res Methods. 2007 Nov;39(4):1025-8. doi: 10.3758/bf03193000.

DOI:10.3758/bf03193000

PMID:18183922

Abstract

Written word frequency (e.g., Francis & Ku6era, 1982; Kucera & Francis, 1967) constitutes apopular measure of word familiarity, which is highly predictive of word recognition. Far less often, researchers employ spoken frequency counts in their studies. This discrepancy can be attributed most readily to the conspicuous absence of a sizeable spoken frequency count for American English. The present article reports the construction of a 1.6-million-word spoken frequency database derived from the Michigan Corpus of Academic Spoken English (Simpson, Swales, & Briggs, 2002). We generated spoken frequency counts for 34,922 words and extracted speaker attributes from the source material to generate relative frequencies of words spoken by each speaker category. We assessthe predictive validity of these counts, and discuss some possible applications outside of word recognition studies.

摘要

书面词频（例如，弗朗西斯和库泽拉，1982年；库泽拉和弗朗西斯，1967年）是衡量单词熟悉度的常用指标，它对单词识别具有高度预测性。研究人员在其研究中使用口语词频计数的情况则要少得多。这种差异最容易归因于美国英语缺乏大量的口语词频计数。本文报告了一个基于密歇根学术英语口语语料库（辛普森、斯韦尔斯和布里格斯，2002年）构建的160万词口语词频数据库。我们生成了34922个单词的口语词频计数，并从源材料中提取了说话者属性，以生成每个说话者类别所说单词的相对频率。我们评估了这些计数的预测效度，并讨论了单词识别研究之外的一些可能应用。

相似文献

Spoken word frequency counts based on 1.6 million words in American English.

Behav Res Methods. 2007 Nov;39(4):1025-8. doi: 10.3758/bf03193000.

Moving beyond Kucera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English.

Behav Res Methods. 2009 Nov;41(4):977-90. doi: 10.3758/BRM.41.4.977.

A preliminary study of subjective frequency estimates of words spoken in Cantonese.

Psychol Rep. 2001 Jun;88(3 Pt 2):1253-8. doi: 10.2466/pr0.2001.88.3c.1253.

Do the effects of subjective frequency and age of acquisition survive better word frequency norms?

Q J Exp Psychol (Hove). 2011 Mar;64(3):545-59. doi: 10.1080/17470218.2010.503374. Epub 2010 Aug 9.

A database of 629 English compound words: ratings of familiarity, lexeme meaning dominance, semantic transparency, age of acquisition, imageability, and sensory experience.

Behav Res Methods. 2015 Dec;47(4):1004-1019. doi: 10.3758/s13428-014-0523-6.

Time course of Chinese monosyllabic spoken word recognition: evidence from ERP analyses.

Neuropsychologia. 2011 Jun;49(7):1761-70. doi: 10.1016/j.neuropsychologia.2011.02.054. Epub 2011 Mar 4.

Children's early reading vocabulary: description and word frequency lists.

Br J Educ Psychol. 2003 Dec;73(Pt 4):585-98. doi: 10.1348/000709903322591253.

The word frequency effect: a review of recent developments and implications for the choice of frequency estimates in German.

Exp Psychol. 2011;58(5):412-24. doi: 10.1027/1618-3169/a000123.

Early L2 Spoken Word Recognition Combines Input-Based and Knowledge-Based Processing.

Lang Speech. 2018 Dec;61(4):632-656. doi: 10.1177/0023830918761762. Epub 2018 Mar 21.

Type-based bigram frequencies for five-letter words.

Behav Res Methods Instrum Comput. 2004 Aug;36(3):397-401. doi: 10.3758/bf03195587.

引用本文的文献

Effect of Working Memory Load and Typicality on Semantic Processing in Aphasia.

Am J Speech Lang Pathol. 2022 Jan 18;31(1):12-29. doi: 10.1044/2021_AJSLP-20-00283. Epub 2021 Jun 17.

Maintenance Versus Transmission Deficits: The Effect of Delay on Naming Performance in Aphasia.

Front Hum Neurosci. 2019 Nov 27;13:406. doi: 10.3389/fnhum.2019.00406. eCollection 2019.

Evaluating the Contribution of Executive Functions to Language Tasks in Cognitively Demanding Contexts.

Am J Speech Lang Pathol. 2020 Feb 21;29(1S):463-473. doi: 10.1044/2019_AJSLP-CAC48-18-0216. Epub 2019 Sep 13.

Assessment of linguistic and verbal short-term memory components of language abilities in aphasia.

J Neurolinguistics. 2018 Nov;48:199-225. doi: 10.1016/j.jneuroling.2018.02.006.

Effects of syntactic and semantic argument structure on sentence repetition in agrammatism: Things we can learn from particles and prepositions.

Aphasiology. 2011;25(6-7):736-747. doi: 10.1080/02687038.2010.537348. Epub 2011 Jan 10.

Subjective frequency ratings for 432 ASL signs.

Behav Res Methods. 2014 Jun;46(2):526-39. doi: 10.3758/s13428-013-0370-x.

Remediation of language processing in aphasia: Improving activation and maintenance of linguistic representations in (verbal) short-term memory.

Aphasiology. 2011 Jan 1;25(10):1095-1131. doi: 10.1080/02687038.2011.577284. Epub 2011 Aug 1.

Word intelligibility and age predict visual cortex activity during word listening.

Cereb Cortex. 2012 Jun;22(6):1360-71. doi: 10.1093/cercor/bhr211. Epub 2011 Aug 22.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于160万个美式英语单词的口语词汇频率统计。

Spoken word frequency counts based on 1.6 million words in American English.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献