• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SUBTLEX-CY:威尔士语新的单词频率数据库。

SUBTLEX-CY: A new word frequency database for Welsh.

机构信息

School of Psychology, University of Nottingham, Nottingham, UK.

School of Psychology, Wrexham Glyndŵr University, Wrexham, UK.

出版信息

Q J Exp Psychol (Hove). 2024 May;77(5):1052-1067. doi: 10.1177/17470218231190315. Epub 2023 Aug 30.

DOI:10.1177/17470218231190315
PMID:37649366
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11032624/
Abstract

We present SUBTLEX-CY, a new word frequency database created from a 32-million-word corpus of Welsh television subtitles. An experiment comprising a lexical decision task examined SUBTLEX-CY frequency estimates against words with inconsistent frequencies in a much smaller Welsh corpus that is often used by researchers, the (CEG), and three other Welsh word frequency databases. Words were selected that were classified as low frequency (LF) in SUBTLEX-CY and high frequency (HF) in CEG and compared with words that were classified as medium frequency (MF) in both SUBTLEX-CY and CEG. Reaction time analyses showed that HF words in CEG were responded to more slowly compared to MF words, suggesting that SUBTLEX-CY corpus provides a more reliable estimate of Welsh word frequencies. The new Welsh word frequency database that also includes part-of-speech, contextual diversity, and other lexical information is freely available for research purposes on the Open Science Framework repository at https://osf.io/9gkqm/.

摘要

我们呈现了 SUBTLEX-CY,这是一个基于 3200 万词威尔士电视字幕语料库创建的新的单词频率数据库。一项包含词汇判断任务的实验,将 SUBTLEX-CY 的频率估计与在一个经常被研究人员使用的较小的威尔士语料库(CEG)中频率不一致的单词进行了比较,该语料库还包括三个其他的威尔士单词频率数据库。我们选择了在 SUBTLEX-CY 中被归类为低频 (LF) 而在 CEG 中被归类为高频 (HF) 的单词,并将其与在 SUBTLEX-CY 和 CEG 中都被归类为中频 (MF) 的单词进行了比较。反应时间分析表明,CEG 中的 HF 单词的反应速度比 MF 单词慢,这表明 SUBTLEX-CY 语料库提供了更可靠的威尔士单词频率估计。这个新的威尔士单词频率数据库还包括词性、语境多样性和其他词汇信息,可在开放科学框架存储库(https://osf.io/9gkqm/)上免费用于研究目的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/11032624/29b4c8f1bd9f/10.1177_17470218231190315-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/11032624/53890ab0bb4b/10.1177_17470218231190315-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/11032624/46ba044c7f21/10.1177_17470218231190315-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/11032624/ab24a2e2fc50/10.1177_17470218231190315-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/11032624/29b4c8f1bd9f/10.1177_17470218231190315-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/11032624/53890ab0bb4b/10.1177_17470218231190315-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/11032624/46ba044c7f21/10.1177_17470218231190315-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/11032624/ab24a2e2fc50/10.1177_17470218231190315-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/11032624/29b4c8f1bd9f/10.1177_17470218231190315-fig4.jpg

相似文献

1
SUBTLEX-CY: A new word frequency database for Welsh.SUBTLEX-CY:威尔士语新的单词频率数据库。
Q J Exp Psychol (Hove). 2024 May;77(5):1052-1067. doi: 10.1177/17470218231190315. Epub 2023 Aug 30.
2
SUBTLEX-UK: a new and improved word frequency database for British English.SUBTLEX-UK:一个全新且经过改进的英式英语词汇频率数据库。
Q J Exp Psychol (Hove). 2014;67(6):1176-90. doi: 10.1080/17470218.2013.850521. Epub 2014 Jan 13.
3
Subtlex-pl: subtitle-based word frequency estimates for Polish.Subtlex-pl:基于波兰语字幕的词频估算
Behav Res Methods. 2015 Jun;47(2):471-83. doi: 10.3758/s13428-014-0489-4.
4
On the advantages of word frequency and contextual diversity measures extracted from subtitles: The case of Portuguese.论从字幕中提取的词频和语境多样性度量的优势:以葡萄牙语为例。
Q J Exp Psychol (Hove). 2015;68(4):680-96. doi: 10.1080/17470218.2014.964271. Epub 2014 Nov 7.
5
SUBTLEX-CAT: Subtitle word frequencies and contextual diversity for Catalan.SUBTLEX-CAT:加泰罗尼亚语字幕词频和上下文多样性。
Behav Res Methods. 2020 Feb;52(1):360-375. doi: 10.3758/s13428-019-01233-1.
6
SUBTLEX-CH: Chinese word and character frequencies based on film subtitles.SUBTLEX-CH:基于电影字幕的中文词频和字频。
PLoS One. 2010 Jun 2;5(6):e10729. doi: 10.1371/journal.pone.0010729.
7
SUBTLEX-NL: a new measure for Dutch word frequency based on film subtitles.SUBTLEX-NL:一种基于电影字幕的新的荷兰语词汇频率衡量标准。
Behav Res Methods. 2010 Aug;42(3):643-50. doi: 10.3758/BRM.42.3.643.
8
Moving beyond Kucera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English.超越库切拉和弗朗西斯:当前词频规范的批判性评估,以及美国英语新的、经过改进的词频衡量标准的引入。
Behav Res Methods. 2009 Nov;41(4):977-90. doi: 10.3758/BRM.41.4.977.
9
Shabd: A psycholinguistic database for Hindi.Shabd:一个印地语心理语言学数据库。
Behav Res Methods. 2022 Apr;54(2):830-844. doi: 10.3758/s13428-021-01625-2. Epub 2021 Aug 6.
10
Adding part-of-speech information to the SUBTLEX-US word frequencies.为 SUBTLEX-US 词频添加词性信息。
Behav Res Methods. 2012 Dec;44(4):991-7. doi: 10.3758/s13428-012-0190-4.

本文引用的文献

1
Disentangling contextual diversity: Communicative need as a lexical organizer.语境多样性的剖析:交际需求作为词汇组织工具。
Psychol Rev. 2021 Apr;128(3):525-557. doi: 10.1037/rev0000265. Epub 2021 Feb 11.
2
LexOPS: An R package and user interface for the controlled generation of word stimuli.LexOPS:一个用于受控生成单词刺激的 R 包和用户界面。
Behav Res Methods. 2020 Dec;52(6):2372-2382. doi: 10.3758/s13428-020-01389-1.
3
Bilinguals apply language-specific grain sizes during sentence reading.双语者在句子阅读过程中使用特定语言的粒度。
Cognition. 2019 Dec;193:104018. doi: 10.1016/j.cognition.2019.104018. Epub 2019 Jul 20.
4
Gorilla in our midst: An online behavioral experiment builder.潜伏在我们中间的大猩猩:一个在线行为实验构建器。
Behav Res Methods. 2020 Feb;52(1):388-407. doi: 10.3758/s13428-019-01237-x.
5
The ERP signature of the contextual diversity effect in visual word recognition.视觉单词识别中情境多样性效应的事件相关电位特征
Cogn Affect Behav Neurosci. 2017 Jun;17(3):461-474. doi: 10.3758/s13415-016-0491-7.
6
The influence of contextual diversity on word learning.语境多样性对词汇学习的影响。
Psychon Bull Rev. 2016 Aug;23(4):1214-20. doi: 10.3758/s13423-015-0980-7.
7
To transform or not to transform: using generalized linear mixed models to analyse reaction time data.转换还是不转换:使用广义线性混合模型分析反应时间数据。
Front Psychol. 2015 Aug 7;6:1171. doi: 10.3389/fpsyg.2015.01171. eCollection 2015.
8
Worldlex: Twitter and blog word frequencies for 66 languages.Worldlex:66种语言的推特和博客词汇频率
Behav Res Methods. 2016 Sep;48(3):963-72. doi: 10.3758/s13428-015-0621-0.
9
SUBTLEX-UK: a new and improved word frequency database for British English.SUBTLEX-UK:一个全新且经过改进的英式英语词汇频率数据库。
Q J Exp Psychol (Hove). 2014;67(6):1176-90. doi: 10.1080/17470218.2013.850521. Epub 2014 Jan 13.
10
Fast modulation of executive function by language context in bilinguals.双语者的语言语境能快速调节执行功能。
J Neurosci. 2013 Aug 14;33(33):13533-7. doi: 10.1523/JNEUROSCI.4760-12.2013.