• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

马来语词汇项目:包含 9592 个单词的词汇统计数据库。

The Malay Lexicon Project: a database of lexical statistics for 9,592 words.

机构信息

Department of Psychology, Faculty of Arts and Social Sciences, National University of Singapore, Republic of Singapore.

出版信息

Behav Res Methods. 2010 Nov;42(4):992-1003. doi: 10.3758/BRM.42.4.992.

DOI:10.3758/BRM.42.4.992
PMID:21139166
Abstract

Malay, a language spoken by 250 million people, has a shallow alphabetic orthography, simple syllable structures, and transparent affixation--characteristics that contrast sharply with those of English. In the present article, we first compare the letter-phoneme and letter-syllable ratios for a sample of alphabetic orthographies to highlight the importance of separating language-specific from language-universal reading processes. Then, in order to develop a better understanding of word recognition in orthographies with more consistent mappings to phonology than English, we compiled a database of lexical variables (letter length, syllable length, phoneme length, morpheme length, word frequency, orthographic and phonological neighborhood sizes, and orthographic and phonological Levenshtein distances) for 9,592 Malay words. Separate hierarchical regression analyses for Malay and English revealed how the consistency of orthography-phonology mappings selectively modulates the effects of different lexical variables on lexical decision and speeded pronunciation performance. The database of lexical and behavioral measures for Malay is available at http://brm.psychonomic-journals.org/content/supplemental.

摘要

马来语是一种有 2.5 亿人使用的语言,它的字母表拼写法较浅,音节结构简单,并且有明显的附加成分——这些特点与英语形成了鲜明的对比。在本文中,我们首先比较了一些字母表拼写法的字母-音素和字母-音节比率,以突出将语言特定和语言普遍的阅读过程分开的重要性。然后,为了更好地理解与英语相比具有更一致的语音映射的拼字法中的单词识别,我们为 9592 个马来语单词编制了一个词汇变量(字母长度、音节长度、音素长度、词素长度、单词频率、正字法和语音邻域大小以及正字法和语音 Levenshtein 距离)数据库。针对马来语和英语的单独分层回归分析揭示了拼字法-语音映射的一致性如何选择性地调节不同词汇变量对词汇判断和快速发音性能的影响。马来语的词汇和行为测量数据库可在 http://brm.psychonomic-journals.org/content/supplemental 上获取。

相似文献

1
The Malay Lexicon Project: a database of lexical statistics for 9,592 words.马来语词汇项目:包含 9592 个单词的词汇统计数据库。
Behav Res Methods. 2010 Nov;42(4):992-1003. doi: 10.3758/BRM.42.4.992.
2
The bigram trough hypothesis and the syllable number effect in lexical decision.词汇判断中的双字母低谷假说与音节数量效应
Q J Exp Psychol (Hove). 2012;65(11):2221-30. doi: 10.1080/17470218.2012.697176. Epub 2012 Jul 17.
3
Database of word-level statistics for Mandarin Chinese (DoWLS-MAN).汉语词级统计数据库(DoWLS-MAN)。
Behav Res Methods. 2022 Apr;54(2):987-1009. doi: 10.3758/s13428-021-01620-7. Epub 2021 Aug 17.
4
The French Lexicon Project: lexical decision data for 38,840 French words and 38,840 pseudowords.法语词汇项目:38840 个法语单词和 38840 个伪词的词汇判断数据。
Behav Res Methods. 2010 May;42(2):488-96. doi: 10.3758/BRM.42.2.488.
5
The development of the orthographic consistency effect in speech recognition: from sublexical to lexical involvement.语音识别中拼字一致性效应的发展:从次词汇参与到词汇参与。
Cognition. 2007 Dec;105(3):547-76. doi: 10.1016/j.cognition.2006.12.005. Epub 2007 Jan 23.
6
Procura-PALavras (P-PAL): A Web-based interface for a new European Portuguese lexical database.Procura-PALavras (P-PAL):一个新的欧洲葡萄牙语词汇数据库的网络界面。
Behav Res Methods. 2018 Aug;50(4):1461-1481. doi: 10.3758/s13428-018-1058-z.
7
Time course analyses of orthographic and phonological priming effects during word recognition in a transparent orthography.透明正字法中单词识别过程中拼写和语音启动效应的时间进程分析。
Q J Exp Psychol (Hove). 2014 Oct;67(10):1925-43. doi: 10.1080/17470218.2013.879192. Epub 2014 Mar 3.
8
From sound to meaning: Phonology-to-Semantics mapping in visual word recognition.从声音到意义:视觉单词识别中的音系到语义映射。
Psychon Bull Rev. 2017 Jun;24(3):887-893. doi: 10.3758/s13423-016-1152-0.
9
K-SPAN: A lexical database of Korean surface phonetic forms and phonological neighborhood density statistics.K-SPAN:一个韩语表面语音形式和音韵邻接密度统计的词汇数据库。
Behav Res Methods. 2017 Oct;49(5):1939-1950. doi: 10.3758/s13428-016-0836-8.
10
Chinese lexical database (CLD) : A large-scale lexical database for simplified Mandarin Chinese.中文词汇数据库 (CLD):一个大规模的简体中文词汇数据库。
Behav Res Methods. 2018 Dec;50(6):2606-2629. doi: 10.3758/s13428-018-1038-3.

引用本文的文献

1
Simplified Chinese lexicon project: A lexical decision database with 8105 characters and 4864 pseudocharacters.简体中文字典项目:一个包含8105个汉字和4864个假字的词汇判断数据库。
Behav Res Methods. 2025 Jun 23;57(7):206. doi: 10.3758/s13428-025-02701-7.
2
Kalimah norms: Ratings for 2,467 modern standard Arabic words on two scales.卡里玛规范:2467个现代标准阿拉伯语单词在两个量表上的评分。
Behav Res Methods. 2025 Jun 9;57(7):194. doi: 10.3758/s13428-025-02692-5.
3
Lexical decision times for nouns from the Croatian Psycholinguistic Database.
来自克罗地亚心理语言学数据库的名词的词汇判断时间。
Behav Res Methods. 2025 Apr 25;57(6):156. doi: 10.3758/s13428-025-02676-5.
4
Jiwar: A database and calculator for word neighborhood measures in 40 languages.吉瓦尔:一个包含40种语言的词邻域度量的数据库和计算器。
Behav Res Methods. 2025 Feb 19;57(3):98. doi: 10.3758/s13428-025-02612-7.
5
The Italian Crowdsourcing Project: Visual word recognition times for 130,495 Italian words.意大利众包项目:130495个意大利语单词的视觉单词识别时间
Behav Res Methods. 2024 Dec 28;57(1):26. doi: 10.3758/s13428-024-02548-4.
6
HeLP: The Hebrew Lexicon project.希伯来语词典项目(HeLP)。
Behav Res Methods. 2024 Dec;56(8):8761-8783. doi: 10.3758/s13428-024-02502-4. Epub 2024 Sep 9.
7
The role of individual differences in emotional word recognition: Insights from a large-scale lexical decision study.个体差异在情绪词汇识别中的作用:来自大规模词汇判断研究的启示。
Behav Res Methods. 2024 Dec;56(8):8501-8520. doi: 10.3758/s13428-024-02488-z. Epub 2024 Sep 4.
8
Malay Lexicon Project 3: The impact of orthographic-semantic consistency on lexical decision latencies.马来语词汇项目3:正字法-语义一致性对词汇判断潜伏期的影响。
Q J Exp Psychol (Hove). 2024 Mar 21;78(1):17470218241234668. doi: 10.1177/17470218241234668.
9
LexMAL: A quick and reliable lexical test for Malay speakers.LexMAL:一种快速可靠的马来语使用者词汇测试。
Behav Res Methods. 2024 Aug;56(5):4563-4581. doi: 10.3758/s13428-023-02202-5. Epub 2023 Sep 1.
10
The Chinese Lexicon Project II: A megastudy of speeded naming performance for 25,000+ traditional Chinese two-character words.《汉语词汇项目 II:25000 多个繁体中文字的快速命名表现的巨量研究》。
Behav Res Methods. 2023 Dec;55(8):4382-4402. doi: 10.3758/s13428-022-02022-z. Epub 2022 Nov 28.