Chinese lexical database (CLD): A large-scale lexical database for simplified Mandarin Chinese.

Affiliation

Eberhard Karls Universität Tübingen, Tübingen, Germany.

Publication information

Behav Res Methods. 2018 Dec;50(6):2606-2629. doi: 10.3758/s13428-018-1038-3.

Abstract

We present the Chinese Lexical Database (CLD): a large-scale lexical database for simplified Chinese. The CLD provides a wealth of lexical information for 3913 one-character words, 34,233 two-character words, 7143 three-character words, and 3355 four-character words, and is publicly available through http://www.chineselexicaldatabase.com. For each of the 48,644 words in the CLD, we provide a wide range of categorical predictors, as well as an extensive set of frequency measures, complexity measures, neighborhood density measures, orthography-phonology consistency measures, and information-theoretic measures. We evaluate the explanatory power of the lexical variables in the CLD in the context of experimental data through analyses of lexical decision latencies for one-character, two-character, three-character, and four-character words, as well as word naming latencies for one-character and two-character words. The results of these analyses are discussed.
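
The word counts in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only: the dictionary name and layout are assumptions made for this example and do not reflect the CLD's actual file format or column names. It confirms that the four per-length counts add up to the stated total of 48,644 words and shows each length's share of the database.

```python
# Word counts by word length (in characters), as stated in the abstract.
# The variable name and structure are hypothetical, not the CLD's schema.
counts_by_length = {1: 3913, 2: 34233, 3: 7143, 4: 3355}

# Sum of the four counts; the abstract states 48,644 words in total.
total = sum(counts_by_length.values())

# Proportion of the database taken up by each word length.
shares = {n: count / total for n, count in counts_by_length.items()}

print(f"total words: {total}")
for n, share in shares.items():
    print(f"{n}-character words: {share:.1%}")
```

Two-character words account for roughly 70% of the entries, which is why they dominate the counts above.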


Similar articles

Chinese lexical database (CLD): A large-scale lexical database for simplified Mandarin Chinese.

Behav Res Methods. 2018 Dec;50(6):2606-2629. doi: 10.3758/s13428-018-1038-3.

MELD-SCH: A megastudy of lexical decision in simplified Chinese.

Behav Res Methods. 2018 Oct;50(5):1763-1777. doi: 10.3758/s13428-017-0944-0.

The Malay Lexicon Project: a database of lexical statistics for 9,592 words.

Behav Res Methods. 2010 Nov;42(4):992-1003. doi: 10.3758/BRM.42.4.992.

Familiarity ratings for 24,325 simplified Chinese words.

Behav Res Methods. 2023 Apr;55(3):1496-1509. doi: 10.3758/s13428-022-01878-5. Epub 2022 Jun 6.

The role of lexical variables in the visual recognition of two-character Chinese compound words: A megastudy analysis.

Q J Exp Psychol (Hove). 2018 Sep;71(9):2022-2038. doi: 10.1177/1747021817738965. Epub 2018 Jan 1.

The role of information theory for compound words in Mandarin Chinese and English.

Cognition. 2020 Dec;205:104389. doi: 10.1016/j.cognition.2020.104389. Epub 2020 Jul 31.

CCLOWW: A grade-level Chinese children's lexicon of written words.

Behav Res Methods. 2023 Jun;55(4):1874-1889. doi: 10.3758/s13428-022-01890-9. Epub 2022 Jul 1.

Database of word-level statistics for Mandarin Chinese (DoWLS-MAN).

Behav Res Methods. 2022 Apr;54(2):987-1009. doi: 10.3758/s13428-021-01620-7. Epub 2021 Aug 17.

The Chinese Lexicon Project II: A megastudy of speeded naming performance for 25,000+ traditional Chinese two-character words.

Behav Res Methods. 2023 Dec;55(8):4382-4402. doi: 10.3758/s13428-022-02022-z. Epub 2022 Nov 28.

Neighborhood in Chinese lexicon: A megastudy analysis of lexical decision and naming of two-character Chinese words.

J Exp Psychol Learn Mem Cogn. 2024 Sep;50(9):1489-1515. doi: 10.1037/xlm0001357. Epub 2024 Jul 25.

Cited by

Information-theoretic measures for mapping regularities between orthography and phonology: A comprehensive quantification and validation in the Chinese writing system.

Behav Res Methods. 2025 Jul 25;57(9):232. doi: 10.3758/s13428-025-02721-3.

Chipola: A Chinese Podcast Lexical Database for capturing spoken language nuances and predicting behavioral data.

Behav Res Methods. 2025 May 8;57(6):166. doi: 10.3758/s13428-025-02697-0.

The effects of prediction representations on implicit learning: Evidence from sentence reading and perceptual identification.

Heliyon. 2024 Oct 11;10(21):e39256. doi: 10.1016/j.heliyon.2024.e39256. eCollection 2024 Nov 15.

A normative database of Swahili-Chinese paired associates.

Behav Res Methods. 2025 Jan 3;57(1):40. doi: 10.3758/s13428-024-02531-z.

Number of translations and translation direction in masked translation priming: evidence from unbalanced English-Chinese bilinguals.

Front Psychol. 2024 Nov 27;15:1500750. doi: 10.3389/fpsyg.2024.1500750. eCollection 2024.

Eye movements of children with and without developmental dyslexia in an alphabetic script during alphabetic and logographic tasks.

Sci Rep. 2024 Nov 20;14(1):28796. doi: 10.1038/s41598-024-78894-2.

Effects of Lexical Properties in L2 Chinese Compound Processing: A Multivariate Approach.

J Psycholinguist Res. 2024 May 24;53(4):49. doi: 10.1007/s10936-024-10087-4.

The effect of target detection on memory retrieval.

Atten Percept Psychophys. 2024 Apr;86(3):838-854. doi: 10.3758/s13414-024-02851-4. Epub 2024 Feb 27.

A dataset of behavioral measures on Chinese word production in picture naming.

Sci Data. 2024 Feb 10;11(1):185. doi: 10.1038/s41597-024-03022-8.

Frequency effects in linear discriminative learning.

Front Hum Neurosci. 2024 Jan 8;17:1242720. doi: 10.3389/fnhum.2023.1242720. eCollection 2023.
