Chinese lexical database (CLD): A large-scale lexical database for simplified Mandarin Chinese.

Affiliation

Eberhard Karls Universität Tübingen, Tübingen, Germany.

Publication information

Behav Res Methods. 2018 Dec;50(6):2606-2629. doi: 10.3758/s13428-018-1038-3.

Abstract

We present the Chinese Lexical Database (CLD): a large-scale lexical database for simplified Chinese. The CLD provides a wealth of lexical information for 3913 one-character words, 34,233 two-character words, 7143 three-character words, and 3355 four-character words, and is publicly available through http://www.chineselexicaldatabase.com. For each of the 48,644 words in the CLD, we provide a wide range of categorical predictors, as well as an extensive set of frequency measures, complexity measures, neighborhood density measures, orthography-phonology consistency measures, and information-theoretic measures. We evaluate the explanatory power of the lexical variables in the CLD in the context of experimental data through analyses of lexical decision latencies for one-character, two-character, three-character, and four-character words, as well as word naming latencies for one-character and two-character words. The results of these analyses are discussed.

