• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CLAD:基于语料库的汉语词汇联想数据库。

CLAD: A corpus-derived Chinese Lexical Association Database.

机构信息

Chinese Language and Technology Center, National Taiwan Normal University, Taipei, Taiwan.

Department of Educational Psychology and Counseling/Chinese Language and Technology Center/Institute for Research Excellence in Learning Sciences, National Taiwan Normal University, Taipei, Taiwan.

出版信息

Behav Res Methods. 2019 Oct;51(5):2310-2336. doi: 10.3758/s13428-019-01208-2.

DOI:10.3758/s13428-019-01208-2
PMID:31429062
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6797702/
Abstract

The application of word associations has become increasingly widespread. However, the association norms produced by traditional free association tests tend not to exceed 10,000 stimulus words, making the number of associated words too small to be representative of the overall language. In this study we used text corpora totaling over 400 million Chinese words, along with a multitude of association measures, to automatically construct a Chinese Lexical Association Database (CLAD) comprising the lexical association of over 80,000 words. Comparison of the CLAD with a database of traditional Chinese word association norms shows that word associations extracted from large text corpora are similar in strength to those elicited from free association tests but contain a much greater number of associative word pairs. Additionally, the relatively small numbers of participants involved in the creation of traditional norms result in relatively coarse scales of association measurement, whereas the differentiation of association strengths is greatly enhanced in the CLAD. The CLAD provides researchers with a great supplement to traditional word association norms. A query website at www.chinesereadability.net/LexicalAssociation/CLAD/ affords access to the database.

摘要

词联想的应用已经越来越广泛。然而,传统的自由联想测试产生的联想规范往往不超过 10000 个刺激词,使得联想词的数量太少,无法代表整体语言。在这项研究中,我们使用了超过 4 亿个中文单词的语料库,以及多种联想测量方法,自动构建了一个包含 80000 多个单词的中文词汇联想数据库(CLAD)。CLAD 与传统的中文词联想规范数据库的比较表明,从大型语料库中提取的词联想在强度上与从自由联想测试中得出的联想相似,但包含了更多的联想词对。此外,传统规范的创建涉及的参与者数量相对较少,导致联想测量的尺度相对较粗,而在 CLAD 中,联想强度的区分度大大增强。CLAD 为研究人员提供了对传统词联想规范的很好补充。一个查询网站 www.chinesereadability.net/LexicalAssociation/CLAD/ 提供了对该数据库的访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/084c2398d3fe/13428_2019_1208_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/7ac1e2b3f8aa/13428_2019_1208_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/4720068b7690/13428_2019_1208_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/2063c3112686/13428_2019_1208_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/de4318198e7b/13428_2019_1208_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/d377b19ef147/13428_2019_1208_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/084c2398d3fe/13428_2019_1208_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/7ac1e2b3f8aa/13428_2019_1208_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/4720068b7690/13428_2019_1208_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/2063c3112686/13428_2019_1208_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/de4318198e7b/13428_2019_1208_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/d377b19ef147/13428_2019_1208_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9501/6797702/084c2398d3fe/13428_2019_1208_Fig6_HTML.jpg

相似文献

1
CLAD: A corpus-derived Chinese Lexical Association Database.CLAD:基于语料库的汉语词汇联想数据库。
Behav Res Methods. 2019 Oct;51(5):2310-2336. doi: 10.3758/s13428-019-01208-2.
2
CCLOWW: A grade-level Chinese children's lexicon of written words.CCLOWW:一个中文儿童书面词汇的年级水平词库。
Behav Res Methods. 2023 Jun;55(4):1874-1889. doi: 10.3758/s13428-022-01890-9. Epub 2022 Jul 1.
3
Chinese lexical database (CLD) : A large-scale lexical database for simplified Mandarin Chinese.中文词汇数据库 (CLD):一个大规模的简体中文词汇数据库。
Behav Res Methods. 2018 Dec;50(6):2606-2629. doi: 10.3758/s13428-018-1038-3.
4
Examining the N400 semantic context effect item-by-item: relationship to corpus-based measures of word co-occurrence.逐项目检查N400语义语境效应:与基于语料库的词共现度量的关系。
Int J Psychophysiol. 2014 Dec;94(3):407-19. doi: 10.1016/j.ijpsycho.2014.10.012. Epub 2014 Nov 4.
5
Word association norms in Mexican Spanish.墨西哥西班牙语中的词汇联想规范。
Span J Psychol. 2014 Dec 19;17:E90. doi: 10.1017/sjp.2014.91.
6
Shabd: A psycholinguistic database for Hindi.Shabd:一个印地语心理语言学数据库。
Behav Res Methods. 2022 Apr;54(2):830-844. doi: 10.3758/s13428-021-01625-2. Epub 2021 Aug 6.
7
Familiarity ratings for 24,325 simplified Chinese words.24325个简体中文字的熟悉度评级
Behav Res Methods. 2023 Apr;55(3):1496-1509. doi: 10.3758/s13428-022-01878-5. Epub 2022 Jun 6.
8
A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks.基于词共现网络复杂网络分析的简体中文与繁体中文差异研究
Comput Intell Neurosci. 2020 Dec 3;2020:8863847. doi: 10.1155/2020/8863847. eCollection 2020.
9
The Chinese Lexicon Project: A megastudy of lexical decision performance for 25,000+ traditional Chinese two-character compound words.《汉语词汇项目:对25000多个繁体中文双字复合词的词汇判断表现的大规模研究》
Behav Res Methods. 2017 Aug;49(4):1503-1519. doi: 10.3758/s13428-016-0810-5.
10
CCLOOW: Chinese children's lexicon of oral words.儿童汉语词汇表
Behav Res Methods. 2024 Feb;56(2):846-859. doi: 10.3758/s13428-023-02077-6. Epub 2023 Mar 7.

引用本文的文献

1
The impact of interpreting students' gestures and speech content on speech fluency of consecutive interpreting.解读学生的手势和言语内容对交替传译言语流畅性的影响。
Front Psychol. 2025 May 23;16:1568341. doi: 10.3389/fpsyg.2025.1568341. eCollection 2025.
2
Weighting Assessment of the Effect of Chinese State-Changing Words on Emotions.汉语状态变化词对情绪影响的加权评估
J Psycholinguist Res. 2023 Dec;52(6):2545-2566. doi: 10.1007/s10936-023-09986-9. Epub 2023 Sep 9.
3
Can Masked Emotion-Laden Words Prime Emotion-Label Words? An ERP Test on the Mediated Account.

本文引用的文献

1
Speaking two "Languages" in America: A semantic space analysis of how presidential candidates and their supporters represent abstract political concepts differently.在美国说两种“语言”:语义空间分析总统候选人及其支持者如何以不同方式表达抽象政治概念。
Behav Res Methods. 2017 Oct;49(5):1668-1685. doi: 10.3758/s13428-017-0931-5.
2
Normative data for Chinese compound remote associate problems.中文联想复合测验常模。
Behav Res Methods. 2017 Dec;49(6):2163-2172. doi: 10.3758/s13428-016-0849-3.
3
The Chinese Lexicon Project: A megastudy of lexical decision performance for 25,000+ traditional Chinese two-character compound words.
带有情感的隐蔽词汇能否启动情感标签词汇?基于中介模型的ERP测试
Front Psychol. 2021 Oct 26;12:721783. doi: 10.3389/fpsyg.2021.721783. eCollection 2021.
4
Exploring Affective Priming Effect of Emotion-Label Words and Emotion-Laden Words: An Event-Related Potential Study.探索情绪标签词和情绪负载词的情感启动效应:一项事件相关电位研究。
Brain Sci. 2021 Apr 27;11(5):553. doi: 10.3390/brainsci11050553.
《汉语词汇项目:对25000多个繁体中文双字复合词的词汇判断表现的大规模研究》
Behav Res Methods. 2017 Aug;49(4):1503-1519. doi: 10.3758/s13428-016-0810-5.
4
Co-occurrence frequency evaluated with large language corpora boosts semantic priming effects.通过大语言语料库评估的共现频率增强了语义启动效应。
Q J Exp Psychol (Hove). 2017 Sep;70(9):1922-1934. doi: 10.1080/17470218.2016.1215479. Epub 2016 Aug 9.
5
CRIE: An automated analyzer for Chinese texts.CRIE:一款用于中文文本的自动分析器。
Behav Res Methods. 2016 Dec;48(4):1238-1251. doi: 10.3758/s13428-015-0649-1.
6
Latent semantic analysis cosines as a cognitive similarity measure: Evidence from priming studies.潜在语义分析余弦作为一种认知相似性度量:来自启动研究的证据。
Q J Exp Psychol (Hove). 2016;69(4):626-53. doi: 10.1080/17470218.2015.1038280. Epub 2015 May 8.
7
Semi-Supervised Text Classification With Universum Learning.基于全集学习的半监督文本分类
IEEE Trans Cybern. 2016 Feb;46(2):462-73. doi: 10.1109/TCYB.2015.2403573. Epub 2015 Feb 27.
8
Constructing and validating readability models: the method of integrating multilevel linguistic features with machine learning.构建和验证可读性模型:将多层次语言特征与机器学习相结合的方法。
Behav Res Methods. 2015 Jun;47(2):340-54. doi: 10.3758/s13428-014-0459-x.
9
Attentional control and asymmetric associative priming.注意控制与不对称联想启动。
J Exp Psychol Learn Mem Cogn. 2014 May;40(3):844-56. doi: 10.1037/a0035781. Epub 2014 Feb 17.
10
Natural language processing in an intelligent writing strategy tutoring system.智能写作策略辅导系统中的自然语言处理。
Behav Res Methods. 2013 Jun;45(2):499-515. doi: 10.3758/s13428-012-0258-1.