• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个高频感知列表。

A high-frequency sense list.

作者信息

Liu Lei, Gong Tongxi, Shi Jianjun, Guo Yi

机构信息

English Department, School of International Studies, Zhengzhou University, Zhengzhou, China.

Institute of Linguistics, Shanghai International Studies University, Shanghai, China.

出版信息

Front Psychol. 2024 Aug 9;15:1430060. doi: 10.3389/fpsyg.2024.1430060. eCollection 2024.

DOI:10.3389/fpsyg.2024.1430060
PMID:39184940
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11341445/
Abstract

A number of high-frequency word lists have been created to help foreign language learners master English vocabulary. These word lists, despite their widespread use, did not take word meaning into consideration. Foreign language learners are unclear on which meanings they should focus on first. To address this issue, we semantically annotated the Corpus of Contemporary American English (COCA) and the British National Corpus (BNC) with high accuracy using a BERT model. From these annotated corpora, we calculated the semantic frequency of different senses and filtered out 5000 senses to create a High-frequency Sense List. Subsequently, we checked the validity of this list and compared it with established influential word lists. This list exhibits three notable characteristics. First, it achieves stable coverage in different corpora. Second, it identifies high-frequency items with greater accuracy. It achieves comparable coverage with lists like GSL, NGSL, and New-GSL but with significantly fewer items. Especially, it includes everyday words that used to fall off high-frequency lists without requiring manual adjustments. Third, it describes clearly which senses are most frequently used and therefore should be focused on by beginning learners. This study represents a pioneering effort in semantic annotation of large corpora and the creation of a word list based on semantic frequency.

摘要

已经创建了许多高频词汇表来帮助外语学习者掌握英语词汇。这些词汇表尽管被广泛使用,但并未考虑词义。外语学习者不清楚他们应该首先关注哪些词义。为了解决这个问题,我们使用BERT模型对当代美国英语语料库(COCA)和英国国家语料库(BNC)进行了高精度的语义标注。从这些标注语料库中,我们计算了不同词义的语义频率,并筛选出5000个词义,创建了一个高频词义列表。随后,我们检查了该列表的有效性,并将其与已有的有影响力的词汇表进行了比较。该列表具有三个显著特点。第一,它在不同语料库中实现了稳定的覆盖范围。第二,它能更准确地识别高频词汇。它与GSL、NGSL和New-GSL等列表的覆盖范围相当,但词汇数量要少得多。特别是,它包含了一些曾经从高频列表中消失的日常词汇,且无需人工调整。第三,它清楚地描述了哪些词义最常用,因此初学者应该重点关注。这项研究代表了在大型语料库语义标注和基于语义频率创建词汇表方面的开创性努力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d076/11341445/c6db85d6a862/fpsyg-15-1430060-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d076/11341445/10fc010079e7/fpsyg-15-1430060-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d076/11341445/65a2ead41395/fpsyg-15-1430060-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d076/11341445/967adeb7b826/fpsyg-15-1430060-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d076/11341445/c6db85d6a862/fpsyg-15-1430060-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d076/11341445/10fc010079e7/fpsyg-15-1430060-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d076/11341445/65a2ead41395/fpsyg-15-1430060-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d076/11341445/967adeb7b826/fpsyg-15-1430060-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d076/11341445/c6db85d6a862/fpsyg-15-1430060-g0004.jpg

相似文献

1
A high-frequency sense list.一个高频感知列表。
Front Psychol. 2024 Aug 9;15:1430060. doi: 10.3389/fpsyg.2024.1430060. eCollection 2024.
2
Using Mobile Devices for Vocabulary Learning Outside the Classroom: Improving the English as Foreign Language Learners' Knowledge of High-Frequency Words.利用移动设备在课外学习词汇:提高外语学习者对高频词汇的掌握程度。
Front Psychol. 2022 Jun 24;13:899885. doi: 10.3389/fpsyg.2022.899885. eCollection 2022.
3
Adapting a receptive vocabulary test for preschool-aged Greek-speaking children.为希腊语学龄前儿童改编接受性词汇测试。
Int J Lang Commun Disord. 2011 Jan-Feb;46(1):95-107. doi: 10.3109/13682821003671486.
4
Developing morphological knowledge with online corpora in an ESL vocabulary classroom.在英语作为第二语言的词汇课堂中利用在线语料库培养形态学知识。
Front Psychol. 2022 Jul 27;13:927636. doi: 10.3389/fpsyg.2022.927636. eCollection 2022.
5
A comparative study of frequency effect on acquisition of grammar and meaning of words between Chinese and foreign learners of English language.英语中外学习者语法习得与词汇意义习得的频率效应比较研究。
Front Psychol. 2023 Jul 25;14:1125483. doi: 10.3389/fpsyg.2023.1125483. eCollection 2023.
6
The role of semantic activation during word recognition in Arabic.语义激活在阿拉伯语单词识别过程中的作用。
Cogn Process. 2019 Aug;20(3):333-337. doi: 10.1007/s10339-019-00915-0. Epub 2019 Mar 20.
7
Semantic Factors Predict the Rate of Lexical Replacement of Content Words.语义因素预测实词的词汇替换率。
PLoS One. 2016 Jan 28;11(1):e0147924. doi: 10.1371/journal.pone.0147924. eCollection 2016.
8
Word Senses as Clusters of Meaning Modulations: A Computational Model of Polysemy.词义作为意义调制的聚类:一词多义的计算模型。
Cogn Sci. 2021 Apr;45(4):e12955. doi: 10.1111/cogs.12955.
9
Word Learning in L2 Chinese: from Perspectives of Learner-Related and Language-Related Factors.二语中文的词汇学习:从学习者相关因素和语言相关因素的视角来看
J Psycholinguist Res. 2021 Jun;50(3):663-680. doi: 10.1007/s10936-020-09740-5. Epub 2020 Oct 23.
10
Creating a list of word alignments from parallel Russian simplification data.从平行的俄语简化数据创建单词对齐列表。
Front Artif Intell. 2022 Sep 12;5:984759. doi: 10.3389/frai.2022.984759. eCollection 2022.