Ha Hung Tan
School of Foreign Languages, University of Economics Ho Chi Minh City (UEH), Ho Chi Minh City, Vietnam.
Front Psychol. 2022 Feb 24;13:800983. doi: 10.3389/fpsyg.2022.800983. eCollection 2022.
The present study analyzed the vocabulary profile of the News on the Web (NOW) corpus, which contained 12 billion words from online newspapers and magazines in 20 countries to determine the vocabulary knowledge needed to reasonably understand online newspaper and magazine articles. The results showed that, in general, knowledge of the most frequent 4,000 word families in the British National Corpus/Corpus of Contemporary American English (BNC/COCA) wordlist plus proper nouns, marginal words, transparent compounds and acronyms was necessary to gain 95% coverage for the NOW corpus. However, when it came to the 98% coverage, online newspaper and magazine articles from different countries had relatively distinct lexical demands. In-depth analyses were carried out and the findings offered comprehensive insights into the issue. Implications for teaching and learning were also provided.
本研究分析了网络新闻(NOW)语料库的词汇概况,该语料库包含来自20个国家的在线报纸和杂志的120亿个单词,以确定合理理解在线报纸和杂志文章所需的词汇知识。结果表明,一般来说,要在NOW语料库中达到95%的覆盖率,需要掌握英国国家语料库/当代美国英语语料库(BNC/COCA)词表中最常用的4000个词族以及专有名词、边缘词、透明复合词和首字母缩略词。然而,当覆盖率达到98%时,不同国家的在线报纸和杂志文章有相对不同的词汇需求。进行了深入分析,研究结果为该问题提供了全面的见解。还提供了对教学的启示。