University of Hawaii at Manoa, Honolulu, HI, USA.
Georgia State University, Atlanta, Georgia.
Behav Res Methods. 2018 Jun;50(3):1030-1046. doi: 10.3758/s13428-017-0924-4.
This study introduces the second release of the Tool for the Automatic Analysis of Lexical Sophistication (TAALES 2.0), a freely available and easy-to-use text analysis tool. TAALES 2.0 is housed on a user's hard drive (allowing for secure data processing) and is available on most operating systems (Windows, Mac, and Linux). TAALES 2.0 adds 316 indices to the original tool. These indices are related to word frequency, word range, n-gram frequency, n-gram range, n-gram strength of association, contextual distinctiveness, word recognition norms, semantic network, and word neighbors. In this study, we validated TAALES 2.0 by investigating whether its indices could be used to model both holistic scores of lexical proficiency in free writes and word choice scores in narrative essays. The results indicated that the TAALES 2.0 indices could be used to explain 58% of the variance in lexical proficiency scores and 32% of the variance in word-choice scores. Newly added TAALES 2.0 indices, including those related to n-gram association strength, word neighborhood, and word recognition norms, featured heavily in these predictor models, suggesting that TAALES 2.0 represents a substantial upgrade.
本研究介绍了词汇复杂度自动分析工具(TAALES)的第二个版本,即 TAALES 2.0。这是一个免费且易于使用的文本分析工具,可在用户的硬盘上运行(允许进行安全的数据处理),并适用于大多数操作系统(Windows、Mac 和 Linux)。TAALES 2.0 在原始工具的基础上增加了 316 个指标。这些指标与词频、词域、n 元组频率、n 元组范围、n 元组关联强度、语境独特性、词识别规范、语义网络和词邻接有关。在本研究中,我们通过研究其指标是否可用于模拟自由写作中的词汇熟练程度综合得分和记叙文的选词得分,来验证 TAALES 2.0。结果表明,TAALES 2.0 指数可用于解释词汇熟练程度得分 58%的方差和选词得分 32%的方差。新增加的 TAALES 2.0 指数,包括与 n 元组关联强度、词邻接和词识别规范相关的指数,在这些预测模型中占据重要地位,表明 TAALES 2.0 是一个重大升级。