Vanderbilt University, Nashville, USA.
Georgia State University, Atlanta, USA.
Behav Res Methods. 2024 Sep;56(6):5918-5929. doi: 10.3758/s13428-023-02324-w. Epub 2023 Dec 29.
This study documents and assesses the Tool for Automatic Measurement of Morphological Information (TAMMI), which calculates measures related to basic morpheme counts, morphological variety, morphological complexity, morpheme type-token counts, and variables found in the MorphoLex database (Sánchez-Gutiérrez et al., 2018), including morpheme frequency/length, morpheme family size counts and frequency, and morpheme hapax counts. These measures are assessed in two studies, each of which includes a word frequency measure as a control variable. The first study examined links between morphological variables and judgements of reading ease in a corpus of ~5000 reading excerpts, finding that variables related to derivational variety, word frequency, affix frequency, and morpheme counts explained 40% of the variance in the reading scores. The second study examined links between morphological variables and human assessments of vocabulary proficiency in a corpus of ~7000 essays written by English-language learners (ELLs), finding that the number of morphemes, morpheme variety, and the number of roots explained 21% of the variance in the human assessments.