Division of Psychology and Language Sciences, University College London, London, UK.
Department of Psychology, Education College, King Saud University, Riyadh, Saudi Arabia.
Int J Lang Commun Disord. 2024 Mar-Apr;59(2):678-697. doi: 10.1111/1460-6984.12954. Epub 2023 Oct 9.
Non-word repetition (NWR) tests are an important way speech and language therapists (SaLTs) assess language development. NWR tests are often scored whilst participants make their responses (i.e., in real time) in clinical and research reports (documented here via a secondary analysis of a published systematic review).
The main aim was to determine the extent to which real-time coding of NWR stimuli at the whole-item level (as correct/incorrect) was predicted by models that had varying levels of detail provided from phonemic transcriptions using several linear mixed method (LMM) models.
METHODS & PROCEDURES: Live scores and recordings of responses on the universal non-word repetition (UNWR) test were available for 146 children aged between 3 and 6 years where the sample included all children starting in five UK schools in one year or two consecutive years. Transcriptions were made of responses to two-syllable NWR stimuli for all children and these were checked for reliability within and between transcribers. Signal detection analysis showed that consonants were missed when judgments were made live. Statistical comparisons of the discrepancies between target stimuli and transcriptions of children's responses were then made and these were regressed against live score accuracy. Six LMM models (three normalized: 1a, 2a, 3a; and three non-normalized: 1b, 2b, 3b) were examined to identify which model(s) best captured the data variance. Errors on consonants for live scores were determined by comparison with the transcriptions in the following ways (the dependent variables for each pair of models): (1) consonants alone; (2) substitutions, deletions and insertions of consonants identified after automatic alignment of live and transcribed materials; and (3) as with (2) but where substitutions were coded further as place, manner and voicing errors.
OUTCOMES & RESULTS: The normalized model that coded consonants in non-words as 'incorrect' at the level of substitutions, deletions and insertions (2b) provided the best fit to the real-time coding responses in terms of marginal R, Akaike's information criterion (AIC) and Bayesian information criterion (BIC) statistics.
CONCLUSIONS & IMPLICATIONS: Errors that occur on consonants when non-word stimuli are scored in real time are characterized solely by the substitution, deletion and insertion measure. It is important to know that such errors arise when real-time judgments are made because NWR tasks are used to assess and diagnose several cognitive-linguistic impairments. One broader implication of the results is that future work could automate the analysis procedures to provide the required information objectively and quickly without having to transcribe data.
What is already known on this subject Children and patients with a wide range of cognitive and language difficulties are less accurate relative to controls when they attempt to repeat non-words. Responses to non-words are often scored as correct or incorrect at the time the test is conducted. Limited assessments of this scoring procedure have been conducted to date. What this study adds to the existing knowledge Live NWR scores made by 146 children were available and the accuracy of these judgements was assessed here against ones based on phonemic transcriptions. Signal detection analyses showed that live scoring missed consonant errors in children's responses. Further analyses, using linear mixed effect models, showed that live judgments led to consonant substitution, deletion and insertion errors. What are the practical and clinical implications of this work? Improved and practicable NWR scoring procedures are required to provide SaLTs with better indications about children's language development (typical and atypical) and for clinical assessments of older people. The procedures currently used miss substitutions, deletions and insertions. Hence, procedures are required that provide the information currently only available when materials are transcribed manually. The possibility of training automatic speech recognizers to provide this level of detail is raised.
非单词重复(NWR)测试是言语治疗师评估语言发展的重要方法。在临床和研究报告中(通过对已发表的系统评价的二次分析记录),NWR 测试通常在参与者做出反应时(即实时)进行评分。
主要目的是确定在何种程度上,通过使用几种线性混合方法(LMM)模型,从音素转录提供不同详细程度的模型,可以预测整个项目水平(正确/不正确)的 NWR 刺激的实时编码。
146 名年龄在 3 至 6 岁之间的儿童的实时分数和通用非单词重复(UNWR)测试的录音记录可用,其中包括来自英国五所学校的所有在一年内或连续两年开始的儿童。对所有儿童的双音节 NWR 刺激的反应进行了转录,并在转录员内部和之间对其进行了可靠性检查。信号检测分析表明,实时判断时会错过辅音。然后对儿童反应的目标刺激物与转录之间的差异进行了统计比较,并将其与实时分数的准确性进行了回归。检验了六个 LMM 模型(三个归一化:1a、2a、3a;三个非归一化:1b、2b、3b),以确定哪种模型最能捕捉数据方差。通过以下几种方式确定实时分数中的辅音错误(每种模型的两个变量):(1)仅辅音;(2)自动对齐实时和转录材料后识别的辅音替代、删除和插入;(3)与(2)相同,但将替换进一步编码为位置、方式和发音错误。
在实时评分中,将非单词中的辅音编码为替代、删除和插入的归一化模型(2b)在边缘 R、赤池信息量准则(AIC)和贝叶斯信息量准则(BIC)统计方面提供了最佳拟合实时编码反应。
实时判断非单词时出现的辅音错误仅由替代、删除和插入措施来描述。重要的是要知道,当实时判断时会出现此类错误,因为 NWR 任务用于评估和诊断几种认知语言障碍。结果的一个更广泛的意义是,未来的工作可以自动化分析程序,客观快速地提供所需的信息,而无需转录数据。
目前对此主题的了解儿童和患有各种认知和语言障碍的患者在尝试重复非单词时的准确性相对较低。通常在进行测试时,非单词的反应会被标记为正确或错误。到目前为止,对这种评分程序的评估有限。本研究增加了哪些新知识:146 名儿童的实时 NWR 分数可用,在此评估了这些判断的准确性,并将其与基于音素转录的准确性进行了比较。信号检测分析表明,实时评分遗漏了儿童反应中的辅音错误。进一步的分析使用线性混合效应模型表明,实时判断导致了辅音替代、删除和插入错误。这对临床工作有什么实际和临床意义?需要改进和可行的 NWR 评分程序,为言语治疗师提供有关儿童语言发展(典型和非典型)的更好信息,并为老年人的临床评估提供更好的信息。目前使用的程序会遗漏替代、删除和插入。因此,需要提供当前仅在手动转录材料时才可用的信息的程序。提出了训练自动语音识别器提供这种详细程度的可能性。