a Department of Psychology and Brain Health Research Centre , University of Otago , Dunedin , New Zealand.
b Brain Research New Zealand , Dunedin , New Zealand.
J Clin Exp Neuropsychol. 2019 Mar;41(2):179-191. doi: 10.1080/13803395.2018.1526888. Epub 2018 Oct 15.
: Repeated cognitive assessment is frequently required to monitor changes in cognitive functioning in older adults, but studies of repeated computerized testing in this population are scarce. To provide new insight into retest effects this study examined within- and between-day performance shifts and test-retest reliability among healthy older adults for test scores from a computerized cognitive battery. : Thirty older men (65-71 years) completed the battery six times. Testing occurred twice on each of three testing days, separated by 1 week. : Reaction times (RTs) on tasks intended to measure inhibition (Anti), response switching (Pro/Anti), selective attention (Simon and Flanker), and working memory (2-back) typically showed practice effects, which were most prominent between the first two time points. In most cases, these RTs showed moderate to good test-retest reliability (intraclass correlation coefficient, ICC, range = .34 to .93) with lower reliability between the first two time points. Two-back accuracy rates showed similar results. In contrast, RTs on a basic visuomotor task (Pro) and on compatible trials of the Simon task showed increases at later time points, presumably because of boredom, but demonstrated mostly moderate to good reliability (ICC range = .49 to .83). Scoring metrics from a computerized version of the Corsi Block-Tapping task (intended to measure short-term and working memory) and cost scores (performance differences between two related conditions/tasks) intended to isolate specific cognitive functions tended to show poor reliability (ICC range = -.23 to .62). : Most of the RT tasks investigated showed suitability for use in repeated testing among older adults, although longer familiarization periods appear to be warranted in many cases to reduce practice effects and improve initial reliability. However, poor reliability indicated that scoring metrics from the computerized Corsi Block-Tapping task and cost scores are unsuitable for repeated testing.
:在老年人中,经常需要进行重复的认知评估以监测认知功能的变化,但针对该人群的重复计算机测试研究却很少。为了深入了解重测效应,本研究在健康老年人中,考察了计算机化认知成套测验的分数在各次测试之间和日内的表现变化和测试-重测信度。:30 名老年男性(65-71 岁)共完成了 6 次测试。每次测试在 3 个测试日中的 2 天进行,两次测试之间间隔 1 周。:旨在测量抑制(Anti)、反应转换(Pro/Anti)、选择性注意(Simon 和 Flanker)和工作记忆(2 -back)的任务的反应时(RT)通常表现出练习效应,在前两个测试点之间最为明显。在大多数情况下,这些 RT 的测试-重测信度为中等至良好(组内相关系数 ICC,范围为.34 至.93),而在前两个测试点之间的信度较低。2-back 的准确率也表现出相似的结果。相反,在基本的视觉运动任务(Pro)和 Simon 任务的相容试验中,RT 在稍后的时间点上增加,这可能是由于无聊所致,但仍具有中等至良好的可靠性(ICC 范围为.49 至.83)。旨在测量短期和工作记忆的计算机化 Corsi 块追踪任务的得分指标和旨在分离特定认知功能的成本得分(两个相关条件/任务之间的表现差异),其可靠性往往较差(ICC 范围为.23 至.62)。:研究中调查的大多数 RT 任务都适合在老年人中进行重复测试,尽管在许多情况下,似乎需要更长的熟悉期,以减少练习效应并提高初始可靠性。然而,较差的可靠性表明,计算机化 Corsi 块追踪任务的得分指标和成本得分不适合重复测试。