Benedict Ralph H B
State University of New York at Buffalo School of Medicine, Department of Neurology, Buffalo General Hospital, New York 14203, USA.
J Int Neuropsychol Soc. 2005 Oct;11(6):727-36. doi: 10.1017/S1355617705050782.
Repeated neuropsychological testing gives rise to practice effects in that patients become familiar with test material as well as test-taking procedures. Using alternate forms prevents the learning of specific test stimuli, potentially mitigating practice effects. However, changing forms could diminish test-retest reliability coefficients. Our objective was to examine test-retest effects in multiple sclerosis (MS) patients randomly assigned to same- (SF) or alternate-form (AF) conditions. Thirty-four MS patients underwent neuropsychological evaluation. The battery included the California Verbal Learning Test II (CVLT-II) and the Brief Visuospatial Memory Test-Revised (BVMT-R), memory tests recommended by a recently convened consensus panel. Patients were randomly assigned to SF or AF groups and then tested at baseline and follow-up examination 1 week later. Analysis of variance tests (ANOVAs) revealed significant group x time interactions, with SF patients showing greater gain than AF patients. SF practice effects were often large, compromising test validity. Reliability coefficients were either equivalent or higher in the AF group, a finding attributed to ceiling effects and reduced variance in the SF group at retest. The generalizability of the findings may be limited to short test-retest intervals and the MS population. Nevertheless, I conclude that the use of CVLT-II and BVMT-R alternate forms likely helps preserve test validity without compromising test-retest reliability.
重复进行神经心理学测试会产生练习效应,因为患者会熟悉测试材料以及测试流程。使用替代形式可防止对特定测试刺激的学习,从而可能减轻练习效应。然而,更换测试形式可能会降低重测信度系数。我们的目的是研究随机分配到相同形式(SF)或替代形式(AF)条件下的多发性硬化症(MS)患者的重测效应。34名MS患者接受了神经心理学评估。测试组合包括加利福尼亚言语学习测试第二版(CVLT-II)和修订后的简短视觉空间记忆测试(BVMT-R),这是最近一个共识小组推荐的记忆测试。患者被随机分配到SF组或AF组,然后在基线时进行测试,并在1周后的随访检查中再次测试。方差分析(ANOVA)显示出显著的组×时间交互作用,SF组患者的进步比AF组患者更大。SF组的练习效应通常很大,损害了测试的有效性。AF组的信度系数要么相当,要么更高,这一发现归因于天花板效应以及SF组在重测时方差减小。这些发现的普遍性可能仅限于较短的重测间隔和MS人群。尽管如此,我得出结论,使用CVLT-II和BVMT-R的替代形式可能有助于保持测试有效性,同时又不损害重测信度。