Lee Woo-Yeol, Cho Sun-Joo, McGugin Rankin W, Van Gulick Ana Beth, Gauthier Isabel
J Vis. 2015;15(13):23. doi: 10.1167/15.13.23.
The Vanderbilt Expertise Test for cars (VETcar) is a test of visual learning for contemporary car models. We used item response theory to assess the VETcar and in particular used differential item functioning (DIF) analysis to ask if the test functions the same way in laboratory versus online settings and for different groups based on age and gender. An exploratory factor analysis found evidence of multidimensionality in the VETcar, although a single dimension was deemed sufficient to capture the recognition ability measured by the test. We selected a unidimensional three-parameter logistic item response model to examine item characteristics and subject abilities. The VETcar had satisfactory internal consistency. A substantial number of items showed DIF at a medium effect size for test setting and for age group, whereas gender DIF was negligible. Because online subjects were on average older than those tested in the lab, we focused on the age groups to conduct a multigroup item response theory analysis. This revealed that most items on the test favored the younger group. DIF could be more the rule than the exception when measuring performance with familiar object categories, therefore posing a challenge for the measurement of either domain-general visual abilities or category-specific knowledge.
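As a rough illustration of the three-parameter logistic (3PL) model named in the abstract, the sketch below computes the probability of a correct response from a subject's ability (theta) and an item's discrimination (a), difficulty (b), and guessing (c) parameters. It then shows what DIF means in this setting: two subjects with the same ability have different success probabilities when the item parameters differ by group. The function name and all parameter values are hypothetical choices for illustration. They are not estimates from the study, and this is not the authors' analysis code.

```python
import math


def p_correct_3pl(theta, a, b, c):
    """Probability of a correct response under the standard 3PL model.

    theta: subject ability
    a: item discrimination, b: item difficulty, c: lower asymptote (guessing)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical parameters for one item, estimated separately in two groups.
# Only the difficulty differs, so the item is harder for the second group.
item_group_1 = {"a": 1.2, "b": -0.3, "c": 0.33}
item_group_2 = {"a": 1.2, "b": 0.4, "c": 0.33}

theta = 0.0  # same ability level in both groups

p_group_1 = p_correct_3pl(theta, **item_group_1)
p_group_2 = p_correct_3pl(theta, **item_group_2)

# A nonzero gap at equal ability is what a DIF analysis is meant to detect.
dif_gap = p_group_1 - p_group_2
print(f"group 1: {p_group_1:.3f}, group 2: {p_group_2:.3f}, gap: {dif_gap:.3f}")
```

In this toy case the gap comes entirely from the shift in the difficulty parameter. In practice, DIF can also appear in the discrimination or guessing parameters.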