Leung Chi-Keung, Chang Hua-Hua, Hau Kit-Tai
Department of Mathematics, Hong Kong Institute of Education, Hong Kong.
Br J Math Stat Psychol. 2005 Nov;58(Pt 2):239-57. doi: 10.1348/000711005X62945.
In computerized adaptive testing (CAT), traditionally the most discriminating items are selected to provide the maximum information so as to attain the highest efficiency in trait (theta) estimation. The maximum information (MI) approach typically results in unbalanced item exposure and hence high item-overlap rates across examinees. Recently, Yi and Chang (2003) proposed the multiple stratification (MS) method to remedy the shortcomings of MI. In MS, items are first sorted according to content, then difficulty and finally discrimination parameters. As discriminating items are used strategically, MS offers a better utilization of the entire item pool. However, for testing with imposed non-statistical constraints, this new stratification approach may not maintain its high efficiency. Through a series of simulation studies, this research explored the possible benefits of a mixture item selection approach (MS-MI), integrating the MS and MI approaches, in testing with non-statistical constraints. In all simulation conditions, MS consistently outperformed the other two competing approaches in item pool utilization, while the MS-MI and the MI approaches yielded higher measurement efficiency and offered better conformity to the constraints. Furthermore, the MS-MI approach was shown to perform better than MI on all evaluation criteria when control of item exposure was imposed.
在计算机自适应测试(CAT)中,传统上会选择区分度最高的题目以提供最大信息量,从而在特质(θ)估计中实现最高效率。最大信息量(MI)方法通常会导致题目曝光不均衡,进而使考生之间的题目重叠率很高。最近,易和张(2003)提出了多重分层(MS)方法来弥补MI的缺点。在MS中,题目首先按内容排序,然后按难度排序,最后按区分度参数排序。由于有策略地使用区分度高的题目,MS能更好地利用整个题目库。然而,对于有非统计约束的测试,这种新的分层方法可能无法保持其高效率。通过一系列模拟研究,本研究探索了在有非统计约束的测试中,整合MS和MI方法的混合题目选择方法(MS-MI)可能带来的好处。在所有模拟条件下,MS在题目库利用率方面始终优于其他两种竞争方法,而MS-MI和MI方法则产生了更高的测量效率,并更好地符合约束条件。此外,当对题目曝光进行控制时,MS-MI方法在所有评估标准上都表现得比MI更好。