Suppr超能文献

比较等级反应模型的比分检验和其他局部相依性诊断方法。

Comparing score tests and other local dependence diagnostics for the graded response model.

作者信息

Liu Yang, Thissen David

机构信息

Department of Psychology, The University of North Carolina, Chapel Hill, North Carolina, USA.

出版信息

Br J Math Stat Psychol. 2014 Nov;67(3):496-513. doi: 10.1111/bmsp.12030. Epub 2013 Nov 25.

Abstract

Score tests for identifying locally dependent item pairs have been proposed for binary item response models. In this article, both the bifactor and the threshold shift score tests are generalized to the graded response model. For the bifactor test, the generalization is straightforward; it adds one secondary dimension associated only with one pair of items. For the threshold shift test, however, multiple generalizations are possible: in particular, conditional, uniform, and linear shift tests are discussed in this article. Simulation studies show that all of the score tests have accurate Type I error rates given large enough samples, although their small-sample behaviour is not as good as that of Pearson's Χ2 and M2 as proposed in other studies for the purpose of local dependence (LD) detection. All score tests have the highest power to detect the LD which is consistent with their parametric form, and in this case they are uniformly more powerful than Χ2 and M2 ; even wrongly specified score tests are more powerful than Χ2 and M2 in most conditions. An example using empirical data is provided for illustration.

摘要

针对二元项目反应模型,已提出用于识别局部相关项目对的计分检验。在本文中,双因素计分检验和阈值移动计分检验都被推广到了等级反应模型。对于双因素检验,推广很直接;它增加了一个仅与一对项目相关的次要维度。然而,对于阈值移动检验,有多种推广方式:特别是,本文讨论了条件、均匀和线性移动检验。模拟研究表明,在样本量足够大的情况下,所有计分检验的第一类错误率都是准确的,尽管它们的小样本行为不如其他研究中为检测局部相依性(LD)而提出的Pearson卡方检验和M2检验。所有计分检验在检测与它们的参数形式一致的LD时具有最高的功效,在这种情况下,它们比卡方检验和M2检验具有一致更强的功效;即使是错误设定的计分检验在大多数情况下也比卡方检验和M2检验更有功效。提供了一个使用实证数据的例子进行说明。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验