多分类Rasch模型的基于树的全局模型检验

Tree-Based Global Model Tests for Polytomous Rasch Models.

作者信息

Komboz Basil, Strobl Carolin, Zeileis Achim

机构信息

Ludwig-Maximilians-Universität München, München, Germany.

Universität Zürich, Zürich, Switzerland.

出版信息

Educ Psychol Meas. 2018 Feb;78(1):128-166. doi: 10.1177/0013164416664394. Epub 2016 Oct 6.

DOI:10.1177/0013164416664394

PMID:29795950

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5965621/

Abstract

Psychometric measurement models are only valid if measurement invariance holds between test takers of different groups. Global model tests, such as the well-established likelihood ratio (LR) test, are sensitive to violations of measurement invariance, such as differential item functioning and differential step functioning. However, these traditional approaches are only applicable when comparing previously specified reference and focal groups, such as males and females. Here, we propose a new framework for global model tests for polytomous Rasch models based on a model-based recursive partitioning algorithm. With this approach, a priori specification of reference and focal groups is no longer necessary, because they are automatically detected in a data-driven way. The statistical background of the new framework is introduced along with an instructive example. A series of simulation studies illustrates and compares its statistical properties to the well-established LR test. While both the LR test and the new framework are sensitive to differential item functioning and differential step functioning and respect a given significance level regardless of true differences in the ability distributions, the new data-driven approach is more powerful when the group structure is not known a priori-as will usually be the case in practical applications. The usage and interpretation of the new method are illustrated in an empirical application example. A software implementation is freely available in the R system for statistical computing.

摘要

心理测量模型只有在不同群体的测试者之间测量不变性成立时才有效。全局模型检验，如成熟的似然比（LR）检验，对测量不变性的违反很敏感，如项目功能差异和步长功能差异。然而，这些传统方法仅适用于比较先前指定的参考组和焦点组，如男性和女性。在此，我们基于基于模型的递归划分算法，为多分类Rasch模型的全局模型检验提出了一个新框架。通过这种方法，不再需要先验指定参考组和焦点组，因为它们是以数据驱动的方式自动检测出来的。新框架的统计背景与一个指导性示例一起介绍。一系列模拟研究说明了新框架的统计特性，并将其与成熟的LR检验进行了比较。虽然LR检验和新框架对项目功能差异和步长功能差异都很敏感，并且无论能力分布的真实差异如何都尊重给定的显著性水平，但当群体结构不是先验已知时（实际应用中通常如此），新的数据驱动方法更强大。在一个实证应用示例中说明了新方法的使用和解释。在R统计计算系统中可免费获得该方法的软件实现。

相似文献

Tree-Based Global Model Tests for Polytomous Rasch Models.多分类Rasch模型的基于树的全局模型检验

Educ Psychol Meas. 2018 Feb;78(1):128-166. doi: 10.1177/0013164416664394. Epub 2016 Oct 6.

Rasch Trees: A New Method for Detecting Differential Item Functioning in the Rasch Model.拉施树：一种检测拉施模型中项目功能差异的新方法。

Psychometrika. 2015 Jun;80(2):289-316. doi: 10.1007/s11336-013-9388-3. Epub 2013 Dec 19.

Item-focussed Trees for the Identification of Items in Differential Item Functioning.用于识别差异性项目功能中项目的基于项目聚焦的树状图

Psychometrika. 2016 Sep;81(3):727-50. doi: 10.1007/s11336-015-9488-3. Epub 2015 Nov 23.

An R toolbox for score-based measurement invariance tests in IRT models.IRT 模型中基于评分的测量不变性检验的 R 工具箱。

Behav Res Methods. 2022 Oct;54(5):2101-2113. doi: 10.3758/s13428-021-01689-0. Epub 2021 Dec 16.

Score-Based Tests of Differential Item Functioning via Pairwise Maximum Likelihood Estimation.基于评分的成对最大似然估计的项目区分功能差异检验。

Psychometrika. 2018 Mar;83(1):132-155. doi: 10.1007/s11336-017-9591-8. Epub 2017 Nov 17.

Polytomous Item Explanatory Item Response Theory Models.多分类项目解释性项目反应理论模型

Educ Psychol Meas. 2020 Aug;80(4):726-755. doi: 10.1177/0013164419892667. Epub 2019 Dec 13.

Using the dichotomous Rasch model to analyze polytomous items.使用二分法Rasch模型分析多分类项目。

J Appl Meas. 2013;14(1):44-56.

Rasch Mixture Models for DIF Detection: A Comparison of Old and New Score Specifications.用于差异项目功能（DIF）检测的拉施克混合模型：新旧分数规范的比较

Educ Psychol Meas. 2015 Apr;75(2):208-234. doi: 10.1177/0013164414536183. Epub 2014 Jun 22.

Assessment of Psychometric Properties of an Oral Health Care Measure of Cultural Competence Among Dental Students Using Rasch Partial Credit Model.使用拉施克部分计分模型评估牙科学生文化能力口腔保健测量工具的心理测量特性。

J Dent Educ. 2018 Oct;82(10):1105-1114. doi: 10.21815/JDE.018.107.

An Evaluation of Overall Goodness-of-Fit Tests for the Rasch Model.拉施模型整体拟合优度检验的评估

Front Psychol. 2019 Jan 10;9:2710. doi: 10.3389/fpsyg.2018.02710. eCollection 2018.

引用本文的文献

Tree-based item-response theory model for evaluating differential item functioning in patient-reported outcome measures: a web-based R Shiny implementation.用于评估患者报告结局指标中项目功能差异的基于树的项目反应理论模型：基于网络的R Shiny实现

Qual Life Res. 2025 Aug 22. doi: 10.1007/s11136-025-04046-2.

Tree-based latent variable model for assessing differential item functioning in patient-reported outcome measures: a simulation study.用于评估患者报告结局指标中项目功能差异的基于树的潜在变量模型：一项模拟研究。

Qual Life Res. 2025 Jul 18. doi: 10.1007/s11136-025-04018-6.

Evaluating the Performance of a Regularized Differential Item Functioning Method for Testlet-Based Polytomous Items.评估基于测验题组的多值项目的正则化差异项目功能方法的性能。

Educ Psychol Meas. 2025 May 31:00131644251342512. doi: 10.1177/00131644251342512.

Investigating heterogeneity in IRTree models for multiple response processes with score-based partitioning.使用基于分数的划分方法研究用于多个响应过程的IRT树模型中的异质性。

Br J Math Stat Psychol. 2025 May;78(2):420-439. doi: 10.1111/bmsp.12367. Epub 2024 Nov 4.

Latent Variable Forests for Latent Variable Score Estimation.用于潜在变量得分估计的潜在变量森林

Educ Psychol Meas. 2024 Dec;84(6):1138-1172. doi: 10.1177/00131644241237502. Epub 2024 Apr 1.

Screening for depression in patients with epilepsy: same questions but different meaning to different patients.癫痫患者的抑郁筛查：相同的问题，但对不同患者有不同的含义。

Qual Life Res. 2024 Dec;33(12):3409-3419. doi: 10.1007/s11136-024-03782-1. Epub 2024 Sep 9.

Detecting Differential Item Functioning in Multidimensional Graded Response Models With Recursive Partitioning.使用递归划分法在多维等级反应模型中检测项目差异功能

Appl Psychol Meas. 2024 May;48(3):83-103. doi: 10.1177/01466216241238743. Epub 2024 Mar 13.

Adapting a self-efficacy scale to the task of teaching scientific reasoning: collecting evidence for its psychometric quality using Rasch measurement.使自我效能感量表适用于科学推理教学任务：使用拉施测量法收集其心理测量学质量的证据。

Front Psychol. 2024 Feb 7;15:1339615. doi: 10.3389/fpsyg.2024.1339615. eCollection 2024.

Unsupervised item response theory models for assessing sample heterogeneity in patient-reported outcomes measures.用于评估患者报告结局测量中样本异质性的无监督项目反应理论模型。

Qual Life Res. 2024 Mar;33(3):853-864. doi: 10.1007/s11136-023-03560-5. Epub 2023 Dec 21.

Detecting heterogeneity in the causal direction of dependence: A model-based recursive partitioning approach.检测依赖因果方向中的异质性：一种基于模型的递归划分方法。

Behav Res Methods. 2024 Apr;56(4):2711-2730. doi: 10.3758/s13428-023-02253-8. Epub 2023 Oct 19.

本文引用的文献

Anchor Selection Strategies for DIF Analysis: Review, Assessment, and New Approaches.差异项目功能分析的锚定选择策略：综述、评估及新方法

Educ Psychol Meas. 2015 Feb;75(1):22-56. doi: 10.1177/0013164414529792. Epub 2014 Apr 21.

Tests of measurement invariance without subgroups: a generalization of classical methods.无亚组的测量不变性检验：经典方法的推广

Psychometrika. 2013 Jan;78(1):59-82. doi: 10.1007/s11336-012-9302-4. Epub 2012 Dec 13.

Rasch Trees: A New Method for Detecting Differential Item Functioning in the Rasch Model.拉施树：一种检测拉施模型中项目功能差异的新方法。

Psychometrika. 2015 Jun;80(2):289-316. doi: 10.1007/s11336-013-9388-3. Epub 2013 Dec 19.

An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests.递归分区介绍：分类和回归树、装袋和随机森林的原理、应用和特点。

Psychol Methods. 2009 Dec;14(4):323-48. doi: 10.1037/a0016973.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验