• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于得分的贝叶斯极大后验估计在项目反应理论中的测量不变性检验。

Score-based measurement invariance checks for Bayesian maximum-a-posteriori estimates in item response theory.

机构信息

Department of Psychology, University of Zurich, Switzerland.

Epidemiology, Biostatistics and Prevention Institute (EBPI), University of Zurich, Switzerland.

出版信息

Br J Math Stat Psychol. 2022 Nov;75(3):728-752. doi: 10.1111/bmsp.12275. Epub 2022 Jun 6.

DOI:10.1111/bmsp.12275
PMID:35670000
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9796736/
Abstract

A family of score-based tests has been proposed in recent years for assessing the invariance of model parameters in several models of item response theory (IRT). These tests were originally developed in a maximum likelihood framework. This study discusses analogous tests for Bayesian maximum-a-posteriori estimates and multiple-group IRT models. We propose two families of statistical tests, which are based on an approximation using a pooled variance method, or on a simulation approach based on asymptotic results. The resulting tests were evaluated by a simulation study, which investigated their sensitivity against differential item functioning with respect to a categorical or continuous person covariate in the two- and three-parametric logistic models. Whereas the method based on pooled variance was found to be useful in practice with maximum likelihood as well as maximum-a-posteriori estimates, the simulation-based approach was found to require large sample sizes to lead to satisfactory results.

摘要

近年来,提出了一类基于得分的检验方法,用于评估项目反应理论(IRT)中几种模型的模型参数不变性。这些检验最初是在最大似然框架中开发的。本研究讨论了贝叶斯最大后验估计和多组 IRT 模型的类似检验。我们提出了两类统计检验,它们基于使用合并方差方法的近似值,或基于渐近结果的模拟方法。通过模拟研究评估了得到的检验,该研究调查了它们对二参数和三参数逻辑模型中类别或连续个体协变量的差异项目功能的敏感性。尽管基于合并方差的方法在最大似然和最大后验估计中都被发现具有实际用途,但基于模拟的方法发现需要大样本量才能得到令人满意的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/067c667267d4/BMSP-75-728-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/cbb69dc201b3/BMSP-75-728-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/b5a16c070936/BMSP-75-728-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/bda01cdb8bb6/BMSP-75-728-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/471dfe742fdd/BMSP-75-728-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/fbf3fd71e38b/BMSP-75-728-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/17bed7469cd8/BMSP-75-728-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/2d040bc6594f/BMSP-75-728-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/4673956783a3/BMSP-75-728-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/e80e4eec3865/BMSP-75-728-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/c9bc25f1b28e/BMSP-75-728-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/8d61531aad5e/BMSP-75-728-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/d0944cd18d8b/BMSP-75-728-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/067c667267d4/BMSP-75-728-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/cbb69dc201b3/BMSP-75-728-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/b5a16c070936/BMSP-75-728-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/bda01cdb8bb6/BMSP-75-728-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/471dfe742fdd/BMSP-75-728-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/fbf3fd71e38b/BMSP-75-728-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/17bed7469cd8/BMSP-75-728-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/2d040bc6594f/BMSP-75-728-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/4673956783a3/BMSP-75-728-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/e80e4eec3865/BMSP-75-728-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/c9bc25f1b28e/BMSP-75-728-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/8d61531aad5e/BMSP-75-728-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/d0944cd18d8b/BMSP-75-728-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f90e/9796736/067c667267d4/BMSP-75-728-g013.jpg

相似文献

1
Score-based measurement invariance checks for Bayesian maximum-a-posteriori estimates in item response theory.基于得分的贝叶斯极大后验估计在项目反应理论中的测量不变性检验。
Br J Math Stat Psychol. 2022 Nov;75(3):728-752. doi: 10.1111/bmsp.12275. Epub 2022 Jun 6.
2
Robustness of the performance of the optimized hierarchical two-parameter logistic IRT model for small-sample item calibration.优化的层次双参数逻辑斯蒂克IRT 模型在小样本项目标定中性能的稳健性。
Behav Res Methods. 2023 Dec;55(8):3965-3983. doi: 10.3758/s13428-022-02000-5. Epub 2022 Nov 4.
3
Score-Based Tests of Differential Item Functioning via Pairwise Maximum Likelihood Estimation.基于评分的成对最大似然估计的项目区分功能差异检验。
Psychometrika. 2018 Mar;83(1):132-155. doi: 10.1007/s11336-017-9591-8. Epub 2017 Nov 17.
4
Testing Differential Item Functioning in Small Samples.小样本中的差异项目功能测试。
Multivariate Behav Res. 2020 Sep-Oct;55(5):722-747. doi: 10.1080/00273171.2019.1671162. Epub 2019 Oct 4.
5
Efficient Standard Error Formulas of Ability Estimators with Dichotomous Item Response Models.二分项目反应模型下能力估计器的有效标准误差公式
Psychometrika. 2016 Mar;81(1):184-200. doi: 10.1007/s11336-015-9443-3. Epub 2015 Feb 18.
6
On Latent Trait Estimation in Multidimensional Compensatory Item Response Models.多维补偿性项目反应模型中的潜在特质估计
Psychometrika. 2015 Jun;80(2):428-49. doi: 10.1007/s11336-013-9399-0. Epub 2014 Mar 7.
7
Power Analysis for the Wald, LR, Score, and Gradient Tests in a Marginal Maximum Likelihood Framework: Applications in IRT.边缘极大似然框架下 Wald、LR、Score 和梯度检验的功效分析:IRT 中的应用。
Psychometrika. 2023 Dec;88(4):1249-1298. doi: 10.1007/s11336-022-09883-5. Epub 2022 Aug 27.
8
HBMIRT: A SAS macro for estimating uni- and multidimensional 1- and 2-parameter item response models in small (and large!) samples.HBMIRT:一个用于在小(和大!)样本中估计单维和多维 1 参和 2 参项目反应模型的 SAS 宏。
Behav Res Methods. 2024 Apr;56(4):4130-4161. doi: 10.3758/s13428-024-02366-8. Epub 2024 Mar 22.
9
An R toolbox for score-based measurement invariance tests in IRT models.IRT 模型中基于评分的测量不变性检验的 R 工具箱。
Behav Res Methods. 2022 Oct;54(5):2101-2113. doi: 10.3758/s13428-021-01689-0. Epub 2021 Dec 16.
10
Second-Order Probability Matching Priors for the Person Parameter in Unidimensional IRT Models.二维概率匹配先验在单维 IRT 模型中的个人参数。
Psychometrika. 2019 Sep;84(3):701-718. doi: 10.1007/s11336-019-09675-4. Epub 2019 Jul 1.

引用本文的文献

1
Investigating heterogeneity in IRTree models for multiple response processes with score-based partitioning.使用基于分数的划分方法研究用于多个响应过程的IRT树模型中的异质性。
Br J Math Stat Psychol. 2025 May;78(2):420-439. doi: 10.1111/bmsp.12367. Epub 2024 Nov 4.

本文引用的文献

1
A flexible moderated factor analysis approach to test for measurement invariance across a continuous variable.一种灵活的中介因子分析方法,用于检验连续变量的测量不变性。
Psychol Methods. 2021 Dec;26(6):660-679. doi: 10.1037/met0000360. Epub 2020 Oct 15.
2
Investigating Measurement Invariance by Means of Parameter Instability Tests for 2PL and 3PL Models.通过两参数逻辑斯蒂模型和三参数逻辑斯蒂模型的参数稳定性检验来研究测量不变性
Educ Psychol Meas. 2019 Apr;79(2):385-398. doi: 10.1177/0013164418777784. Epub 2018 May 24.
3
Tree-Based Global Model Tests for Polytomous Rasch Models.
多分类Rasch模型的基于树的全局模型检验
Educ Psychol Meas. 2018 Feb;78(1):128-166. doi: 10.1177/0013164416664394. Epub 2016 Oct 6.
4
Different Approaches to Covariate Inclusion in the Mixture Rasch Model.混合Rasch模型中协变量纳入的不同方法。
Educ Psychol Meas. 2016 Oct;76(5):848-872. doi: 10.1177/0013164415610380. Epub 2015 Oct 13.
5
Score-Based Tests of Differential Item Functioning via Pairwise Maximum Likelihood Estimation.基于评分的成对最大似然估计的项目区分功能差异检验。
Psychometrika. 2018 Mar;83(1):132-155. doi: 10.1007/s11336-017-9591-8. Epub 2017 Nov 17.
6
Detecting treatment-subgroup interactions in clustered data with generalized linear mixed-effects model trees.基于广义线性混合效应模型树检测聚类数据中的治疗亚组交互作用。
Behav Res Methods. 2018 Oct;50(5):2016-2034. doi: 10.3758/s13428-017-0971-x.
7
A more general model for testing measurement invariance and differential item functioning.更一般的测量不变性和项目区分功能检验模型。
Psychol Methods. 2017 Sep;22(3):507-526. doi: 10.1037/met0000077. Epub 2016 Jun 6.
8
Revisiting the 4-Parameter Item Response Model: Bayesian Estimation and Application.重新审视四参数项目反应模型:贝叶斯估计与应用。
Psychometrika. 2016 Dec;81(4):1142-1163. doi: 10.1007/s11336-015-9477-6. Epub 2015 Sep 23.
9
Modeling and Testing Differential Item Functioning in Unidimensional Binary Item Response Models with a Single Continuous Covariate: A Functional Data Analysis Approach.使用单一连续协变量的单维二元项目反应模型中差异项目功能的建模与检验:一种函数数据分析方法
Psychometrika. 2016 Jun;81(2):371-98. doi: 10.1007/s11336-015-9473-x. Epub 2015 Jul 9.
10
Tests of measurement invariance without subgroups: a generalization of classical methods.无亚组的测量不变性检验:经典方法的推广
Psychometrika. 2013 Jan;78(1):59-82. doi: 10.1007/s11336-012-9302-4. Epub 2012 Dec 13.