• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

带协变量的项目反应理论(IRT-C):评估三参数逻辑斯蒂模型的项目恢复与项目功能差异

Item Response Theory With Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model.

作者信息

Tay Louis, Huang Qiming, Vermunt Jeroen K

机构信息

Purdue University, West Lafayette, IN, USA.

Tilburg University, Tilburg, Netherlands.

出版信息

Educ Psychol Meas. 2016 Feb;76(1):22-42. doi: 10.1177/0013164415579488. Epub 2015 Apr 6.

DOI:10.1177/0013164415579488
PMID:29795855
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5965573/
Abstract

In large-scale testing, the use of multigroup approaches is limited for assessing differential item functioning (DIF) across multiple variables as DIF is examined for each variable separately. In contrast, the item response theory with covariate (IRT-C) procedure can be used to examine DIF across multiple variables (covariates) simultaneously. To assess the utility of the IRT-C procedure, we conducted a simulation study. Using SAT data for realistic parameters, uniform DIF on three covariates were simulated: gender (dichotomous), race/ethnicity (categorical), and income (continuous). Simulations were conducted across several conditions: two test lengths (14 items, 21 items), four sample sizes (5,000, 10,000, 20,000, 40,000), and two DIF effect sizes (medium, large). It was found that the IRT-C procedure could accurately recover the latent means and the three-parameter logistic model parameters well with a substantial sample size of 20,000. There was good control of Type I error rates to the nominal rates across the sample sizes. Good power to detect DIF across all covariates (>.80) was observed when the sample size was 20,000 for large DIF effect size and 40,000 for medium DIF effect size. Practical implications for the use of the IRT-C procedure are discussed.

摘要

在大规模测试中,多组方法在评估多个变量的项目功能差异(DIF)方面存在局限性,因为DIF是针对每个变量分别进行检验的。相比之下,带协变量的项目反应理论(IRT-C)程序可用于同时检验多个变量(协变量)的DIF。为了评估IRT-C程序的效用,我们进行了一项模拟研究。使用SAT数据的实际参数,模拟了三个协变量上的均匀DIF:性别(二分变量)、种族/民族(分类变量)和收入(连续变量)。在几种条件下进行了模拟:两种测试长度(14个项目、21个项目)、四个样本量(5000、10000、20000、40000)和两种DIF效应大小(中等、大)。结果发现,在样本量达到20000时,IRT-C程序能够很好地准确恢复潜在均值和三参数逻辑模型参数。在所有样本量下,第一类错误率都能很好地控制在名义水平。当大DIF效应大小的样本量为20000且中等DIF效应大小的样本量为40000时,观察到在所有协变量上检测DIF的能力良好(>.80)。本文还讨论了IRT-C程序使用的实际意义。

相似文献

1
Item Response Theory With Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model.带协变量的项目反应理论(IRT-C):评估三参数逻辑斯蒂模型的项目恢复与项目功能差异
Educ Psychol Meas. 2016 Feb;76(1):22-42. doi: 10.1177/0013164415579488. Epub 2015 Apr 6.
2
Testing Differential Item Functioning in Small Samples.小样本中的差异项目功能测试。
Multivariate Behav Res. 2020 Sep-Oct;55(5):722-747. doi: 10.1080/00273171.2019.1671162. Epub 2019 Oct 4.
3
Modern psychometric methods for detection of differential item functioning: application to cognitive assessment measures.用于检测项目功能差异的现代心理测量方法:在认知评估测量中的应用。
Stat Med. 2000;19(11-12):1651-83. doi: 10.1002/(sici)1097-0258(20000615/30)19:11/12<1651::aid-sim453>3.0.co;2-h.
4
Differential Item Functioning Analyses of the Patient-Reported Outcomes Measurement Information System (PROMIS®) Measures: Methods, Challenges, Advances, and Future Directions.患者报告结局测量信息系统(PROMIS®)测评的项目区分度分析:方法、挑战、进展及未来方向。
Psychometrika. 2021 Sep;86(3):674-711. doi: 10.1007/s11336-021-09775-0. Epub 2021 Jul 12.
5
Measurement Equivalence of the Patient Reported Outcomes Measurement Information System (PROMIS) Pain Interference Short Form Items: Application to Ethnically Diverse Cancer and Palliative Care Populations.患者报告结局测量信息系统(PROMIS)疼痛干扰简表条目的测量等效性:在不同种族癌症和姑息治疗人群中的应用。
Psychol Test Assess Model. 2016;58(2):309-352.
6
Multidimensional Extension of Multiple Indicators Multiple Causes Models to Detect DIF.用于检测差异项目功能的多指标多原因模型的多维扩展
Educ Psychol Meas. 2017 Aug;77(4):545-569. doi: 10.1177/0013164416651116. Epub 2016 May 25.
7
Measurement Equivalence of the Patient Reported Outcomes Measurement Information System (PROMIS) Applied Cognition - General Concerns, Short Forms in Ethnically Diverse Groups.患者报告结局测量信息系统(PROMIS)应用认知量表在不同种族群体中的测量等效性——一般问题及简表
Psychol Test Assess Model. 2016;58(2):255-307.
8
After Differential Item Functioning Is Detected: IRT Item Calibration and Scoring in the Presence of DIF.在检测到项目功能差异之后:存在项目功能差异时的项目反应理论项目校准与计分
Appl Psychol Meas. 2016 Nov;40(8):573-591. doi: 10.1177/0146621616664304. Epub 2016 Sep 24.
9
A Power Formula for the Mantel-Haenszel Test for Differential Item Functioning.用于项目功能差异的曼特尔-亨泽尔检验的功效公式。
Appl Psychol Meas. 2015 Jul;39(5):373-388. doi: 10.1177/0146621614568805. Epub 2015 Feb 5.
10
Power and Type I Error of the Mean and Covariance Structure Analysis Model for Detecting Differential Item Functioning in Graded Response Items.用于检测等级反应项目中项目功能差异的均值和协方差结构分析模型的功效与I型错误
Multivariate Behav Res. 2006 Mar 1;41(1):29-53. doi: 10.1207/s15327906mbr4101_3.

引用本文的文献

1
DIF Analysis with Unknown Groups and Anchor Items.不同组别和锚定项目的 DIF 分析。
Psychometrika. 2024 Mar;89(1):267-295. doi: 10.1007/s11336-024-09948-7. Epub 2024 Feb 21.
2
Identification of sources of DIF using covariates in patient-reported outcome measures: a simulation study comparing two approaches based on Rasch family models.在患者报告结局测量中使用协变量识别差异项目功能的来源:一项基于Rasch族模型比较两种方法的模拟研究
Front Psychol. 2023 Aug 10;14:1191107. doi: 10.3389/fpsyg.2023.1191107. eCollection 2023.
3
DIF Statistical Inference Without Knowing Anchoring Items.不知晓锚定项目的 DIF 统计推断。
Psychometrika. 2023 Dec;88(4):1097-1122. doi: 10.1007/s11336-023-09930-9. Epub 2023 Aug 7.
4
Differential Item Functioning Analysis Without A Priori Information on Anchor Items: QQ Plots and Graphical Test.无锚点先验信息的项目区分度分析:QQ 图和图形检验。
Psychometrika. 2021 Jun;86(2):345-377. doi: 10.1007/s11336-021-09746-5. Epub 2021 Mar 3.
5
The Bayesian Expectation-Maximization-Maximization for the 3PLM.用于三参数逻辑斯蒂模型的贝叶斯期望最大化算法。
Front Psychol. 2019 May 31;10:1175. doi: 10.3389/fpsyg.2019.01175. eCollection 2019.
6
Investigating Measurement Invariance by Means of Parameter Instability Tests for 2PL and 3PL Models.通过两参数逻辑斯蒂模型和三参数逻辑斯蒂模型的参数稳定性检验来研究测量不变性
Educ Psychol Meas. 2019 Apr;79(2):385-398. doi: 10.1177/0013164418777784. Epub 2018 May 24.
7
Expectation-Maximization-Maximization: A Feasible MLE Algorithm for the Three-Parameter Logistic Model Based on a Mixture Modeling Reformulation.期望最大化最大化:一种基于混合建模重新表述的三参数逻辑模型的可行极大似然估计算法。
Front Psychol. 2018 Jan 5;8:2302. doi: 10.3389/fpsyg.2017.02302. eCollection 2017.

本文引用的文献

1
How Item Residual Heterogeneity Affects Tests for Differential Item Functioning.项目残差异质性如何影响项目功能差异检验。
Appl Psychol Meas. 2015 Jun;39(4):251-263. doi: 10.1177/0146621614561313. Epub 2014 Dec 11.
2
Evaluation of MIMIC-Model Methods for DIF Testing With Comparison to Two-Group Analysis.评估 MIMIC 模型方法在差异分析中的应用,并与两组分析进行比较。
Multivariate Behav Res. 2009 Jan-Feb;44(1):1-27. doi: 10.1080/00273170802620121.
3
Illustration of MIMIC-Model DIF Testing with the Schedule for Nonadaptive and Adaptive Personality.使用非适应性和适应性人格量表对MIMIC模型差异项目功能测试的说明。
J Psychopathol Behav Assess. 2009;31(4):320-330. doi: 10.1007/s10862-008-9118-9.
4
Detecting differential item functioning with confirmatory factor analysis and item response theory: toward a unified strategy.运用验证性因素分析和项目反应理论检测项目功能差异:迈向统一策略
J Appl Psychol. 2006 Nov;91(6):1292-306. doi: 10.1037/0021-9010.91.6.1292.