Evaluating the Performance of a Regularized Differential Item Functioning Method for Testlet-Based Polytomous Items.

Authors

Huang Jing, Miller M David, Huggins-Manley Anne Corinne, Leite Walter L, Knopf Herman T, Ritzhaupt Albert D

Affiliation

University of Florida, Gainesville, FL, USA.

Publication information

Educ Psychol Meas. 2025 May 31:00131644251342512. doi: 10.1177/00131644251342512.

DOI:10.1177/00131644251342512
PMID:40458608
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12126468/
Abstract

This study investigated the effect of testlets on regularization-based differential item functioning (DIF) detection in polytomous items, focusing on the generalized partial credit model with lasso penalization (GPCMlasso) DIF method. Five factors were manipulated: sample size, magnitude of testlet effect, magnitude of DIF, number of DIF items, and type of DIF-inducing covariates. Model performance was evaluated using false-positive rate (FPR) and true-positive rate (TPR). Results showed that the simulation had effective control of FPR across conditions, while the TPR was differentially influenced by the manipulated factors. Generally, the small testlet effect did not noticeably affect the GPCMlasso model's performance regarding FPR and TPR. The findings provide evidence of the effectiveness of the GPCMlasso method for DIF detection in polytomous items when testlets were present. The implications for future research and limitations were also discussed.
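
As an illustration of the two evaluation criteria named in the abstract, the sketch below computes FPR and TPR for a single simulated replication. It is a minimal example under stated assumptions, not the authors' code: the function name `dif_detection_rates`, the item indices, and the 20-item test length are all hypothetical. FPR is taken here as the share of DIF-free items that were flagged, and TPR as the share of true DIF items that were flagged.

```python
from typing import Iterable, Tuple


def dif_detection_rates(
    flagged: Iterable[int], true_dif: Iterable[int], n_items: int
) -> Tuple[float, float]:
    """Return (FPR, TPR) for one replication of a DIF simulation.

    flagged  -- indices of items the detection method flagged as DIF (hypothetical input)
    true_dif -- indices of items simulated with DIF (hypothetical input)
    n_items  -- total number of items on the test
    """
    flagged_set = set(flagged)
    true_set = set(true_dif)
    non_dif = set(range(n_items)) - true_set

    false_positives = len(flagged_set & non_dif)
    true_positives = len(flagged_set & true_set)

    # Guard against empty denominators (e.g. a condition with no DIF items).
    fpr = false_positives / len(non_dif) if non_dif else float("nan")
    tpr = true_positives / len(true_set) if true_set else float("nan")
    return fpr, tpr


# Hypothetical replication: 20 items, items 0-3 simulated with DIF,
# and the method flags items 0, 1, and 7.
fpr, tpr = dif_detection_rates(flagged=[0, 1, 7], true_dif=[0, 1, 2, 3], n_items=20)
print(f"FPR = {fpr:.4f}, TPR = {tpr:.2f}")
```

In a simulation study these per-replication rates would typically be averaged over replications within each condition; the exact aggregation used in the paper is not stated in the abstract.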


