Improving the Assessment of Differential Item Functioning in Large-Scale Programs With Dual-Scale Purification of Rasch Models: The PISA Example.

Authors

Chen Cheng-Te, Hwu Bo-Sien

Affiliations

National Tsing Hua University, Hsinchu, Taiwan.

National Sun Yat-sen University, Kaohsiung, Taiwan.

Publication information

Appl Psychol Meas. 2018 May;42(3):206-220. doi: 10.1177/0146621617726786. Epub 2017 Aug 29.

DOI:10.1177/0146621617726786
PMID:29881122
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5985702/
Abstract

By design, large-scale educational testing programs often have a large proportion of missing data. Recent studies of the effect of missing data on differential item functioning (DIF) assessment have found that Type I error rates tend to be inflated, so it is important to adapt existing DIF assessment methods to counter this inflation. The DIF-free-then-DIF (DFTD) strategy, which originally involved a single scale purification procedure to identify DIF-free items, is extended in this study to include a second scale purification procedure for the DIF assessment itself; the new method is called the dual-scale purification (DSP) procedure. The performance of the DSP procedure in assessing DIF in large-scale programs, such as the Program for International Student Assessment (PISA), was compared with that of the DFTD strategy through a series of simulation studies. The results showed that the DSP procedure outperformed the DFTD strategy when tests contained many DIF items and when data were missing by design, as in large-scale programs. An empirical study of the PISA 2009 Taiwan sample is also provided to show the implications of the DSP procedure. Applications of the DSP procedure and directions for further study are also discussed.
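
As a rough illustration of what scale purification means in general, the sketch below repeatedly tests every item for DIF, drops the flagged items from the matching scale, and retests until the flagged set stops changing. It is not the authors' DFTD or DSP procedure: it assumes a Mantel-Haenszel chi-square test in place of the Rasch-based tests used in the study, complete data with no planned missingness, and simulated responses. All function names, item difficulties, sample sizes, and the 1.5-logit DIF shift are hypothetical choices made for the example.

```python
import math
import random


def rasch_prob(theta, b):
    """Rasch model: P(correct) for ability theta and item difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def simulate(n_per_group, difficulties, dif_shifts, seed=0):
    """Simulate complete 0/1 responses for a reference and a focal group.

    dif_shifts[j] is added to item j's difficulty for the focal group only
    (a hypothetical way to plant uniform DIF).
    """
    rng = random.Random(seed)
    responses, groups = [], []
    for group in (0, 1):  # 0 = reference, 1 = focal
        for _ in range(n_per_group):
            theta = rng.gauss(0.0, 1.0)
            row = [int(rng.random() < rasch_prob(theta, b + group * s))
                   for b, s in zip(difficulties, dif_shifts)]
            responses.append(row)
            groups.append(group)
    return responses, groups


def mh_chi2(responses, groups, item, anchor):
    """Mantel-Haenszel chi-square for one item, matching on the anchor score."""
    matching = set(anchor) | {item}  # the studied item stays in the matching score
    strata = {}
    for row, g in zip(responses, groups):
        score = sum(row[j] for j in matching)
        # cell = [ref correct, ref incorrect, focal correct, focal incorrect]
        cell = strata.setdefault(score, [0, 0, 0, 0])
        cell[2 * g + (1 - row[item])] += 1
    observed = expected = variance = 0.0
    for a, b, c, d in strata.values():
        n = a + b + c + d
        if n < 2:
            continue  # a stratum with one person carries no information
        observed += a
        expected += (a + b) * (a + c) / n
        variance += (a + b) * (c + d) * (a + c) * (b + d) / (n * n * (n - 1))
    if variance == 0.0:
        return 0.0
    # Chi-square with continuity correction, 1 degree of freedom.
    return max(abs(observed - expected) - 0.5, 0.0) ** 2 / variance


def purify(responses, groups, n_items, critical=3.84, max_iter=10):
    """Iteratively drop flagged items from the matching scale until stable."""
    flagged = set()
    for _ in range(max_iter):
        anchor = [j for j in range(n_items) if j not in flagged]
        new_flagged = {j for j in range(n_items)
                       if mh_chi2(responses, groups, j, anchor) > critical}
        if new_flagged == flagged:
            break
        flagged = new_flagged
    return flagged


# Hypothetical 10-item test; only item 0 carries DIF (1.5 logits harder for the focal group).
difficulties = [-1.0, -0.5, 0.0, 0.5, 1.0, -1.0, -0.5, 0.0, 0.5, 1.0]
dif_shifts = [1.5] + [0.0] * 9
responses, groups = simulate(1000, difficulties, dif_shifts, seed=1)
flagged = purify(responses, groups, n_items=len(difficulties))
print(sorted(flagged))
```

The critical value 3.84 is the 5% cutoff of a chi-square distribution with one degree of freedom. Because each DIF-free item is still tested at that level, an occasional false flag among items 1 to 9 is expected; the inflation of such Type I errors under planned missing data is the problem the abstract describes.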


Similar articles

1. Improving the Assessment of Differential Item Functioning in Large-Scale Programs With Dual-Scale Purification of Rasch Models: The PISA Example.
   Appl Psychol Meas. 2018 May;42(3):206-220. doi: 10.1177/0146621617726786. Epub 2017 Aug 29.
2. The Matching Criterion Purification for Differential Item Functioning Analyses in a Large-Scale Assessment.
   Educ Psychol Meas. 2016 Feb;76(1):141-163. doi: 10.1177/0013164415585166. Epub 2015 May 18.
3. Using Odds Ratios to Detect Differential Item Functioning.
   Appl Psychol Meas. 2018 Nov;42(8):613-629. doi: 10.1177/0146621618762738. Epub 2018 Mar 21.
4. Combining Item Purification and Multiple Comparison Adjustment Methods in Detection of Differential Item Functioning.
   Multivariate Behav Res. 2024 Jan-Feb;59(1):46-61. doi: 10.1080/00273171.2023.2205393. Epub 2023 May 23.
5. A New Stopping Criterion for Rasch Trees Based on the Mantel-Haenszel Effect Size Measure for Differential Item Functioning.
   Educ Psychol Meas. 2023 Feb;83(1):181-212. doi: 10.1177/00131644221077135. Epub 2022 Feb 28.
6. Item Response Theory With Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model.
   Educ Psychol Meas. 2016 Feb;76(1):22-42. doi: 10.1177/0013164415579488. Epub 2015 Apr 6.
7. Monitoring Countries in a Changing World: A New Look at DIF in International Surveys.
   Psychometrika. 2017 Mar;82(1):210-232. doi: 10.1007/s11336-016-9543-8. Epub 2016 Nov 14.
8. The effect of missing data and imputation on the detection of bias in cognitive testing using differential item functioning methods.
   BMC Med Res Methodol. 2022 Mar 27;22(1):81. doi: 10.1186/s12874-022-01572-2.
9. The Interaction of Ability Differences and Guessing When Modeling Differential Item Functioning With the Rasch Model: Conventional and Tailored Calibration.
   Educ Psychol Meas. 2015 Aug;75(4):610-633. doi: 10.1177/0013164414554082. Epub 2014 Oct 20.
10. Improving the assessment of measurement invariance: Using regularization to select anchor items and identify differential item functioning.
    Psychol Methods. 2020 Dec;25(6):673-690. doi: 10.1037/met0000253. Epub 2020 Jan 9.

Cited by

1. Examination of Gender-Related Differential Item Functioning Through Poly-BW Indices.
   Front Psychol. 2022 Feb 25;13:821459. doi: 10.3389/fpsyg.2022.821459. eCollection 2022.
2. Detecting Rater Biases in Sparse Rater-Mediated Assessment Networks.
   Educ Psychol Meas. 2021 Oct;81(5):996-1022. doi: 10.1177/0013164420988108. Epub 2021 Jan 19.

References

1. How Item Residual Heterogeneity Affects Tests for Differential Item Functioning.
   Appl Psychol Meas. 2015 Jun;39(4):251-263. doi: 10.1177/0146621614561313. Epub 2014 Dec 11.
2. Anchor Selection Strategies for DIF Analysis: Review, Assessment, and New Approaches.
   Educ Psychol Meas. 2015 Feb;75(1):22-56. doi: 10.1177/0013164414529792. Epub 2014 Apr 21.
3. Assessment of differential item functioning.
   J Appl Meas. 2008;9(4):387-408.