• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

可能差异不大:改进的差异检验功能统计量,考虑了抽样变异性。

It Might Not Make a Big DIF: Improved Differential Test Functioning Statistics That Account for Sampling Variability.

作者信息

Chalmers R Philip, Counsell Alyssa, Flora David B

机构信息

York University, Toronto, Ontario, Canada.

出版信息

Educ Psychol Meas. 2016 Feb;76(1):114-140. doi: 10.1177/0013164415584576. Epub 2015 Jun 29.

DOI:10.1177/0013164415584576
PMID:29795859
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5965577/
Abstract

Differential test functioning, or DTF, occurs when one or more items in a test demonstrate differential item functioning (DIF) and the aggregate of these effects are witnessed at the test level. In many applications, DTF can be more important than DIF when the overall effects of DIF at the test level can be quantified. However, optimal statistical methodology for detecting and understanding DTF has not been developed. This article proposes improved DTF statistics that properly account for sampling variability in item parameter estimates while avoiding the necessity of predicting provisional latent trait estimates to create two-step approximations. The properties of the DTF statistics were examined with two Monte Carlo simulation studies using dichotomous and polytomous IRT models. The simulation results revealed that the improved DTF statistics obtained optimal and consistent statistical properties, such as obtaining consistent Type I error rates. Next, an empirical analysis demonstrated the application of the proposed methodology. Applied settings where the DTF statistics can be beneficial are suggested and future DTF research areas are proposed.

摘要

差异测验功能(DTF)是指当测验中的一个或多个项目表现出差异项目功能(DIF),且这些效应的总和在测验层面上被观察到时出现的情况。在许多应用中,当测验层面上DIF的总体效应可以量化时,DTF可能比DIF更重要。然而,尚未开发出用于检测和理解DTF的最佳统计方法。本文提出了改进的DTF统计量,该统计量能够恰当地考虑项目参数估计中的抽样变异性,同时避免了预测临时潜在特质估计以创建两步近似值的必要性。使用二分和多分IRT模型的两项蒙特卡罗模拟研究检验了DTF统计量的性质。模拟结果表明,改进后的DTF统计量获得了最优且一致的统计性质,例如获得一致的I类错误率。接下来,实证分析展示了所提出方法的应用。提出了DTF统计量可能有益的应用场景,并提出了未来DTF的研究领域。

相似文献

1
It Might Not Make a Big DIF: Improved Differential Test Functioning Statistics That Account for Sampling Variability.可能差异不大:改进的差异检验功能统计量,考虑了抽样变异性。
Educ Psychol Meas. 2016 Feb;76(1):114-140. doi: 10.1177/0013164415584576. Epub 2015 Jun 29.
2
Assessing the equivalence of Web-based and paper-and-pencil questionnaires using differential item and test functioning (DIF and DTF) analysis: a case of the Four-Dimensional Symptom Questionnaire (4DSQ).使用基于网络的和纸质问卷的等效性评估,采用不同项目和测试功能(DIF 和 DTF)分析:以四维度症状问卷(4DSQ)为例。
Qual Life Res. 2018 May;27(5):1191-1200. doi: 10.1007/s11136-018-1816-5. Epub 2018 Feb 21.
3
After Differential Item Functioning Is Detected: IRT Item Calibration and Scoring in the Presence of DIF.在检测到项目功能差异之后:存在项目功能差异时的项目反应理论项目校准与计分
Appl Psychol Meas. 2016 Nov;40(8):573-591. doi: 10.1177/0146621616664304. Epub 2016 Sep 24.
4
Modern psychometric methods for detection of differential item functioning: application to cognitive assessment measures.用于检测项目功能差异的现代心理测量方法:在认知评估测量中的应用。
Stat Med. 2000;19(11-12):1651-83. doi: 10.1002/(sici)1097-0258(20000615/30)19:11/12<1651::aid-sim453>3.0.co;2-h.
5
Model-Based Measures for Detecting and Quantifying Response Bias.基于模型的响应偏差检测和量化方法
Psychometrika. 2018 Sep;83(3):696-732. doi: 10.1007/s11336-018-9626-9. Epub 2018 Jun 15.
6
Generalisability of the Barthel Index and the Functional Independence Measure: robustness of disability measures to Differential Item Functioning.巴氏指数和功能独立性测量的可推广性:残疾测量对项目功能差异的稳健性。
Disabil Rehabil. 2025 Apr;47(8):2134-2145. doi: 10.1080/09638288.2024.2391554. Epub 2024 Sep 2.
7
A Monte Carlo Study of an Iterative Wald Test Procedure for DIF Analysis.用于差异项目功能分析的迭代 Wald 检验程序的蒙特卡罗研究。
Educ Psychol Meas. 2017 Jan;77(1):104-118. doi: 10.1177/0013164416637104. Epub 2016 Mar 7.
8
Differential Item Functioning via Robust Scaling.稳健标度的差异项目功能分析。
Psychometrika. 2024 Sep;89(3):796-821. doi: 10.1007/s11336-024-09957-6. Epub 2024 May 4.
9
lordif: An R Package for Detecting Differential Item Functioning Using Iterative Hybrid Ordinal Logistic Regression/Item Response Theory and Monte Carlo Simulations.lordif:一个用于使用迭代混合有序逻辑回归/项目反应理论和蒙特卡罗模拟检测项目功能差异的R包。
J Stat Softw. 2011 Mar 1;39(8):1-30. doi: 10.18637/jss.v039.i08.
10
Item Response Theory With Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model.带协变量的项目反应理论(IRT-C):评估三参数逻辑斯蒂模型的项目恢复与项目功能差异
Educ Psychol Meas. 2016 Feb;76(1):22-42. doi: 10.1177/0013164415579488. Epub 2015 Apr 6.

引用本文的文献

1
Evaluating the Performance of a Regularized Differential Item Functioning Method for Testlet-Based Polytomous Items.评估基于测验题组的多值项目的正则化差异项目功能方法的性能。
Educ Psychol Meas. 2025 May 31:00131644251342512. doi: 10.1177/00131644251342512.
2
Harmonization of SDQ and ASEBA Phenotypes: Measurement Variance Across Cohorts.优势与困难问卷(SDQ)和青少年自我报告表(ASEBA)表型的协调:不同队列间的测量差异
J Psychopathol Behav Assess. 2025;47(1):27. doi: 10.1007/s10862-025-10204-0. Epub 2025 Mar 7.
3
Measuring visual ability in linguistically diverse populations.在语言多样化人群中测量视觉能力。
Behav Res Methods. 2024 Dec 30;57(1):36. doi: 10.3758/s13428-024-02579-x.
4
Sense of coherence in students while studying abroad.留学生的连贯感。
Health SA. 2024 Sep 30;29:2585. doi: 10.4102/hsag.v29i0.2585. eCollection 2024.
5
Establishing and evaluating the gradient of item naming difficulty in post-stroke aphasia and semantic dementia.建立和评估脑卒中后失语症和语义性痴呆患者命名项目难度的梯度。
Cortex. 2024 Oct;179:103-111. doi: 10.1016/j.cortex.2024.07.007. Epub 2024 Aug 10.
6
Equivalence of Alcohol Use Disorder Symptom Assessments in Routine Clinical Care When Completed Remotely via Online Patient Portals Versus In Clinic via Paper Questionnaires: Psychometric Evaluation.远程通过在线患者门户而非临床面对面通过纸质问卷完成酒精使用障碍症状评估用于常规临床护理时的等效性:心理测量学评估。
J Med Internet Res. 2024 Jul 22;26:e52101. doi: 10.2196/52101.
7
Ant colony optimization for parallel test assembly.蚁群优化并行测试装配。
Behav Res Methods. 2024 Sep;56(6):5834-5848. doi: 10.3758/s13428-023-02319-7. Epub 2024 Jan 26.
8
Differential item functioning of material deprivation assessment in households with or without children.有子女和无子女家庭物质剥夺评估的项目差异功能。
PLoS One. 2023 Aug 17;18(8):e0290112. doi: 10.1371/journal.pone.0290112. eCollection 2023.
9
Psychometric Analysis of the Modified Differential Emotions Scale and the Six-Item Life Orientation Test-Revised in a Cohort of Older Women from the Women's Health Initiative.《女性健康倡议中老年女性队列中改良差异情绪量表和六项目生活取向测验修订版的心理计量学分析》。
J Womens Health (Larchmt). 2023 Sep;32(9):992-1005. doi: 10.1089/jwh.2023.0056. Epub 2023 Jul 17.
10
Changes in Tobacco Dependence and Association With Onset and Progression of Use by Product Type From Waves 1 to 3 of the Population Assessment of Tobacco and Health (PATH) Study.《人口评估烟草与健康(PATH)研究》第 1 波至第 3 波中,产品类型使用的起始和进展与烟草依赖变化的关系。
Nicotine Tob Res. 2023 Sep 4;25(11):1781-1790. doi: 10.1093/ntr/ntad107.

本文引用的文献

1
Information matrices and standard errors for MLEs of item parameters in IRT.IRT中项目参数极大似然估计的信息矩阵和标准误差。
Psychometrika. 2014 Apr;79(2):232-54. doi: 10.1007/s11336-013-9334-4. Epub 2013 Mar 27.
2
Characterizing Sources of Uncertainty in IRT Scale Scores.刻画IRT量表分数中的不确定性来源
Educ Psychol Meas. 2012 Apr 1;72(2):264-290. doi: 10.1177/0013164411410056. Epub 2011 Aug 25.
3
Incorporating Measurement Non-Equivalence in a Cross-Study Latent Growth Curve Analysis.在跨研究潜在增长曲线分析中纳入测量非等效性
Struct Equ Modeling. 2008 Oct 1;15(4):676-704. doi: 10.1080/10705510802339080.
4
SEM of another flavour: two new applications of the supplemented EM algorithm.另一种变体的扫描电子显微镜:补充期望最大化算法的两个新应用。
Br J Math Stat Psychol. 2008 Nov;61(Pt 2):309-29. doi: 10.1348/000711007X249603. Epub 2007 Oct 29.