差异项目功能在筛查中何时重要？一种实证评估方法。

When Does Differential Item Functioning Matter for Screening? A Method for Empirical Evaluation.

作者信息

Gonzalez Oscar, Pelham William E

机构信息

University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.

Arizona State University, Tempe, AZ, USA.

出版信息

Assessment. 2021 Mar;28(2):446-456. doi: 10.1177/1073191120913618. Epub 2020 Apr 4.

DOI:10.1177/1073191120913618

PMID:32248701

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9705193/

Abstract

When items in a screening measure exhibit differential item functioning (DIF) across groups (e.g., males vs. females), DIF might affect which individuals are "caught" in the screening. This phenomenon is common, but DIF detection procedures do not typically provide guidance on whether the presence of DIF will meaningfully affect screening accuracy. Millsap and Kwok proposed a method to quantify the impact of DIF on screening accuracy, but their approach had limitations that prevent its use in scenarios where items are discrete. We extend the Millsap and Kwok procedure to accommodate discrete items and provide functions to apply the procedure to the user's own data. We illustrate our approach using published screening information and evaluate the proposed methodology with a small simulation study. Overall, we encourage researchers to use empirical methods to evaluate the extent to which the presence of DIF in a screening measure materially affects screening performance.

摘要

当筛查指标中的项目在不同群体（如男性与女性）间表现出项目功能差异（DIF）时，DIF可能会影响哪些个体在筛查中被“检出”。这种现象很常见，但DIF检测程序通常不会就DIF的存在是否会对筛查准确性产生有意义的影响提供指导。米尔萨普和郭提出了一种量化DIF对筛查准确性影响的方法，但其方法存在局限性，无法用于项目为离散型的情况。我们扩展了米尔萨普和郭的程序以适应离散型项目，并提供了将该程序应用于用户自身数据的函数。我们使用已发表的筛查信息来说明我们的方法，并通过一个小型模拟研究对所提出的方法进行评估。总体而言，我们鼓励研究人员采用实证方法来评估筛查指标中DIF的存在对筛查性能产生实质性影响的程度。

相似文献

When Does Differential Item Functioning Matter for Screening? A Method for Empirical Evaluation.

Assessment. 2021 Mar;28(2):446-456. doi: 10.1177/1073191120913618. Epub 2020 Apr 4.

Psychometric Properties and Performance of the Patient Reported Outcomes Measurement Information System (PROMIS) Depression Short Forms in Ethnically Diverse Groups.

Psychol Test Assess Model. 2016;58(1):141-181.

Measurement Equivalence of the Patient Reported Outcomes Measurement Information System (PROMIS) Pain Interference Short Form Items: Application to Ethnically Diverse Cancer and Palliative Care Populations.

Psychol Test Assess Model. 2016;58(2):309-352.

Modern psychometric methods for detection of differential item functioning: application to cognitive assessment measures.

Stat Med. 2000;19(11-12):1651-83. doi: 10.1002/(sici)1097-0258(20000615/30)19:11/12<1651::aid-sim453>3.0.co;2-h.

Measurement Equivalence of the Patient Reported Outcomes Measurement Information System (PROMIS) Anxiety Short Forms in Ethnically Diverse Groups.

Psychol Test Assess Model. 2016;58(1):183-219.

Evaluating measurement equivalence using the item response theory log-likelihood ratio (IRTLR) method to assess differential item functioning (DIF): applications (with illustrations) to measures of physical functioning ability and general distress.

Qual Life Res. 2007;16 Suppl 1:43-68. doi: 10.1007/s11136-007-9186-4. Epub 2007 May 5.

Generalisability of the Barthel Index and the Functional Independence Measure: robustness of disability measures to Differential Item Functioning.

Disabil Rehabil. 2025 Apr;47(8):2134-2145. doi: 10.1080/09638288.2024.2391554. Epub 2024 Sep 2.

After Differential Item Functioning Is Detected: IRT Item Calibration and Scoring in the Presence of DIF.

Appl Psychol Meas. 2016 Nov;40(8):573-591. doi: 10.1177/0146621616664304. Epub 2016 Sep 24.

Improvement in Detection of Differential Item Functioning Using a Mixture Item Response Theory Model.

Multivariate Behav Res. 2010 Nov 30;45(6):975-99. doi: 10.1080/00273171.2010.533047.

Wald χ Test for Differential Item Functioning Detection with Polytomous Items in Multilevel Data.

Educ Psychol Meas. 2024 Jun;84(3):530-548. doi: 10.1177/00131644231181688. Epub 2023 Jul 11.

引用本文的文献

"What does 'often' even mean?" Revising and validating the Comprehensive Autistic Trait Inventory in partnership with autistic people.

Mol Autism. 2025 Feb 6;16(1):7. doi: 10.1186/s13229-025-00643-7.

Are there subgroup differences in the accuracy of 'screening' questions for mood and anxiety disorder diagnostic interviews?

Int J Methods Psychiatr Res. 2024 Dec;33(4):e70008. doi: 10.1002/mpr.70008.

Protocol for a systematic review evaluating psychometric properties and gender-related measurement (non)invariance of self-report assessment tools for autism in adults.

Syst Rev. 2024 Jul 19;13(1):188. doi: 10.1186/s13643-024-02604-2.

Measurement invariance of the parent-reported Strengths and Difficulties Questionnaire in autistic adolescents.

Autism. 2024 Oct;28(10):2623-2636. doi: 10.1177/13623613241236805. Epub 2024 Mar 13.

Gender bias in autism screening: measurement invariance of different model frameworks of the Autism Spectrum Quotient.

BJPsych Open. 2023 Oct 2;9(5):e173. doi: 10.1192/bjo.2023.562.

Summary Intervals for Model-Based Classification Accuracy and Consistency Indices.

Educ Psychol Meas. 2023 Apr;83(2):240-261. doi: 10.1177/00131644221092347. Epub 2022 Apr 28.

Mental Health and Well-being Measures for Mean Comparison and Screening in Adolescents: An Assessment of Unidimensionality and Sex and Age Measurement Invariance.

Assessment. 2024 Mar;31(2):219-236. doi: 10.1177/10731911231158623. Epub 2023 Mar 2.

Psychometric properties of sum scores and factor scores differ even when their correlation is 0.98: A response to Widaman and Revelle.

Behav Res Methods. 2023 Dec;55(8):4269-4290. doi: 10.3758/s13428-022-02016-x. Epub 2022 Nov 17.

How Accurate and Consistent Are Score-Based Assessment Decisions? A Procedure Using the Linear Factor Model.

Assessment. 2023 Jul;30(5):1640-1650. doi: 10.1177/10731911221113568. Epub 2022 Aug 11.

Measurement Invariance of Psychological Distress, Substance Use, and Adult Social Support across Race/Ethnicity and Sex among Sexual Minority Youth.

J Sex Res. 2023 May-Jun;60(5):674-688. doi: 10.1080/00224499.2022.2038059. Epub 2022 Feb 24.

本文引用的文献

Quantifying the impact of partial measurement invariance in diagnostic research: An application to addiction research.

Addict Behav. 2019 Jul;94:50-56. doi: 10.1016/j.addbeh.2018.11.029. Epub 2018 Nov 22.

It Might Not Make a Big DIF: Improved Differential Test Functioning Statistics That Account for Sampling Variability.

Educ Psychol Meas. 2016 Feb;76(1):114-140. doi: 10.1177/0013164415584576. Epub 2015 Jun 29.

Differential item functioning magnitude and impact measures from item response theory models.

Psychol Test Assess Model. 2016;58(1):79-98.

Screening for Depression in the General Population with the Center for Epidemiologic Studies Depression (CES-D): A Systematic Review with Meta-Analysis.

PLoS One. 2016 May 16;11(5):e0155431. doi: 10.1371/journal.pone.0155431. eCollection 2016.

A primer on receiver operating characteristic analysis and diagnostic efficiency statistics for pediatric psychology: we are ready to ROC.

J Pediatr Psychol. 2014 Mar;39(2):204-21. doi: 10.1093/jpepsy/jst062. Epub 2013 Aug 21.

When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions.

Psychol Methods. 2012 Sep;17(3):354-73. doi: 10.1037/a0029315. Epub 2012 Jul 16.

Modifying measures based on differential item functioning (DIF) impact analyses.

J Aging Health. 2012 Sep;24(6):1044-76. doi: 10.1177/0898264312436877. Epub 2012 Mar 15.

The value of item response theory in clinical assessment: a review.

Assessment. 2011 Sep;18(3):291-307. doi: 10.1177/1073191110374797. Epub 2010 Jul 19.

A taxonomy of effect size measures for the differential functioning of items and scales.

J Appl Psychol. 2010 Jul;95(4):728-43. doi: 10.1037/a0018966.

Item factor analysis: current approaches and future directions.

Psychol Methods. 2007 Mar;12(1):58-79. doi: 10.1037/1082-989X.12.1.58.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

差异项目功能在筛查中何时重要？一种实证评估方法。

When Does Differential Item Functioning Matter for Screening? A Method for Empirical Evaluation.

作者信息

Gonzalez Oscar, Pelham William E

机构信息

University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.

Arizona State University, Tempe, AZ, USA.

出版信息

Assessment. 2021 Mar;28(2):446-456. doi: 10.1177/1073191120913618. Epub 2020 Apr 4.

DOI:10.1177/1073191120913618

PMID:32248701

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9705193/

Abstract

摘要

差异项目功能在筛查中何时重要？一种实证评估方法。

When Does Differential Item Functioning Matter for Screening? A Method for Empirical Evaluation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

差异项目功能在筛查中何时重要？一种实证评估方法。

When Does Differential Item Functioning Matter for Screening? A Method for Empirical Evaluation.

作者信息

机构信息

出版信息