Department of Community Health Sciences, University of Manitoba, S113-750 Bannatyne Avenue, Winnipeg, MB, R3E 0W3, Canada.
Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
Qual Life Res. 2022 Sep;31(9):2837-2848. doi: 10.1007/s11136-022-03129-8. Epub 2022 Apr 7.
Item non-response (i.e., missing data) may mask the detection of differential item functioning (DIF) in patient-reported outcome measures or result in biased DIF estimates. Non-response can be challenging to address in ordinal data. We investigated an unsupervised machine-learning method for ordinal item-level imputation and compared it with commonly used item non-response methods when testing for DIF.
Computer simulation and real-world data were used to assess several item non-response methods using the item response theory likelihood ratio test for DIF. The methods included: (a) list-wise deletion (LD), (b) half-mean imputation (HMI), (c) full information maximum likelihood (FIML), and (d) non-negative matrix factorization (NNMF), which adopts a machine-learning approach to impute missing values. Control of Type I error rates was evaluated using a liberal robustness criterion for α = 0.05 (i.e., 0.025-0.075). Statistical power was assessed with and without adoption of an item non-response method; differences > 10% were considered substantial.
Type I error rates for detecting DIF using the LD, FIML, and NNMF methods were controlled within the bounds of the robustness criterion for > 95% of simulation conditions, although NNMF occasionally produced inflated rates. The HMI method always resulted in inflated error rates with 50% missing data. Differences in power to detect moderate DIF effects for the LD, FIML, and NNMF methods were substantial with 50% missing data and insubstantial otherwise.
The NNMF method demonstrated performance comparable to commonly used non-response methods. This computationally efficient method represents a promising approach to address item-level non-response when testing for DIF.
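The paper does not include an implementation, but the NNMF imputation idea can be sketched as follows: factorize the observed (non-missing) entries of the item-response matrix into two low-rank non-negative factors, then fill in the missing cells from the reconstruction, rounding to the nearest observed ordinal category. The sketch below uses masked multiplicative updates; the function name, rank, and iteration count are illustrative assumptions, not the authors' settings.

```python
import numpy as np

def nnmf_impute(X, mask, rank=2, n_iter=500, seed=0):
    """Impute missing entries of an ordinal item matrix via masked NNMF (sketch).

    X    : (n_persons, n_items) non-negative array; values under mask == 0 are ignored
    mask : (n_persons, n_items) array, 1 where observed, 0 where missing
    rank : assumed latent rank of the factorization (illustrative choice)
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    W = rng.random((n, rank)) + 1e-4   # person factors
    H = rng.random((rank, m)) + 1e-4   # item factors
    Xo = X * mask                      # zero out missing cells
    eps = 1e-9
    for _ in range(n_iter):
        # multiplicative updates restricted to observed entries
        WH = W @ H
        W *= (Xo @ H.T) / (((mask * WH) @ H.T) + eps)
        WH = W @ H
        H *= (W.T @ Xo) / ((W.T @ (mask * WH)) + eps)
    X_hat = W @ H
    lo, hi = X[mask == 1].min(), X[mask == 1].max()
    # keep observed responses; round reconstructions to the observed category range
    return np.where(mask == 1, X, np.clip(np.rint(X_hat), lo, hi))
```

Observed responses pass through unchanged, and imputed values are clipped to the observed category range so the filled matrix remains valid ordinal data for downstream DIF testing.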