利用潜在变量混合模型在测验编制中识别不变项目。

The use of latent variable mixture models to identify invariant items in test construction.

机构信息

School of Nursing, Trinity Western University, 7600 Glover Rd, Langley, BC, V2Y1Y1, Canada.

Centre for Health Evaluation and Outcome Sciences, Providence Health Care, Vancouver, BC, Canada.

出版信息

Qual Life Res. 2018 Jul;27(7):1745-1755. doi: 10.1007/s11136-017-1680-8. Epub 2017 Aug 23.

DOI:10.1007/s11136-017-1680-8

PMID:28836090

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5997718/

Abstract

PURPOSE

Patient-reported outcome measures (PROMs) are frequently used in heterogeneous patient populations. PROM scores may lead to biased inferences when sources of heterogeneity (e.g., gender, ethnicity, and social factors) are ignored. Latent variable mixture models (LVMMs) can be used to examine measurement invariance (MI) when sources of heterogeneity in the population are not known a priori. The goal of this article is to discuss the use of LVMMs to identify invariant items within the context of test construction.

METHODS

The Draper-Lindely-de Finetti (DLD) framework for the measurement of latent variables provides a theoretical context for the use of LVMMs to identify the most invariant items in test construction. In an expository analysis using 39 items measuring daily activities, LVMMs were conducted to compare 1- and 2-class item response theory models (IRT). If the 2-class model had better fit, item-level logistic regression differential item functioning (DIF) analyses were conducted to identify items that were not invariant. These items were removed and LVMMs and DIF testing repeated until all remaining items showed MI.

RESULTS

The 39 items had an essentially unidimensional measurement structure. However, a 1-class IRT model resulted in many statistically significant bivariate residuals, indicating suboptimal fit due to remaining local dependence. A 2-class LVMM had better fit. Through subsequent rounds of LVMMs and DIF testing, nine items were identified as being most invariant.

CONCLUSIONS

The DLD framework and the use of LVMMs have significant potential for advancing theoretical developments and research on item selection and the development of PROMs for heterogeneous populations.

摘要

目的

患者报告结局测量（PROM）常用于异质患者群体。如果忽略异质源（例如，性别、种族和社会因素），PROM 评分可能会导致有偏差的推断。潜在变量混合模型（LVMM）可用于在人群中异质源未知的情况下检查测量不变性（MI）。本文的目的是讨论在测试构建中使用 LVMM 识别不变项目。

方法

Draper-Lindely-de Finetti（DLD）框架用于测量潜在变量，为使用 LVMM 识别测试构建中最不变的项目提供了理论背景。在使用 39 个测量日常活动的项目的说明性分析中，进行了 LVMM 以比较 1 类和 2 类项目反应理论模型（IRT）。如果 2 类模型拟合更好，则进行项目级逻辑回归差异项目功能（DIF）分析以识别不变的项目。删除这些项目，并重复 LVMM 和 DIF 测试，直到所有剩余项目都显示 MI。

结果

39 个项目具有基本的单维测量结构。然而，1 类 IRT 模型导致许多统计学上显著的双变量残差，表明由于剩余的局部依赖性，拟合不理想。2 类 LVMM 拟合更好。通过随后几轮的 LVMM 和 DIF 测试，确定了 9 个最不变的项目。

结论

DLD 框架和 LVMM 的使用对推进异质人群的项目选择和 PROM 开发的理论发展和研究具有重要潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7264/5997718/543b1d1026f3/11136_2017_1680_Fig1_HTML.jpg

相似文献

The use of latent variable mixture models to identify invariant items in test construction.

Qual Life Res. 2018 Jul;27(7):1745-1755. doi: 10.1007/s11136-017-1680-8. Epub 2017 Aug 23.

Latent variable mixture models to test for differential item functioning: a population-based analysis.

Health Qual Life Outcomes. 2017 May 15;15(1):102. doi: 10.1186/s12955-017-0674-0.

Latent variable mixture models to address heterogeneity in patient-reported outcome data.

Methods. 2022 Aug;204:151-159. doi: 10.1016/j.ymeth.2022.03.010. Epub 2022 Mar 18.

Quality of Life in Epilepsy: Same questions, but different meaning to different people.

Epilepsia. 2021 Sep;62(9):2094-2102. doi: 10.1111/epi.17012. Epub 2021 Jul 26.

Evaluating measurement equivalence using the item response theory log-likelihood ratio (IRTLR) method to assess differential item functioning (DIF): applications (with illustrations) to measures of physical functioning ability and general distress.

Qual Life Res. 2007;16 Suppl 1:43-68. doi: 10.1007/s11136-007-9186-4. Epub 2007 May 5.

Accuracy of mixture item response theory models for identifying sample heterogeneity in patient-reported outcomes: a simulation study.

Qual Life Res. 2022 Dec;31(12):3423-3432. doi: 10.1007/s11136-022-03169-0. Epub 2022 Jun 18.

Differential Item Functioning Analyses of the Patient-Reported Outcomes Measurement Information System (PROMIS®) Measures: Methods, Challenges, Advances, and Future Directions.

Psychometrika. 2021 Sep;86(3):674-711. doi: 10.1007/s11336-021-09775-0. Epub 2021 Jul 12.

The Impact of Test and Sample Characteristics on Model Selection and Classification Accuracy in the Multilevel Mixture IRT Model.

Front Psychol. 2020 Feb 14;11:197. doi: 10.3389/fpsyg.2020.00197. eCollection 2020.

The Accuracy of Computerized Adaptive Testing in Heterogeneous Populations: A Mixture Item-Response Theory Analysis.

PLoS One. 2016 Mar 1;11(3):e0150563. doi: 10.1371/journal.pone.0150563. eCollection 2016.

An essay on measurement and factorial invariance.

Med Care. 2006 Nov;44(11 Suppl 3):S69-77. doi: 10.1097/01.mlr.0000245438.73837.89.

引用本文的文献

Unsupervised item response theory models for assessing sample heterogeneity in patient-reported outcomes measures.

Qual Life Res. 2024 Mar;33(3):853-864. doi: 10.1007/s11136-023-03560-5. Epub 2023 Dec 21.

How to Improve Interpretability of Patient-Reported Outcome Measures for Clinical Use: A Perspective on Measuring Abilities and Feelings.

Patient Relat Outcome Meas. 2022 Mar 25;13:69-77. doi: 10.2147/PROM.S355679. eCollection 2022.

Introduction to special section: test construction.

Qual Life Res. 2018 Jul;27(7):1671-1672. doi: 10.1007/s11136-018-1886-4.

本文引用的文献

Latent variable mixture models to test for differential item functioning: a population-based analysis.

Health Qual Life Outcomes. 2017 May 15;15(1):102. doi: 10.1186/s12955-017-0674-0.

Montreal Accord on Patient-Reported Outcomes (PROs) use series-Paper 7: modern perspectives of measurement validation emphasize justification of inferences based on patient reported outcome scores.

J Clin Epidemiol. 2017 Sep;89:154-159. doi: 10.1016/j.jclinepi.2016.12.002. Epub 2016 Dec 18.

Testing Students with Special Educational Needs in Large-Scale Assessments - Psychometric Properties of Test Scores and Associations with Test Taking Behavior.

Front Psychol. 2016 Feb 23;7:154. doi: 10.3389/fpsyg.2016.00154. eCollection 2016.

The Accuracy of Computerized Adaptive Testing in Heterogeneous Populations: A Mixture Item-Response Theory Analysis.

PLoS One. 2016 Mar 1;11(3):e0150563. doi: 10.1371/journal.pone.0150563. eCollection 2016.

Modeling Qualitative Variation Within Latent Trait Dimensions: Application of Mixed-Measurement to Personality Assessment.

Multivariate Behav Res. 1995 Jul 1;30(3):341-58. doi: 10.1207/s15327906mbr3003_3.

Improvement in Detection of Differential Item Functioning Using a Mixture Item Response Theory Model.

Multivariate Behav Res. 2010 Nov 30;45(6):975-99. doi: 10.1080/00273171.2010.533047.

Detecting Social Desirability Bias Using Factor Mixture Models.

Multivariate Behav Res. 2010 Mar 31;45(2):271-93. doi: 10.1080/00273171003680245.

Factor mixture modeling of anxiety sensitivity: a three-class structure.

Psychol Assess. 2014 Dec;26(4):1184-95. doi: 10.1037/a0037436. Epub 2014 Jul 28.

Models and Strategies for Factor Mixture Analysis: An Example Concerning the Structure Underlying Psychological Disorders.

Struct Equ Modeling. 2013 Oct 1;20(4). doi: 10.1080/10705511.2013.824786.

Growth Mixture Modeling: A Method for Identifying Differences in Longitudinal Change Among Unobserved Groups.

Int J Behav Dev. 2009;33(6):565-576. doi: 10.1177/0165025409343765.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用潜在变量混合模型在测验编制中识别不变项目。

The use of latent variable mixture models to identify invariant items in test construction.

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSIONS

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献