Suppr超能文献

患者报告结局测量信息系统(PROMIS)抑郁项目库中项目差异功能分析:一种项目反应理论方法。

Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS): An item response theory approach.

作者信息

Teresi Jeanne A, Ocepek-Welikson Katja, Kleinman Marjorie, Eimicke Joseph P, Crane Paul K, Jones Richard N, Lai Jin-Shei, Choi Seung W, Hays Ron D, Reeve Bryce B, Reise Steven P, Pilkonis Paul A, Cella David

机构信息

Columbia University Stroud Center; Faculty of Medicine, New York State Psychiatric Institute.

出版信息

Psychol Sci Q. 2009;51(2):148-180.

Abstract

The aims of this paper are to present findings related to differential item functioning (DIF) in the Patient Reported Outcome Measurement Information System (PROMIS) depression item bank, and to discuss potential threats to the validity of results from studies of DIF. The 32 depression items studied were modified from several widely used instruments. DIF analyses of gender, age and education were performed using a sample of 735 individuals recruited by a survey polling firm. DIF hypotheses were generated by asking content experts to indicate whether or not they expected DIF to be present, and the direction of the DIF with respect to the studied comparison groups. Primary analyses were conducted using the graded item response model (for polytomous, ordered response category data) with likelihood ratio tests of DIF, accompanied by magnitude measures. Sensitivity analyses were performed using other item response models and approaches to DIF detection. Despite some caveats, the items that are recommended for exclusion or for separate calibration were "I felt like crying" and "I had trouble enjoying things that I used to enjoy." The item, "I felt I had no energy," was also flagged as evidencing DIF, and recommended for additional review. On the one hand, false DIF detection (Type 1 error) was controlled to the extent possible by ensuring model fit and purification. On the other hand, power for DIF detection might have been compromised by several factors, including sparse data and small sample sizes. Nonetheless, practical and not just statistical significance should be considered. In this case the overall magnitude and impact of DIF was small for the groups studied, although impact was relatively large for some individuals.

摘要

本文旨在呈现与患者报告结局测量信息系统(PROMIS)抑郁项目库中差异项目功能(DIF)相关的研究结果,并讨论DIF研究结果有效性的潜在威胁。所研究的32个抑郁项目是从几种广泛使用的工具中修改而来的。使用一家调查民意测验公司招募的735名个体样本,对性别、年龄和教育程度进行了DIF分析。通过询问内容专家来生成DIF假设,以表明他们是否预期存在DIF,以及DIF相对于所研究比较组的方向。主要分析采用分级项目反应模型(用于多分类、有序反应类别数据)以及DIF的似然比检验,并伴有效应量测量。使用其他项目反应模型和DIF检测方法进行了敏感性分析。尽管存在一些注意事项,但建议排除或单独校准的项目是“我想哭”和“我难以享受曾经喜欢的事情”。“我觉得自己没有精力”这一项目也被标记为存在DIF证据,并建议进行进一步审查。一方面,通过确保模型拟合和净化尽可能地控制了错误的DIF检测(I型错误)。另一方面,DIF检测的效能可能受到了几个因素的影响,包括数据稀疏和样本量小。尽管如此,应考虑实际意义而非仅仅是统计学意义。在这种情况下,对于所研究的群体,DIF的总体效应量和影响较小,尽管对某些个体的影响相对较大。

相似文献

10
Language-related differential item functioning between English and German PROMIS Depression items is negligible.
Int J Methods Psychiatr Res. 2017 Dec;26(4). doi: 10.1002/mpr.1530. Epub 2016 Oct 16.

引用本文的文献

1
Psychometric properties of the Chinese version of the PROMIS-Cancer-Anxiety item bank assessed using a graded response model.
Asia Pac J Oncol Nurs. 2023 Sep 28;10(12):100312. doi: 10.1016/j.apjon.2023.100312. eCollection 2023 Dec.
2
Development of The Chinese Version of Ultra-Low Vision Visual Functioning Questionnaire-150.
Transl Vis Sci Technol. 2023 Jun 1;12(6):9. doi: 10.1167/tvst.12.6.9.
3
Differential Item Functioning of the Jaw Functional Limitation Scale.
J Oral Facial Pain Headache. 2023 Winter;37(1):33-46. doi: 10.11607/ofph.3026.
4
Tattoo discrimination in Mexico motivates interest in tattoo removal among structurally vulnerable adults.
Front Public Health. 2022 Aug 18;10:894486. doi: 10.3389/fpubh.2022.894486. eCollection 2022.
5
Common measures or common metrics? A plea to harmonize measurement results.
Clin Psychol Psychother. 2022 Sep;29(5):1755-1767. doi: 10.1002/cpp.2742. Epub 2022 Jun 19.
6
Bayesian Approaches for Detecting Differential Item Functioning Using the Generalized Graded Unfolding Model.
Appl Psychol Meas. 2022 Mar;46(2):98-115. doi: 10.1177/01466216211066606. Epub 2022 Feb 10.
7
Patient-reported outcome measures for masticatory function in adults: a systematic review.
BMC Oral Health. 2021 Nov 23;21(1):603. doi: 10.1186/s12903-021-01949-7.
8
Evaluations of the sum-score-based and item response theory-based tests of group mean differences under various simulation conditions.
Stat Methods Med Res. 2021 Dec;30(12):2604-2618. doi: 10.1177/09622802211043263. Epub 2021 Oct 7.
9
Pediatric Palliative Care Parents' Distress, Financial Difficulty, and Child Symptoms.
J Pain Symptom Manage. 2022 Feb;63(2):271-282. doi: 10.1016/j.jpainsymman.2021.08.004. Epub 2021 Aug 20.

本文引用的文献

1
Evaluation of MIMIC-Model Methods for DIF Testing With Comparison to Two-Group Analysis.
Multivariate Behav Res. 2009 Jan-Feb;44(1):1-27. doi: 10.1080/00273170802620121.
2
A Cautionary Note on Using G(2)(dif) to Assess Relative Model Fit in Categorical Data Analysis.
Multivariate Behav Res. 2006 Mar 1;41(1):55-64. doi: 10.1207/s15327906mbr4101_4.
3
Representativeness of the Patient-Reported Outcomes Measurement Information System Internet panel.
J Clin Epidemiol. 2010 Nov;63(11):1169-78. doi: 10.1016/j.jclinepi.2009.11.021. Epub 2010 Aug 5.
5
A simulation study provided sample size guidance for differential item functioning (DIF) studies using short scales.
J Clin Epidemiol. 2009 Mar;62(3):288-95. doi: 10.1016/j.jclinepi.2008.06.003. Epub 2008 Sep 6.
6
Measurement invariance versus selection invariance: is fair selection possible?
Psychol Methods. 2008 Jun;13(2):75-98. doi: 10.1037/1082-989X.13.2.75.
10
The role of the bifactor model in resolving dimensionality issues in health outcomes measures.
Qual Life Res. 2007;16 Suppl 1:19-31. doi: 10.1007/s11136-007-9183-7. Epub 2007 May 4.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验