• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用项目反应理论对数似然比(IRTLR)方法评估测量等价性,以评估项目功能差异(DIF):身体功能能力和一般痛苦测量的应用(附说明)

Evaluating measurement equivalence using the item response theory log-likelihood ratio (IRTLR) method to assess differential item functioning (DIF): applications (with illustrations) to measures of physical functioning ability and general distress.

作者信息

Teresi Jeanne A, Ocepek-Welikson Katja, Kleinman Marjorie, Cook Karon F, Crane Paul K, Gibbons Laura E, Morales Leo S, Orlando-Edelen Maria, Cella David

机构信息

Faculty of Medicine, New York State Psychiatric Institute, Columbia University Stroud Center, New York, NY, USA.

出版信息

Qual Life Res. 2007;16 Suppl 1:43-68. doi: 10.1007/s11136-007-9186-4. Epub 2007 May 5.

DOI:10.1007/s11136-007-9186-4
PMID:17484039
Abstract

BACKGROUND

Methods based on item response theory (IRT) that can be used to examine differential item functioning (DIF) are illustrated. An IRT-based approach to the detection of DIF was applied to physical function and general distress item sets. DIF was examined with respect to gender, age and race. The method used for DIF detection was the item response theory log-likelihood ratio (IRTLR) approach. DIF magnitude was measured using the differences in the expected item scores, expressed as the unsigned probability differences, and calculated using the non-compensatory DIF index (NCDIF). Finally, impact was assessed using expected scale scores, expressed as group differences in the total test (measure) response functions.

METHODS

The example for the illustration of the methods came from a study of 1,714 patients with cancer or HIV/AIDS. The measure contained 23 items measuring physical functioning ability and 15 items addressing general distress, scored in the positive direction.

RESULTS

The substantive findings were of relatively small magnitude DIF. In total, six items showed relatively larger magnitude (expected item score differences greater than the cutoff) of DIF with respect to physical function across the three comparisons: "trouble with a long walk" (race), "vigorous activities" (race, age), "bending, kneeling stooping" (age), "lifting or carrying groceries" (race), "limited in hobbies, leisure" (age), "lack of energy" (race). None of the general distress items evidenced high magnitude DIF; although "worrying about dying" showed some DIF with respect to both age and race, after adjustment.

CONCLUSIONS

The fact that many physical function items showed DIF with respect to age, even after adjustment for multiple comparisons, indicates that the instrument may be performing differently for these groups. While the magnitude and impact of DIF at the item and scale level was minimal, caution should be exercised in the use of subsets of these items, as might occur with selection for clinical decisions or computerized adaptive testing. The issues of selection of anchor items, and of criteria for DIF detection, including the integration of significance and magnitude measures remain as issues requiring investigation. Further research is needed regarding the criteria and guidelines appropriate for DIF detection in the context of health-related items.

摘要

背景

阐述了基于项目反应理论(IRT)可用于检验项目功能差异(DIF)的方法。一种基于IRT的DIF检测方法应用于身体功能和一般困扰项目集。针对性别、年龄和种族对DIF进行了检验。用于DIF检测的方法是项目反应理论对数似然比(IRTLR)方法。使用预期项目得分的差异来衡量DIF大小,以无符号概率差异表示,并使用非补偿性DIF指数(NCDIF)进行计算。最后,使用预期量表得分评估影响,以总测试(测量)反应函数中的组间差异表示。

方法

用于说明这些方法的示例来自一项对1714名癌症或艾滋病毒/艾滋病患者的研究。该测量包含23个测量身体功能能力的项目和15个涉及一般困扰的项目,得分呈正向。

结果

实质性发现是DIF的大小相对较小。总体而言,在三项比较中,共有六个项目在身体功能方面表现出相对较大的DIF(预期项目得分差异大于临界值):“长距离行走困难”(种族)、“剧烈活动”(种族、年龄)、“弯腰、跪、蹲”(年龄)、“提或搬杂货”(种族)、“爱好、休闲受限”(年龄)、“缺乏精力”(种族)。没有一个一般困扰项目显示出高大小的DIF;尽管“担心死亡”在调整后在年龄和种族方面都显示出一些DIF。

结论

即使在对多重比较进行调整之后,许多身体功能项目在年龄方面仍显示出DIF,这一事实表明该工具在这些群体中的表现可能有所不同。虽然项目和量表层面DIF的大小和影响最小,但在使用这些项目的子集时应谨慎,如在临床决策选择或计算机自适应测试中可能出现的情况。锚定项目的选择问题以及DIF检测标准,包括显著性和大小测量的整合,仍然是需要研究的问题。在健康相关项目的背景下,需要进一步研究适用于DIF检测的标准和指南。

相似文献

1
Evaluating measurement equivalence using the item response theory log-likelihood ratio (IRTLR) method to assess differential item functioning (DIF): applications (with illustrations) to measures of physical functioning ability and general distress.使用项目反应理论对数似然比(IRTLR)方法评估测量等价性,以评估项目功能差异(DIF):身体功能能力和一般痛苦测量的应用(附说明)
Qual Life Res. 2007;16 Suppl 1:43-68. doi: 10.1007/s11136-007-9186-4. Epub 2007 May 5.
2
Measurement Equivalence of the Patient Reported Outcomes Measurement Information System (PROMIS) Anxiety Short Forms in Ethnically Diverse Groups.患者报告结局测量信息系统(PROMIS)焦虑简表在不同种族群体中的测量等效性
Psychol Test Assess Model. 2016;58(1):183-219.
3
Measurement Equivalence of the Patient Reported Outcomes Measurement Information System (PROMIS) Pain Interference Short Form Items: Application to Ethnically Diverse Cancer and Palliative Care Populations.患者报告结局测量信息系统(PROMIS)疼痛干扰简表条目的测量等效性:在不同种族癌症和姑息治疗人群中的应用。
Psychol Test Assess Model. 2016;58(2):309-352.
4
A comparison of three sets of criteria for determining the presence of differential item functioning using ordinal logistic regression.使用有序逻辑回归确定差异项目功能存在的三组标准的比较。
Qual Life Res. 2007;16 Suppl 1:69-84. doi: 10.1007/s11136-007-9185-5. Epub 2007 Jun 7.
5
Psychometric Properties and Performance of the Patient Reported Outcomes Measurement Information System (PROMIS) Depression Short Forms in Ethnically Diverse Groups.患者报告结局测量信息系统(PROMIS)抑郁简表在不同种族群体中的心理测量特性及表现
Psychol Test Assess Model. 2016;58(1):141-181.
6
Measurement Equivalence of the Patient Reported Outcomes Measurement Information System (PROMIS) Applied Cognition - General Concerns, Short Forms in Ethnically Diverse Groups.患者报告结局测量信息系统(PROMIS)应用认知量表在不同种族群体中的测量等效性——一般问题及简表
Psychol Test Assess Model. 2016;58(2):255-307.
7
Examination of the Measurement Equivalence of the Functional Assessment in Acute Care MCAT (FAMCAT) Mobility Item Bank Using Differential Item Functioning Analyses.使用差异项目功能分析检验急性护理 MCAT(FAMCAT)移动项目库中功能评估的测量等效性。
Arch Phys Med Rehabil. 2022 May;103(5S):S84-S107.e38. doi: 10.1016/j.apmr.2021.03.044. Epub 2021 Jun 16.
8
Differential item functioning and health assessment.项目功能差异与健康评估。
Qual Life Res. 2007;16 Suppl 1:33-42. doi: 10.1007/s11136-007-9184-6. Epub 2007 Apr 19.
9
Differential item functioning impact in a modified version of the Roland-Morris Disability Questionnaire.罗兰-莫里斯残疾问卷修订版中的项目功能差异影响
Qual Life Res. 2007 Aug;16(6):981-90. doi: 10.1007/s11136-007-9200-x. Epub 2007 Apr 19.
10
Differential Item Functioning in the SF-36 Physical Functioning and Mental Health Sub-Scales: A Population-Based Investigation in the Canadian Multicentre Osteoporosis Study.SF-36身体功能和心理健康子量表中的项目功能差异:加拿大多中心骨质疏松症研究中的一项基于人群的调查。
PLoS One. 2016 Mar 21;11(3):e0151519. doi: 10.1371/journal.pone.0151519. eCollection 2016.

引用本文的文献

1
Uncovering potential interviewer-related biases in self-efficacy assessment: a study among chronic disease patients.揭示自我效能评估中与面试官相关的潜在偏差:一项针对慢性病患者的研究。
BMC Psychol. 2025 Mar 25;13(1):299. doi: 10.1186/s40359-025-02579-2.
2
Exposome Burden Scores to Summarize Environmental Chemical Mixtures: Creating a Fair and Common Scale for Cross-study Harmonization, Report-back and Precision Environmental Health.用于总结环境化学混合物的暴露组负担评分:创建一个公平且通用的尺度以实现跨研究协调、反馈报告和精准环境卫生。
Curr Environ Health Rep. 2025 Feb 18;12(1):13. doi: 10.1007/s40572-024-00467-2.
3
Measuring visual ability in linguistically diverse populations.

本文引用的文献

1
A comparison of three sets of criteria for determining the presence of differential item functioning using ordinal logistic regression.使用有序逻辑回归确定差异项目功能存在的三组标准的比较。
Qual Life Res. 2007;16 Suppl 1:69-84. doi: 10.1007/s11136-007-9185-5. Epub 2007 Jun 7.
2
Measurement in a multi-ethnic society. Overview to the special issue.多民族社会中的测量。特刊概述。
Med Care. 2006 Nov;44(11 Suppl 3):S3-4. doi: 10.1097/01.mlr.0000245437.46695.4a.
3
Different approaches to differential item functioning in health applications. Advantages, disadvantages and some neglected topics.
在语言多样化人群中测量视觉能力。
Behav Res Methods. 2024 Dec 30;57(1):36. doi: 10.3758/s13428-024-02579-x.
4
Applying Latent Variable Models to Estimate Cumulative Exposure Burden to Chemical Mixtures and Identify Latent Exposure Subgroups: A Critical Review and Future Directions.应用潜变量模型估计化学混合物的累积暴露负担并识别潜在暴露亚组:批判性综述与未来方向
Stat Biosci. 2024 Jul;16(2):482-502. doi: 10.1007/s12561-023-09410-9. Epub 2024 Jan 22.
5
Pre-natal and early life lead exposure and childhood inhibitory control: an item response theory approach to improve measurement precision of inhibitory control.产前和生命早期的铅暴露与儿童抑制控制:一种提高抑制控制测量精度的项目反应理论方法。
Environ Health. 2024 Sep 5;23(1):71. doi: 10.1186/s12940-023-01015-5.
6
Exposure to per- and polyfluoroalkyl substances and alterations in plasma microRNA profiles in children.儿童接触全氟和多氟烷基物质与血浆 microRNA 谱的改变。
Environ Res. 2024 Oct 15;259:119496. doi: 10.1016/j.envres.2024.119496. Epub 2024 Jun 25.
7
Phthalate mixtures and insulin resistance: an item response theory approach to quantify exposure burden to phthalate mixtures.邻苯二甲酸酯混合物与胰岛素抵抗:一种用于量化邻苯二甲酸酯混合物暴露负担的项目反应理论方法。
J Expo Sci Environ Epidemiol. 2024 Jul;34(4):581-590. doi: 10.1038/s41370-023-00535-z. Epub 2023 Mar 25.
8
Developing an Exposure Burden Score for Chemical Mixtures Using Item Response Theory, with Applications to PFAS Mixtures.运用项目反应理论为化学混合物开发暴露负担评分,应用于 PFAS 混合物。
Environ Health Perspect. 2022 Nov;130(11):117001. doi: 10.1289/EHP10125. Epub 2022 Nov 2.
9
Methodological and Statistical Considerations for the National Children's Study.全国儿童研究的方法学与统计学考量
Front Pediatr. 2021 Aug 20;9:595059. doi: 10.3389/fped.2021.595059. eCollection 2021.
10
Differential Item Functioning Analyses of the Patient-Reported Outcomes Measurement Information System (PROMIS®) Measures: Methods, Challenges, Advances, and Future Directions.患者报告结局测量信息系统(PROMIS®)测评的项目区分度分析:方法、挑战、进展及未来方向。
Psychometrika. 2021 Sep;86(3):674-711. doi: 10.1007/s11336-021-09775-0. Epub 2021 Jul 12.
健康应用中项目功能差异的不同方法。优点、缺点及一些被忽视的主题。
Med Care. 2006 Nov;44(11 Suppl 3):S152-70. doi: 10.1097/01.mlr.0000245142.74628.ab.
4
Item and scale differential functioning of the Mini-Mental State Exam assessed using the Differential Item and Test Functioning (DFIT) Framework.使用差异项目与测验功能(DFIT)框架评估简易精神状态检查表的项目与量表差异功能。
Med Care. 2006 Nov;44(11 Suppl 3):S143-51. doi: 10.1097/01.mlr.0000245141.70946.29.
5
Identification of differential item functioning using item response theory and the likelihood-based model comparison approach. Application to the Mini-Mental State Examination.使用项目反应理论和基于似然的模型比较方法识别差异项目功能。在简易精神状态检查表中的应用。
Med Care. 2006 Nov;44(11 Suppl 3):S134-42. doi: 10.1097/01.mlr.0000245251.83359.8c.
6
Differential item functioning on the Mini-Mental State Examination. An application of the Mantel-Haenszel and standardization procedures.简易精神状态检查表中的项目功能差异。Mantel-Haenszel法与标准化程序的应用。
Med Care. 2006 Nov;44(11 Suppl 3):S107-14. doi: 10.1097/01.mlr.0000245182.36914.4a.
7
Measuring activity limitations in climbing stairs: development of a hierarchical scale for patients with lower-extremity disorders living at home.测量爬楼梯时的活动受限情况:为居家的下肢疾病患者制定一个分级量表。
Arch Phys Med Rehabil. 2004 Jun;85(6):967-71. doi: 10.1016/j.apmr.2003.11.018.
8
Test bias in a cognitive test: differential item functioning in the CASI.认知测试中的测试偏差:认知能力筛查量表中的项目功能差异
Stat Med. 2004 Jan 30;23(2):241-56. doi: 10.1002/sim.1713.
9
Demographic variation in SF-12 scores: true differences or differential item functioning?SF-12评分中的人口统计学差异:真实差异还是项目功能差异?
Med Care. 2003 Jul;41(7 Suppl):III75-III86. doi: 10.1097/01.MLR.0000076052.42628.CF.
10
Differential item functioning in a Spanish translation of the PTSD checklist: detection and evaluation of impact.创伤后应激障碍检查表西班牙语翻译中的项目功能差异:影响的检测与评估
Psychol Assess. 2002 Mar;14(1):50-9. doi: 10.1037//1040-3590.14.1.50.