• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在评估临床试验中临床结局评估的背景下研究项目反应理论模型的性能。

Investigating item response theory model performance in the context of evaluating clinical outcome assessments in clinical trials.

作者信息

Ayasse Nicolai D, Coon Cheryl D

机构信息

Clinical Outcome Assessment Program, Critical Path Institute, Tucson, AZ, USA.

出版信息

Qual Life Res. 2025 Apr;34(4):1125-1136. doi: 10.1007/s11136-024-03873-z. Epub 2024 Dec 12.

DOI:10.1007/s11136-024-03873-z
PMID:39666253
Abstract

PURPOSE

Item response theory (IRT) models are an increasingly popular method choice for evaluating clinical outcome assessments (COAs) for use in clinical trials. Given common constraints in clinical trial design, such as limits on sample size and assessment lengths, the current study aimed to examine the appropriateness of commonly used polytomous IRT models, specifically the graded response model (GRM) and partial credit model (PCM), in the context of how they are frequently used for psychometric evaluation of COAs in clinical trials.

METHODS

Data were simulated under varying sample sizes, measure lengths, response category numbers, and slope strengths, as well as under conditions that violated some model assumptions, namely, unidimensionality and equality of item slopes. Model fit, detection of item local dependence, and detection of item misfit were all examined to identify conditions where one model may be preferable or results may contain a degree of bias.

RESULTS

For unidimensional item sets and equal item slopes, the PCM and GRM performed similarly, and GRM performance remained consistent as slope variability increased. For not-unidimensional item sets, the PCM was somewhat more sensitive to this unidimensionality violation. Looking across conditions, the PCM did not demonstrate a clear advantage over the GRM for small sample sizes or shorter measure lengths.

CONCLUSION

Overall, the GRM and the PCM each demonstrated advantages and disadvantages depending on underlying data conditions and the model outcome investigated. We recommend careful consideration of the known, or expected, data characteristics when choosing a model and interpreting its results.

摘要

目的

项目反应理论(IRT)模型是评估用于临床试验的临床结局评估(COA)时越来越常用的方法选择。鉴于临床试验设计中的常见限制,如样本量和评估长度的限制,本研究旨在探讨常用的多分类IRT模型,特别是等级反应模型(GRM)和部分计分模型(PCM),在它们常用于临床试验中COA的心理测量评估的背景下的适用性。

方法

在不同的样本量、测量长度、反应类别数量和斜率强度下模拟数据,以及在违反一些模型假设的条件下,即单维性和项目斜率相等的条件下模拟数据。检查模型拟合、项目局部依赖性的检测和项目不拟合的检测,以确定一个模型可能更可取或结果可能存在一定程度偏差的条件。

结果

对于单维项目集和相等的项目斜率,PCM和GRM的表现相似,并且随着斜率变异性的增加,GRM的表现保持一致。对于非单维项目集,PCM对这种单维性违反更为敏感。综合各种条件来看,对于小样本量或较短的测量长度,PCM并没有显示出比GRM有明显优势。

结论

总体而言,GRM和PCM各有优缺点,这取决于潜在的数据条件和所研究的模型结果。我们建议在选择模型并解释其结果时,仔细考虑已知的或预期的数据特征。

相似文献

1
Investigating item response theory model performance in the context of evaluating clinical outcome assessments in clinical trials.在评估临床试验中临床结局评估的背景下研究项目反应理论模型的性能。
Qual Life Res. 2025 Apr;34(4):1125-1136. doi: 10.1007/s11136-024-03873-z. Epub 2024 Dec 12.
2
Psychometric properties for the Balanced Inventory of Desirable Responding: dichotomous versus polytomous conventional and IRT scoring.效度量表为理想反应的平衡量表:二分法与多分法传统和IRT 评分。
Psychol Assess. 2014 Sep;26(3):878-91. doi: 10.1037/a0036430. Epub 2014 Apr 7.
3
Patient Satisfaction with Navigator Interpersonal Relationship (PSN-I): item-level psychometrics using IRT analysis.患者对导航员人际关系满意度量表(PSN-I):使用IRT 分析的项目级心理测量学。
Support Care Cancer. 2020 Feb;28(2):541-550. doi: 10.1007/s00520-019-04833-x. Epub 2019 May 10.
4
Getting started with the graded response model: An introduction and tutorial in R.分级反应模型入门:R语言中的介绍与教程
Int J Psychol. 2025 Feb;60(1):e13265. doi: 10.1002/ijop.13265. Epub 2024 Nov 12.
5
Graded response model fit, measurement invariance and (comparative) precision of the Dutch-Flemish PROMIS® Upper Extremity V2.0 item bank in patients with upper extremity disorders.荷兰-佛兰芒 PROMIS®上肢 V2.0 项目库在上肢疾病患者中的分级反应模型拟合、测量不变性和(比较)精度。
BMC Musculoskelet Disord. 2020 Mar 16;21(1):170. doi: 10.1186/s12891-020-3178-8.
6
Comparing five depression measures in depressed Chinese patients using item response theory: an examination of item properties, measurement precision and score comparability.运用项目反应理论比较中国抑郁症患者的五种抑郁量表:项目特性、测量精度及分数可比性检验
Health Qual Life Outcomes. 2017 Apr 4;15(1):60. doi: 10.1186/s12955-017-0631-y.
7
A comparison of three polytomous item response theory models in the context of testlet scoring.在分块计分背景下三种多分类项目反应理论模型的比较
J Outcome Meas. 1999;3(1):1-20.
8
Assessing fit of alternative unidimensional polytomous IRT models using posterior predictive model checking.使用后验预测模型检查评估替代单维多项 IRT 模型的拟合情况。
Psychol Methods. 2017 Jun;22(2):397-408. doi: 10.1037/met0000082. Epub 2016 May 30.
9
Item Response Theory Validation of the Forgotten Joint Score for Persons Undergoing Total Knee Replacement.全膝关节置换术后遗忘关节评分的项目反应理论验证。
J Bone Joint Surg Am. 2024 Jun 19;106(12):1091-1099. doi: 10.2106/JBJS.23.00814. Epub 2024 Mar 19.
10
Barthel Index of activities of daily living: item response theory analysis of ratings for long-term care residents.日常生活活动能力巴氏指数:长期护理居民评分的项目反应理论分析
Nurs Res. 2015 Mar-Apr;64(2):88-99. doi: 10.1097/NNR.0000000000000072.

本文引用的文献

1
The importance of patient-reported outcomes in clinical trials and strategies for future optimization.患者报告结局在临床试验中的重要性及未来优化策略。
Patient Relat Outcome Meas. 2018 Nov 1;9:353-367. doi: 10.2147/PROM.S156279. eCollection 2018.
2
Power and Sample Size Calculations in Clinical Trials with Patient-Reported Outcomes under Equal and Unequal Group Sizes Based on Graded Response Model: A Simulation Study.基于等级反应模型的相等和不相等组大小情况下患者报告结局的临床试验中的功效和样本量计算:一项模拟研究
Value Health. 2016 Jul-Aug;19(5):639-47. doi: 10.1016/j.jval.2016.03.1857. Epub 2016 Jul 29.
3
Using classical test theory, item response theory, and Rasch measurement theory to evaluate patient-reported outcome measures: a comparison of worked examples.
运用经典测试理论、项目反应理论和拉施测量理论评估患者报告的结局指标:实例比较
Value Health. 2015 Jan;18(1):25-34. doi: 10.1016/j.jval.2014.10.005.
4
The Benefits of Patient Involvement for Translational Research.患者参与转化研究的益处。
Health Care Anal. 2017 Sep;25(3):225-241. doi: 10.1007/s10728-014-0289-0.
5
Overview of classical test theory and item response theory for the quantitative assessment of items in developing patient-reported outcomes measures.开发患者报告结局指标时用于项目定量评估的经典测试理论和项目反应理论概述。
Clin Ther. 2014 May;36(5):648-62. doi: 10.1016/j.clinthera.2014.04.006. Epub 2014 May 5.
6
An introduction to item response theory for patient-reported outcome measurement.患者报告结局测量的项目反应理论介绍。
Patient. 2014;7(1):23-35. doi: 10.1007/s40271-013-0041-0.
7
Patient-reported outcomes in randomized clinical trials: development of ISOQOL reporting standards.随机临床试验中的患者报告结局:ISOQOL 报告标准的制定。
Qual Life Res. 2013 Aug;22(6):1161-75. doi: 10.1007/s11136-012-0252-1. Epub 2012 Sep 18.
8
A review of patient-reported outcome labels in the United States: 2006 to 2010.美国患者报告结局标签的回顾:2006 年至 2010 年。
Value Health. 2012 May;15(3):437-42. doi: 10.1016/j.jval.2011.11.032. Epub 2012 Feb 2.
9
Democratizing clinical research.使临床研究民主化。
Nature. 2011 Jun 15;474(7351):277-8. doi: 10.1038/474277a.
10
Identifying and prioritizing uncertainties: patient and clinician engagement in the identification of research questions.确定和优先处理不确定性:患者和临床医生参与确定研究问题。
J Eval Clin Pract. 2010 Jun;16(3):627-31. doi: 10.1111/j.1365-2753.2009.01262.x. Epub 2010 May 5.