• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

运用经典测试理论、项目反应理论和拉施测量理论评估患者报告的结局指标:实例比较

Using classical test theory, item response theory, and Rasch measurement theory to evaluate patient-reported outcome measures: a comparison of worked examples.

作者信息

Petrillo Jennifer, Cano Stefan J, McLeod Lori D, Coon Cheryl D

机构信息

Novartis AG, Basel, Switzerland.

Plymouth University Peninsula Schools of Medicine and Dentistry, Plymouth, UK.

出版信息

Value Health. 2015 Jan;18(1):25-34. doi: 10.1016/j.jval.2014.10.005.

DOI:10.1016/j.jval.2014.10.005
PMID:25595231
Abstract

OBJECTIVE

To provide comparisons and a worked example of item- and scale-level evaluations based on three psychometric methods used in patient-reported outcome development-classical test theory (CTT), item response theory (IRT), and Rasch measurement theory (RMT)-in an analysis of the National Eye Institute Visual Functioning Questionnaire (VFQ-25).

METHODS

Baseline VFQ-25 data from 240 participants with diabetic macular edema from a randomized, double-masked, multicenter clinical trial were used to evaluate the VFQ at the total score level. CTT, RMT, and IRT evaluations were conducted, and results were assessed in a head-to-head comparison.

RESULTS

Results were similar across the three methods, with IRT and RMT providing more detailed diagnostic information on how to improve the scale. CTT led to the identification of two problematic items that threaten the validity of the overall scale score, sets of redundant items, and skewed response categories. IRT and RMT additionally identified poor fit for one item, many locally dependent items, poor targeting, and disordering of over half the response categories.

CONCLUSIONS

Selection of a psychometric approach depends on many factors. Researchers should justify their evaluation method and consider the intended audience. If the instrument is being developed for descriptive purposes and on a restricted budget, a cursory examination of the CTT-based psychometric properties may be all that is possible. In a high-stakes situation, such as the development of a patient-reported outcome instrument for consideration in pharmaceutical labeling, however, a thorough psychometric evaluation including IRT or RMT should be considered, with final item-level decisions made on the basis of both quantitative and qualitative results.

摘要

目的

在对美国国立眼科研究所视觉功能问卷(VFQ - 25)的分析中,基于患者报告结局发展中使用的三种心理测量方法——经典测验理论(CTT)、项目反应理论(IRT)和拉施测量理论(RMT),提供项目层面和量表层面评估的比较及实例。

方法

来自一项随机、双盲、多中心临床试验的240名糖尿病性黄斑水肿患者的基线VFQ - 25数据用于在总分水平上评估VFQ。进行了CTT、RMT和IRT评估,并对结果进行了直接比较。

结果

三种方法的结果相似,IRT和RMT提供了关于如何改进量表的更详细诊断信息。CTT识别出两个有问题的项目,这些项目威胁到整体量表分数的有效性、冗余项目集以及偏态的反应类别。IRT和RMT还额外识别出一个项目拟合不佳、许多局部依赖项目、靶向性差以及超过一半的反应类别无序。

结论

心理测量方法的选择取决于许多因素。研究人员应说明其评估方法并考虑目标受众。如果该工具是出于描述目的且预算有限而开发的,那么对基于CTT的心理测量特性进行粗略检查可能就是所能做的一切。然而,在高风险情况下,例如开发用于药品标签考虑的患者报告结局工具时,应考虑进行包括IRT或RMT在内的全面心理测量评估,并根据定量和定性结果做出最终的项目层面决策。

相似文献

1
Using classical test theory, item response theory, and Rasch measurement theory to evaluate patient-reported outcome measures: a comparison of worked examples.运用经典测试理论、项目反应理论和拉施测量理论评估患者报告的结局指标:实例比较
Value Health. 2015 Jan;18(1):25-34. doi: 10.1016/j.jval.2014.10.005.
2
THE DEPRESSION INVENTORY DEVELOPMENT SCALE: Assessment of Psychometric Properties Using Classical and Modern Measurement Theory in a CAN-BIND Trial.抑郁量表发展量表:在CAN - BIND试验中使用经典和现代测量理论对心理测量特性进行评估
Innov Clin Neurosci. 2020 Jul 1;17(7-9):30-40.
3
State of the psychometric methods: comments on the ISOQOL SIG psychometric papers.心理测量方法的现状:对国际生活质量研究学会(ISOQOL)特别兴趣小组心理测量论文的评论
J Patient Rep Outcomes. 2019 Jul 30;3(1):49. doi: 10.1186/s41687-019-0134-1.
4
Rasch analysis of the quality of life and vision function questionnaire.生活质量与视觉功能问卷的拉施分析
Optom Vis Sci. 2009 Jul;86(7):E836-44. doi: 10.1097/OPX.0b013e3181ae1ec7.
5
Development and validation of the McGill body image concerns scale for use in head and neck oncology (MBIS-HNC): A mixed-methods approach.发展和验证用于头颈部肿瘤学的麦吉尔身体意象关注量表(MBIS-HNC):一种混合方法。
Psychooncology. 2019 Jan;28(1):116-121. doi: 10.1002/pon.4918. Epub 2018 Nov 12.
6
A primer on classical test theory and item response theory for assessments in medical education.医学教育评估中的经典测量理论和项目反应理论简介。
Med Educ. 2010 Jan;44(1):109-17. doi: 10.1111/j.1365-2923.2009.03425.x.
7
Improving the evaluation of therapeutic interventions in multiple sclerosis: the role of new psychometric methods.改善多发性硬化症治疗干预措施的评估:新心理测量方法的作用。
Health Technol Assess. 2009 Feb;13(12):iii, ix-x, 1-177. doi: 10.3310/hta13120.
8
A critique of Rasch analysis using the Dyspnoea-12 as an illustrative example.使用呼吸困难量表 12 作为说明性示例对 Rasch 分析的批判。
J Adv Nurs. 2012 Jan;68(1):191-8. doi: 10.1111/j.1365-2648.2011.05723.x. Epub 2011 Jun 9.
9
Measuring the ICF components of impairment, activity limitation and participation restriction: an item analysis using classical test theory and item response theory.测量损伤、活动受限和参与限制的国际功能、残疾和健康分类(ICF)成分:使用经典测试理论和项目反应理论的项目分析
Health Qual Life Outcomes. 2009 May 7;7:41. doi: 10.1186/1477-7525-7-41.
10
The Impact of Vision Impairment Questionnaire: an evaluation of its measurement properties using Rasch analysis.视力损害问卷的影响:使用拉施分析对其测量属性的评估。
Invest Ophthalmol Vis Sci. 2006 Nov;47(11):4732-41. doi: 10.1167/iovs.06-0220.

引用本文的文献

1
Assessment of Dysfunctional Grief due to Death from COVID-19 in Peru: Adaptation and Validation of a Spanish Version of the Pandemic Grief Scale.秘鲁因新冠疫情死亡导致的功能失调性悲伤评估:《大流行悲伤量表》西班牙语版本的改编与验证
Trends Psychol. 2021;29(4):595-616. doi: 10.1007/s43076-021-00091-1. Epub 2021 Jul 14.
2
A Rasch analysis of the High Potential Trait Indicator: A South African sample.高潜力特质指标的拉施分析:一个南非样本。
Afr J Psychol Assess. 2023 Feb 8;5:115. doi: 10.4102/ajopa.v5i0.115. eCollection 2023.
3
Validation of the short index of job satisfaction in Chinese nurses: classical test theory and item response theory.
中国护士工作满意度简短指标的验证:经典测试理论与项目反应理论
Int J Nurs Stud Adv. 2025 Mar 22;8:100321. doi: 10.1016/j.ijnsa.2025.100321. eCollection 2025 Jun.
4
Validation of the Thai World Health Organization Quality of Life-OLD (WHOQOL-OLD) among Thai older adults: Rasch analysis.泰国老年人中世界卫生组织老年生活质量量表(WHOQOL-OLD)的效度验证:拉施分析
Sci Rep. 2025 Apr 15;15(1):12978. doi: 10.1038/s41598-025-97824-4.
5
Gender-specific changes in vision-related quality of life over time - results from the population-based Gutenberg Health Study.基于人群的古登堡健康研究结果:随时间推移与视力相关的生活质量的性别差异变化
Graefes Arch Clin Exp Ophthalmol. 2025 Feb 11. doi: 10.1007/s00417-025-06741-9.
6
Psychometric properties and post-hoc CAT analysis of the pediatric PROMIS® item banks anxiety and depressive symptoms in a combined Swedish Child and Adolescent Psychiatry and School sample.瑞典儿童青少年精神病学与学校联合样本中儿科患者报告结果测量信息系统(PROMIS®)焦虑和抑郁症状项目库的心理测量特性及事后项目特征曲线分析
Qual Life Res. 2025 May;34(5):1265-1275. doi: 10.1007/s11136-025-03898-y. Epub 2025 Jan 30.
7
Measurement Properties of the Dysphagiameter for the Assessment of Dysphagia in Oculopharyngeal Muscular Dystrophy.用于评估眼咽型肌营养不良吞咽困难的吞咽直径测量属性
Dysphagia. 2024 Dec 21. doi: 10.1007/s00455-024-10791-2.
8
Investigating item response theory model performance in the context of evaluating clinical outcome assessments in clinical trials.在评估临床试验中临床结局评估的背景下研究项目反应理论模型的性能。
Qual Life Res. 2025 Apr;34(4):1125-1136. doi: 10.1007/s11136-024-03873-z. Epub 2024 Dec 12.
9
Rasch Measurement Theory (RMT) Analyses of the Huntington's Disease Everyday Functioning (Hi-DEF) to Evaluate Item Fit and Performance.Rasch 测量理论(RMT)分析亨廷顿病日常生活功能(Hi-DEF),以评估项目拟合度和表现。
J Huntingtons Dis. 2024;13(3):385-397. doi: 10.3233/JHD-240001.
10
Psychometric Properties of the Teleprimary Care Oral Health Clinical Information System (TPC-OHCIS) Questionnaire Using the Rasch Model.使用拉施模型的远程初级保健口腔健康临床信息系统(TPC-OHCIS)问卷的心理测量特性
Cureus. 2024 Jun 24;16(6):e63064. doi: 10.7759/cureus.63064. eCollection 2024 Jun.