NH & MRC Centre for Clinical Eye Research, Discipline of Optometry and Vision Science, Flinders Medical Centre and Flinders University of South Australia, Adelaide, South Australia, 5042, Australia.
Health Qual Life Outcomes. 2012 Jul 13;10:80. doi: 10.1186/1477-7525-10-80.
A critical component that influences the measurement properties of a patient-reported outcome (PRO) instrument is the rating scale. Yet, there is a lack of general consensus regarding optimal rating scale format, including aspects of question structure, the number and the labels of response categories. This study aims to explore the characteristics of rating scales that function well and those that do not, and thereby develop guidelines for formulating rating scales.
Seventeen existing PROs designed to measure vision-related quality of life dimensions were mailed for self-administration, in sets of 10, to patients who were on a waiting list for cataract extraction. These PROs included questions with ratings of difficulty, frequency, severity, and global ratings. Using Rasch analysis, performance of rating scales were assessed by examining hierarchical ordering (indicating categories are distinct from each other and follow a logical transition from lower to higher value), evenness (indicating relative utilization of categories), and range (indicating coverage of the attribute by the rating scale).
The rating scales with complicated question format, a large number of response categories, or unlabelled categories, tended to be dysfunctional. Rating scales with five or fewer response categories tended to be functional. Most of the rating scales measuring difficulty performed well. The rating scales measuring frequency and severity demonstrated hierarchical ordering but the categories lacked even utilization.
Developers of PRO instruments should use a simple question format, fewer (four to five) and labelled response categories.
影响患者报告结局(PRO)工具测量特性的一个关键组成部分是评分量表。然而,关于最佳评分量表格式,包括问题结构、反应类别数量和标签等方面,尚未达成普遍共识。本研究旨在探讨功能良好和功能不佳的评分量表的特征,并制定制定评分量表的指南。
将 17 种现有的旨在测量与视觉相关的生活质量维度的 PRO 寄给等候白内障手术的患者进行自我管理,每组 10 个。这些 PRO 包括对难度、频率、严重程度和总体评分的评估。使用 Rasch 分析,通过检查等级顺序(表明类别彼此不同,并遵循从低到高值的逻辑过渡)、均匀性(表明类别相对利用率)和范围(表明评分量表对属性的覆盖范围)来评估评分量表的性能。
具有复杂问题格式、大量反应类别或无标签类别的评分量表往往功能失调。具有五个或更少反应类别的评分量表往往具有功能。大多数测量难度的评分量表表现良好。测量频率和严重程度的评分量表表现出等级顺序,但类别缺乏均匀利用。
PRO 工具的开发者应使用简单的问题格式,较少(四到五个)和标记的反应类别。