Suppr超能文献

计算机自适应测试(CAT)在头痛影响评估中的应用。

Applications of computerized adaptive testing (CAT) to the assessment of headache impact.

作者信息

Ware John E, Kosinski Mark, Bjorner Jakob B, Bayliss Martha S, Batenhorst Alice, Dahlöf Carl G H, Tepper Stewart, Dowson Andrew

机构信息

QualityMetric Incorporated, Lincoln, RI 02895, USA.

出版信息

Qual Life Res. 2003 Dec;12(8):935-52. doi: 10.1023/a:1026115230284.

Abstract

OBJECTIVE

To evaluate the feasibility of computerized adaptive testing (CAT) and the reliability and validity of CAT-based estimates of headache impact scores in comparison with 'static' surveys.

METHODS

Responses to the 54-item Headache Impact Test (HIT) were re-analyzed for recent headache sufferers (n = 1016) who completed telephone interviews during the National Survey of Headache Impact (NSHI). Item response theory (IRT) calibrations and the computerized dynamic health assessment (DYNHA) software were used to simulate CAT assessments by selecting the most informative items for each person and estimating impact scores according to pre-set precision standards (CAT-HIT). Results were compared with IRT estimates based on all items (total-HIT), computerized 6-item dynamic estimates (CAT-HIT-6), and a developmental version of a 'static' 6-item form (HIT-6-D). Analyses focused on: respondent burden (survey length and administration time), score distributions ('ceiling' and 'floor' effects), reliability and standard errors, and clinical validity (diagnosis, level of severity). A random sample (n = 245) was re-assessed to test responsiveness. A second study (n = 1103) compared actual CAT surveys and an improved 'static' HIT-6 among current headache sufferers sampled on the Internet. Respondents completed measures from the first study and the generic SF-8 Health Survey; some (n = 540) were re-tested on the Internet after 2 weeks.

RESULTS

In the first study, simulated CAT-HIT and total-HIT scores were highly correlated (r = 0.92) without 'ceiling' or 'floor' effects and with a substantial reduction (90.8%) in respondent burden. Six of the 54 items accounted for the great majority of item administrations (3603/5028, 77.6%). CAT-HIT reliability estimates were very high (0.975-0.992) in the range where 95% of respondents scored, and relative validity (RV) coefficients were high for diagnosis (RV = 0.87) and severity (RV = 0.89); patient-level classifications were accurate 91.3% for a diagnosis of migraine. For all three criteria of change, CAT-HIT scores were more responsive than all other measures. In the second study, estimates of respondent burden, item usage, reliability and clinical validity were replicated. The test-retest reliability of CAT-HIT was 0.79 and alternate forms coefficients ranged from 0.85 to 0.91. All correlations with the generic SF-8 were negative.

CONCLUSIONS

CAT-based administrations of headache impact items achieved very large reductions in respondent burden without compromising validity for purposes of patient screening or monitoring changes in headache impact over time. IRT models and CAT-based dynamic health assessments warrant testing among patients with other conditions.

摘要

目的

评估计算机自适应测试(CAT)的可行性,以及与“静态”调查相比,基于CAT的头痛影响评分估计值的可靠性和有效性。

方法

对在全国头痛影响调查(NSHI)期间完成电话访谈的近期头痛患者(n = 1016)对54项头痛影响测试(HIT)的回答进行重新分析。使用项目反应理论(IRT)校准和计算机化动态健康评估(DYNHA)软件,通过为每个人选择信息量最大的项目并根据预设精度标准估计影响评分来模拟CAT评估(CAT-HIT)。将结果与基于所有项目的IRT估计值(总HIT)、计算机化6项动态估计值(CAT-HIT-6)以及“静态”6项表格的开发版本(HIT-6-D)进行比较。分析重点在于:受访者负担(调查长度和管理时间)、评分分布(“天花板”和“地板”效应)、可靠性和标准误差以及临床有效性(诊断、严重程度级别)。对一个随机样本(n = 245)进行重新评估以测试反应性。第二项研究(n = 1103)比较了当前在互联网上抽样的头痛患者中实际的CAT调查和改进后的“静态”HIT-6。受访者完成了第一项研究中的测量以及通用的SF-8健康调查;一些人(n = 540)在2周后在互联网上进行了重新测试。

结果

在第一项研究中,模拟的CAT-HIT和总HIT分数高度相关(r = 0.92),没有“天花板”或“地板”效应,并且受访者负担大幅减轻(90.8%)。54个项目中的六个占项目管理的绝大部分(3603/5028,77.6%)。在95%的受访者得分范围内,CAT-HIT可靠性估计值非常高(0.975 - 0.992),诊断的相对有效性(RV)系数很高(RV = 0.87),严重程度的RV系数也很高(RV = 0.89);偏头痛诊断的患者水平分类准确率为91.3%。对于所有三个变化标准,CAT-HIT分数比所有其他测量方法更具反应性。在第二项研究中,受访者负担、项目使用、可靠性和临床有效性的估计得到了重复。CAT-HIT的重测可靠性为0.79,替代形式系数范围为0.85至0.91。与通用SF-8的所有相关性均为负。

结论

基于CAT的头痛影响项目管理在不影响患者筛查或监测头痛影响随时间变化的有效性的情况下,极大地减轻了受访者负担。IRT模型和基于CAT的动态健康评估值得在患有其他疾病的患者中进行测试。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验