Li Yue, Cai Xueya, Glance Laurent G, Spector William D, Mukamel Dana B
Department of Medicine, University of California, Irvine, CA 92697, USA.
Health Serv Res. 2009 Feb;44(1):79-102. doi: 10.1111/j.1475-6773.2008.00910.x.
To determine how alternative statistical risk-adjustment methods may affect the quality measures (QMs) in nursing home (NH) report cards.
DATA SOURCES/STUDY SETTINGS: Secondary data from the national Minimum Data Set files of 2004 and 2005 that include 605,433 long-term residents in 9,336 facilities.
We estimated risk-adjusted QMs of decline in activities of daily living (ADL) functioning using classical, fixed-effects, and random-effects logistic models. Risk-adjusted QMs were compared with each other, and with the published QM (unadjusted) in identifying high- and low-quality facilities by either the rankings or 95 percent confidence intervals of QMs.
Risk-adjusted QMs showed better overall agreement (or convergent validity) with each other than did the unadjusted versus each adjusted QM; the disagreement rate between unadjusted and adjusted QM can be as high as 48 percent. The risk-adjusted QM derived from the random-effects shrinkage estimator deviated nonrandomly from other risk-adjusted estimates in identifying the best 10 percent facilities using rankings.
The extensively risk-adjusted QMs of ADL decline, even when estimated by alternative statistical methods, show higher convergent validity and provide more robust NH comparisons than the unadjusted QM. Outcome rankings based on ADL decline tend to show lower convergent validity when estimated by the shrinkage estimator rather than other statistical methods.
确定替代统计风险调整方法如何影响疗养院报告卡中的质量指标(QM)。
数据来源/研究背景:2004年和2005年国家最低数据集文件中的二手数据,包括9336个机构中的605433名长期居民。
我们使用经典、固定效应和随机效应逻辑模型估计日常生活活动(ADL)功能下降的风险调整质量指标。对风险调整后的质量指标进行相互比较,并与已发表的质量指标(未调整)进行比较,通过质量指标的排名或95%置信区间来确定高质量和低质量的机构。
与未调整的质量指标和每个调整后的质量指标相比,风险调整后的质量指标之间总体一致性更好(或收敛效度更高);未调整和调整后的质量指标之间的不一致率可能高达48%。在使用排名确定最佳的10%机构时,随机效应收缩估计器得出的风险调整质量指标与其他风险调整估计值存在非随机偏差。
即使通过替代统计方法估计,广泛风险调整后的ADL下降质量指标也比未调整的质量指标具有更高的收敛效度,并能提供更稳健的疗养院比较。当通过收缩估计器而不是其他统计方法估计时,基于ADL下降的结果排名往往显示出较低的收敛效度。