Jones Hayley E, Spiegelhalter David J
MRC Biostatistics Unit, Institute of Public Health, Cambridge, U.K.
Stat Med. 2009 May 30;28(12):1645-67. doi: 10.1002/sim.3583.
Recent changes in individual units are often of interest when monitoring and assessing the performance of healthcare providers. We consider three high profile examples: (a) annual teenage pregnancy rates in English local authorities, (b) quarterly rates of the hospital-acquired infection Clostridium difficile in National Health Service (NHS) Trusts and (c) annual mortality rates following heart surgery in New York State hospitals. Increasingly, government targets call for continual improvements, in each individual provider as well as overall.Owing to the well-known statistical phenomenon of regression-to-the-mean, observed changes between just two measurements are potentially misleading. This problem has received much attention in other areas, but there is a need for guidelines within performance monitoring.In this paper we show theoretically and with worked examples that a simple random effects predictive distribution can be used to 'correct' for the potentially undesirable consequences of regression-to-the-mean on a test for individual change. We discuss connections to the literature in other fields, and build upon this, in particular by examining the effect of the correction on the power to detect genuine changes. It is demonstrated that a gain in average power can be expected, but that this gain is only very slight if the providers are very different from one another, for example due to poor risk adjustment. Further, the power of the corrected test depends on the provider's baseline rate and, although large gains can be expected for some providers, this is at the cost of some power to detect real changes in others.
在监测和评估医疗服务提供者的表现时,各个单位最近的变化通常备受关注。我们考虑三个备受瞩目的例子:(a)英国地方当局的年度青少年怀孕率;(b)国民保健服务(NHS)信托机构中艰难梭菌医院感染的季度发生率;(c)纽约州医院心脏手术后的年度死亡率。政府目标越来越多地要求每个个体提供者以及整体都持续改进。由于回归均值这一众所周知的统计现象,仅两次测量之间观察到的变化可能会产生误导。这个问题在其他领域已受到广泛关注,但在绩效监测中需要相关指南。在本文中,我们通过理论和实例表明,一个简单的随机效应预测分布可用于“校正”回归均值对个体变化检验可能产生的不良后果。我们讨论与其他领域文献的联系,并在此基础上进一步探讨,特别是研究校正对检测真正变化的功效的影响。结果表明,平均功效有望提高,但如果提供者彼此差异很大,例如由于风险调整不佳,这种提高幅度将非常小。此外,校正后检验的功效取决于提供者的基线率,虽然某些提供者有望大幅提高功效,但这是以牺牲检测其他提供者真实变化的部分功效为代价的。