四种自举程序估计医院分析中预测与预期比值置信区间的失败。

The failure of four bootstrap procedures for estimating confidence intervals for predicted-to-expected ratios for hospital profiling.

机构信息

ICES, G106, 2075 Bayview Avenue, Toronto, ON, M4N 3M5, Canada.

Institute of Health Management, Policy and Evaluation, University of Toronto, Toronto, ON, Canada.

出版信息

BMC Med Res Methodol. 2022 Oct 14;22(1):271. doi: 10.1186/s12874-022-01739-x.

DOI:10.1186/s12874-022-01739-x

PMID:36241973

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9564085/

Abstract

BACKGROUND

Healthcare provider profiling involves the comparison of outcomes between patients cared for by different healthcare providers. An important component of provider profiling is risk-adjustment so that providers that care for sicker patients are not unfairly penalized. One method for provider profiling entails using random effects logistic regression models to compute provider-specific predicted-to-expected ratios. These ratios compare the predicted number of deaths at a given provider given the case-mix of its patients with the expected number of deaths had those patients been treated at an average provider. Despite the utility of this metric in provider profiling, methods have not been described to estimate confidence intervals for these ratios. The objective of the current study was to evaluate the performance of four bootstrap procedures for estimating 95% confidence intervals for predicted-to-expected ratios.

METHODS

We used Monte Carlo simulations to evaluate four bootstrap procedures: the naïve bootstrap, a within cluster-bootstrap, the parametric multilevel bootstrap, and a novel cluster-specific parametric bootstrap. The parameters of the data-generating process were informed by empirical analyses of patients hospitalized with acute myocardial infarction. Three factors were varied in the simulations: the number of subjects per cluster, the intraclass correlation coefficient for the binary outcome, and the prevalence of the outcome. We examined coverage rates of both normal-theory bootstrap confidence intervals and bootstrap percentile intervals.

RESULTS

In general, all four bootstrap procedures resulted in inaccurate estimates of the standard error of cluster-specific predicted-to-expected ratios. Similarly, all four bootstrap procedures resulted in 95% confidence intervals whose empirical coverage rates were different from the advertised rate. In many scenarios the empirical coverage rates were substantially lower than the advertised rate.

CONCLUSION

Existing bootstrap procedures should not be used to compute confidence intervals for predicted-to-expected ratios when conducting provider profiling.

摘要

背景

医疗保健提供者分析涉及比较由不同医疗保健提供者照顾的患者的结果。提供者分析的一个重要组成部分是风险调整，以便不会对照顾病情较重患者的提供者进行不公平的惩罚。提供者分析的一种方法是使用随机效应逻辑回归模型来计算提供者特定的预测与预期比率。这些比率将给定提供者的患者人群中给定患者的死亡预测数与平均提供者治疗的情况下的预期死亡数进行比较。尽管该指标在提供者分析中很有用，但尚未描述用于估计这些比率的置信区间的方法。本研究的目的是评估四种自举程序在估计预测与预期比率的 95%置信区间方面的性能。

方法

我们使用蒙特卡罗模拟来评估四种自举程序：简单自举、聚类内自举、参数多层自举和新的聚类特定参数自举。数据生成过程的参数是通过对急性心肌梗死住院患者的实证分析得出的。在模拟中变化了三个因素：每个聚类的受试者数量、二项结局的组内相关系数和结局的流行率。我们检查了正态理论自举置信区间和自举百分位区间的覆盖率。