Shahian David M, Silverstein Treacy, Lovett Ann F, Wolf Robert E, Normand Sharon-Lise T
Tufts University School of Medicine, Boston, Mass, USA.
Circulation. 2007 Mar 27;115(12):1518-27. doi: 10.1161/CIRCULATIONAHA.106.633008. Epub 2007 Mar 12.
Regardless of statistical methodology, public performance report cards must use the highest-quality validated data, preferably from a prospectively maintained clinical database. Using logistic regression and hierarchical models, we compared hospital cardiac surgery profiling results based on clinical data with those derived from contemporaneous administrative data.
Fiscal year 2003 isolated coronary artery bypass grafting surgery results based on an audited and validated Massachusetts clinical registry were compared with those derived from a contemporaneous state administrative database, the latter using the inclusion/exclusion criteria and risk model of the Agency for Healthcare Research and Quality. There was a 27.4% disparity in isolated coronary artery bypass grafting surgery volume (4440 clinical, 5657 administrative), a 0.83% difference in observed in-hospital mortality (2.05% versus 2.88%), corresponding differences in risk-adjusted mortality calculated by various statistical methodologies, and 1 hospital classified as an outlier only with the administrative data-based approach. The discrepancies in volumes and risk-adjusted mortality were most notable for higher-volume programs that presumably perform a higher proportion of combined procedures that were misclassified as isolated coronary artery bypass grafting surgery in the administrative cohort. Subsequent analyses of a patient cohort common to both databases revealed the smoothing effect of hierarchical models, a 9% relative difference in mortality (2.21% versus 2.03%) resulting from nonstandardized mortality end points, and 1 hospital classified as an outlier using logistic regression but not using hierarchical regression.
Cardiac surgery report cards using administrative data are problematic compared with those derived from audited and validated clinical data, primarily because of case misclassification and nonstandardized end points.
无论采用何种统计方法,公开的绩效报告卡都必须使用质量最高的经过验证的数据,最好来自前瞻性维护的临床数据库。我们使用逻辑回归和分层模型,比较了基于临床数据的医院心脏手术分析结果与同期行政数据得出的结果。
将基于经过审核和验证的马萨诸塞州临床登记处的2003财年单纯冠状动脉搭桥手术结果与同期州行政数据库得出的结果进行比较,后者采用了医疗保健研究与质量局的纳入/排除标准和风险模型。单纯冠状动脉搭桥手术量存在27.4%的差异(临床数据为4440例,行政数据为5657例),观察到的住院死亡率相差0.83%(分别为2.05%和2.88%),各种统计方法计算出的风险调整后死亡率也存在相应差异,并且有1家医院仅在基于行政数据的方法中被归类为异常值。对于手术量较大的项目,手术量和风险调整后死亡率的差异最为明显,这些项目可能进行了更高比例的联合手术,在行政队列中被错误分类为单纯冠状动脉搭桥手术。随后对两个数据库共有的患者队列进行分析,结果显示了分层模型的平滑效果、非标准化死亡率终点导致死亡率相对差异9%(分别为2.21%和2.03%),以及有1家医院在使用逻辑回归时被归类为异常值,但在使用分层回归时未被归类为异常值。
与来自经过审核和验证的临床数据的心脏手术报告卡相比,使用行政数据的报告卡存在问题,主要原因是病例分类错误和终点未标准化。