Manaseki-Holland Semira, Lilford Richard J, Bishop Jonathan R B, Girling Alan J, Chen Yen-Fu, Chilton Peter J, Hofer Timothy P
Public Health, Epidemiology and Biostatistics, University of Birmingham, Birmingham, UK.
Warwick Medical School, University of Warwick, Coventry, UK.
BMJ Qual Saf. 2017 May;26(5):408-416. doi: 10.1136/bmjqs-2015-004849. Epub 2016 Jun 22.
Standardised mortality ratios do not provide accurate measures of preventable mortality. This has generated interest in using case notes to assess the preventable component of mortality. But, different methods of measurement have not been compared. We compared the reliability of two scales for assessing preventability and the correspondence between them.
Medical specialists reviewed case notes of patients who had died in hospital, using two instruments: a five-point Likert scale and a continuous (0-100) scale of preventability. To enhance generalisability, we used two different hospital datasets with different types of acute medical patients across different epochs, and in two jurisdictions (UK and USA). We investigated the reliability of measurement and correspondence of preventability estimates across the two scales. Ordinal mixed effects regression methods were used to analyse the Likert scale and to calibrate it against the continuous scale. We report the estimates of the probability a death could have been prevented, accounting for reviewer inconsistency.
Correspondence between the two scales was strong; the Likert categories explained most of the variation (76% UK, 73% USA) in the continuous scale. Measurement reliability was low, but similar across the two instruments in each dataset (intraclass correlation: 0.27, UK; 0.23, USA). Adjusting for the inconsistency of reviewer judgements reduced the proportion of cases with high preventability, such that the proportion of all deaths judged probably or definitely preventable on the balance of probability was less than 1%.
The correspondence is high between a Likert and a continuous scale, although the low reliability of both would suggest careful measurement design would be needed to use either scale. Few to no cases are above the threshold when using a balance of probability approach to determining a preventable death, and in any case, there is little evidence supporting anything more than an ordinal correspondence between these reviewer estimates of probability and the true probability. Thus, it would be more defensible to use them as an ordinal measure of the quality of care received by patients who died in the hospital.
标准化死亡率不能准确衡量可预防的死亡率。这引发了人们对利用病例记录来评估死亡率中可预防部分的兴趣。但是,不同的测量方法尚未得到比较。我们比较了两种评估可预防性的量表的可靠性以及它们之间的一致性。
医学专家使用两种工具审查了在医院死亡患者的病例记录:一个五点李克特量表和一个可预防性的连续(0 - 100)量表。为了提高普遍性,我们使用了两个不同的医院数据集,涵盖不同时期、不同类型的急性内科患者,并且涉及两个司法管辖区(英国和美国)。我们研究了测量的可靠性以及两个量表之间可预防性估计的一致性。使用序数混合效应回归方法分析李克特量表,并将其与连续量表进行校准。我们报告了考虑审查者不一致性后死亡可预防概率的估计值。
两个量表之间的一致性很强;李克特类别解释了连续量表中大部分的变异(英国为76%,美国为73%)。测量可靠性较低,但每个数据集中两种工具的可靠性相似(组内相关系数:英国为0.27,美国为0.23)。调整审查者判断的不一致性后,高可预防性病例的比例降低,以至于根据可能性权衡判断可能或肯定可预防的所有死亡病例的比例小于1%。
李克特量表和连续量表之间的一致性很高,尽管两者的可靠性较低,这表明使用任何一个量表都需要精心设计测量方法。在使用可能性权衡方法确定可预防死亡时,几乎没有病例超过阈值,而且在任何情况下,几乎没有证据支持这些审查者的概率估计与真实概率之间存在比序数对应关系更强的关系。因此,将它们用作对在医院死亡患者所接受护理质量的序数测量更具合理性。