Jarvis Stuart W, Kovacs Caroline, Briggs Jim, Meredith Paul, Schmidt Paul E, Featherstone Peter I, Prytherch David R, Smith Gary B
Centre for Healthcare Modelling and Informatics, University of Portsmouth, Portsmouth, UK; Department of Health Sciences, University of York, York, UK.
Centre for Healthcare Modelling and Informatics, University of Portsmouth, Portsmouth, UK.
Resuscitation. 2015 May;90:1-6. doi: 10.1016/j.resuscitation.2015.01.033. Epub 2015 Feb 8.
Sicker patients generally have more vital sign assessments, particularly immediately before an adverse outcome, and especially if the vital sign monitoring schedule is driven by an early warning score (EWS) value. This lack of independence could influence the measured discriminatory performance of an EWS.
We used a population of 1564,143 consecutive vital signs observation sets collected as a routine part of patients' care. We compared 35 published EWSs for their discrimination of the risk of death within 24h of an observation set using (1) all observations in our dataset, (2) one observation per patient care episode, chosen at random and (3) one observation per patient care episode, chosen as the closest to a randomly selected point in time in each episode. We compared the area under the ROC curve (AUROC) as a measure of discrimination for each of the 35 EWSs under each observation selection method and looked for changes in their rank order.
There were no significant changes in rank order of the EWSs based on AUROC between the different observation selection methods, except for one EWS that included age among its components. Whichever method of observation selection was used, the National Early Warning Score (NEWS) showed the highest discrimination of risk of death within 24h. AUROCs were higher when only one observation set was used per episode of care (significantly higher for many EWSs, including NEWS).
Vital sign measurements can be treated as if they are independent - multiple observations can be used from each episode of care--when comparing the performance and ranking of EWSs, provided no EWS includes age.
病情较重的患者通常有更多的生命体征评估,尤其是在不良结局即将出现之前,特别是当生命体征监测计划由早期预警评分(EWS)值驱动时。这种缺乏独立性可能会影响EWS所测量的判别性能。
我们使用了作为患者护理常规部分收集的1564143个连续生命体征观察集的总体。我们比较了35种已发表的EWS,以判别观察集后24小时内的死亡风险,使用(1)我们数据集中的所有观察值,(2)每个患者护理事件随机选择一次观察值,以及(3)每个患者护理事件选择一次观察值,选择为每个事件中最接近随机选择时间点的观察值。我们比较了每种观察选择方法下35种EWS中每种EWS的ROC曲线下面积(AUROC),作为判别度量,并寻找它们排名顺序的变化。
除了一个将年龄作为其组成部分之一的EWS外,不同观察选择方法之间基于AUROC的EWS排名顺序没有显著变化。无论使用哪种观察选择方法,国家早期预警评分(NEWS)在24小时内对死亡风险的判别能力最高。当每个护理事件仅使用一个观察集时,AUROC更高(许多EWS,包括NEWS,显著更高)。
在比较EWS的性能和排名时,生命体征测量可以被视为独立的——每个护理事件可以使用多个观察值——前提是没有EWS包含年龄。