Verduijn Marion, Peek Niels, de Keizer Nicolette F, van Lieshout Erik-Jan, de Pont Anne-Cornelie J M, Schultz Marcus J, de Jonge Evert, de Mol Bas A J M
Department of Medical Informatics, Academic Medical Center, Amsterdam, the Netherlands.
J Am Med Inform Assoc. 2008 Mar-Apr;15(2):227-34. doi: 10.1197/jamia.M2493. Epub 2007 Dec 20.
To investigate the agreement among clinical experts in their judgments of monitoring data with respect to artifacts, and to examine the effect of reference standards that consist of individual and joint expert judgments on the performance of artifact filters.
Individual judgments of four physicians, a majority vote judgment, and a consensus judgment were obtained for 30 time series of three monitoring variables: mean arterial blood pressure (ABPm), central venous pressure (CVP), and heart rate (HR). The individual and joint judgments were used to tune three existing automated filtering methods and to evaluate the performance of the resulting filters.
The interrater agreement was calculated in terms of positive specific agreement (PSA). The performance of the artifact filters was quantified in terms of sensitivity and positive predictive value (PPV).
PSA values between 0.33 and 0.85 were observed among clinical experts in their selection of artifacts, with relatively high values for CVP data. Artifact filters developed using judgments of individual experts were found to moderately generalize to new time series and other experts; sensitivity values ranged from 0.40 to 0.60 for ABPm and HR filters (PPV: 0.57-0.84), and from 0.63 to 0.80 for CVP filters (PPV: 0.71-0.86). A higher performance value for the filters was found for the three variable types when joint judgments were used for tuning the filtering methods.
Given the disagreement among experts in their individual judgment of monitoring data with respect to artifacts, the use of joint reference standards obtained from multiple experts is recommended for development of automatic artifact filters.
研究临床专家在判断监测数据中的伪差方面的一致性,并检验由个体和联合专家判断组成的参考标准对伪差过滤器性能的影响。
针对三个监测变量(平均动脉血压(ABPm)、中心静脉压(CVP)和心率(HR))的30个时间序列,获得了四位医生的个体判断、多数投票判断和共识判断。个体和联合判断用于调整三种现有的自动过滤方法,并评估所得过滤器的性能。
根据阳性特异性一致性(PSA)计算评分者间一致性。伪差过滤器的性能通过灵敏度和阳性预测值(PPV)进行量化。
临床专家在选择伪差时观察到PSA值在0.33至0.85之间,CVP数据的值相对较高。发现使用个体专家判断开发的伪差过滤器对新的时间序列和其他专家有一定的通用性;ABPm和HR过滤器的灵敏度值范围为0.40至0.60(PPV:0.57 - 0.84),CVP过滤器的灵敏度值范围为0.63至0.80(PPV:0.71 - 0.86)。当使用联合判断来调整过滤方法时,三种变量类型的过滤器性能值更高。
鉴于专家在对监测数据中伪差的个体判断上存在分歧,建议使用从多个专家获得的联合参考标准来开发自动伪差过滤器。