Madsen Kristine, Rosenman Marc, Hui Siu, Breitfeld Philip P
Department of Pediatrics, Indiana University School of Medicine, Indianapolis, Indiana, USA.
J Pediatr Hematol Oncol. 2002 May;24(4):256-62. doi: 10.1097/00043426-200205000-00008.
Validating published risk models in a different time and setting can be a labor-intensive process. Data in electronic format provide the potential to test the validity of risk models without labor-intensive chart reviews and data capture. The authors attempted to use readily available electronic data to find appropriate cases and to validate and refine a previously developed risk model for predicting bacteremia in children with cancer who had fever and neutropenia.
By applying a largely automated case-finding algorithm to linked, electronic clinical and administrative data systems, the authors identified and acquired data regarding 157 episodes of fever and neutropenia in children with cancer admitted to a children's hospital during an 11-month period in 1997. The authors applied a previously developed and validated risk model for bacteremia to this 1997 cohort by assessing the odds ratios among risk groups. The model assigns encounters with absolute monocyte count of 100 cells or more/mm3 to a low-risk group and encounters with an absolute monocyte count of less than 100 cells/mm3 to intermediate-risk (temperature <39.0 degrees C) or high-risk (> or = 39.0 degrees C) groups. In addition, the authors explored whether the new data would have generated the same model. Univariate and multivariable analyses were performed to determine whether there were additional independent predictors of bacteremia. Recursive partitioning of admission absolute monocyte count and temperature was used to assess whether similar cutpoints would be found.
There were 12 episodes of bacteremia (7.6%) among the 157 encounters. The previously developed model correctly predicted increasing rates of bacteremia in this 1997 cohort, ranging from 2.5% in the low-risk group (one episode in a child with an infected central line) to 24% in the high-risk group. The odds ratio for the high-risk versus intermediate-risk group was 4.09 (95% confidence interval 1.05-15.91), comparable to the odds ratio of 3.96 in the previously published derivation cohort (95% confidence interval 1.4-11.1). Multivariate analysis of the new data revealed no independent risk factors for bacteremia other than admission absolute monocyte count and temperature. Recursive partitioning of absolute monocyte count and temperature generated risk categories that were somewhat different from those of the original model. The new data yielded three categories: low risk (temperature < or = 39.5 degrees C and absolute monocyte count >10/mm3), intermediate risk (temperature < or = 39.5 degrees C and absolute monocyte count < or = 10/mm3), and high risk (temperature >39.5 degrees C).
Existing electronic data provide an efficient means for case-finding and model validation and refinement. The previously developed bacteremia model had good but not optimal predictive performance in the new data set. Admission absolute monocyte count and temperature remain significant risk factors for bacteremia. Redefining the risk categories, including a much lower cutpoint for admission absolute monocyte count, improved the model's discrimination, which suggests that predictive models need periodic updating.
在不同的时间和环境中验证已发表的风险模型可能是一个劳动密集型过程。电子格式的数据提供了在无需进行劳动密集型图表审查和数据采集的情况下测试风险模型有效性的潜力。作者试图使用现成的电子数据来寻找合适的病例,并验证和完善先前开发的用于预测患有发热和中性粒细胞减少症的癌症儿童发生菌血症的风险模型。
通过将一种主要为自动化的病例查找算法应用于关联的电子临床和管理数据系统,作者识别并获取了1997年11个月期间入住一家儿童医院的癌症儿童中157例发热和中性粒细胞减少症发作的数据。作者通过评估风险组之间的比值比,将先前开发并验证的菌血症风险模型应用于这个1997年的队列。该模型将绝对单核细胞计数为100个细胞或更多/mm³的病例归为低风险组,将绝对单核细胞计数小于100个细胞/mm³的病例归为中度风险(体温<39.0℃)或高风险(≥39.0℃)组。此外,作者探讨了新数据是否会产生相同的模型。进行单变量和多变量分析以确定是否存在菌血症的其他独立预测因素。使用入院时绝对单核细胞计数和体温的递归划分来评估是否会找到类似的切点。
在这157例病例中,有12例发生菌血症(7.6%)。先前开发的模型正确预测了这个1997年队列中菌血症发生率的增加,范围从低风险组的2.5%(1例感染中心静脉导管的儿童)到高风险组的24%。高风险组与中度风险组的比值比为4.09(95%置信区间1.05 - 15.91),与先前发表的推导队列中的比值比3.96(95%置信区间1.4 - 11.1)相当。对新数据的多变量分析显示,除了入院时绝对单核细胞计数和体温外,没有菌血症的其他独立危险因素。绝对单核细胞计数和体温的递归划分产生的风险类别与原始模型略有不同。新数据产生了三个类别:低风险(体温≤39.5℃且绝对单核细胞计数>10/mm³)、中度风险(体温≤39.5℃且绝对单核细胞计数≤10/mm³)和高风险(体温>39.5℃)。
现有的电子数据为病例查找以及模型验证和完善提供了一种有效的手段。先前开发的菌血症模型在新数据集中具有良好但并非最佳的预测性能。入院时绝对单核细胞计数和体温仍然是菌血症的重要危险因素。重新定义风险类别,包括大幅降低入院时绝对单核细胞计数的切点,提高了模型的辨别力,这表明预测模型需要定期更新。