Jafarpour Nastaran, Izadi Masoumeh, Precup Doina, Buckeridge David L
Department of Computer Engineering, Ecole Polytechnique de Montreal, C.P. 6079, succursale Centre-ville, Montreal, Quebec H3C 3A7, Canada.
Department of Epidemiology and Biostatistics, McGill University, Clinical and Health Informatics Research Group, 1140 Pine Ave. West, Montreal, Quebec H3A 1A3, Canada.
J Biomed Inform. 2015 Feb;53:180-7. doi: 10.1016/j.jbi.2014.10.009. Epub 2014 Nov 6.
To develop a probabilistic model for discovering and quantifying determinants of outbreak detection and to use the model to predict detection performance for new outbreaks.
We used an existing software platform to simulate waterborne disease outbreaks of varying duration and magnitude. The simulated data were overlaid on real data from visits to emergency department in Montreal for gastroenteritis. We analyzed the combined data using biosurveillance algorithms, varying their parameters over a wide range. We then applied structure and parameter learning algorithms to the resulting data set to build a Bayesian network model for predicting detection performance as a function of outbreak characteristics and surveillance system parameters. We evaluated the predictions of this model through 5-fold cross-validation.
The model predicted performance metrics of commonly used outbreak detection methods with an accuracy greater than 0.80. The model also quantified the influence of different outbreak characteristics and parameters of biosurveillance algorithms on detection performance in practically relevant surveillance scenarios. In addition to identifying characteristics expected a priori to have a strong influence on detection performance, such as the alerting threshold and the peak size of the outbreak, the model suggested an important role for other algorithm features, such as adjustment for weekly patterns.
We developed a model that accurately predicts how characteristics of disease outbreaks and detection methods will influence on detection. This model can be used to compare the performance of detection methods under different surveillance scenarios, to gain insight into which characteristics of outbreaks and biosurveillance algorithms drive detection performance, and to guide the configuration of surveillance systems.
开发一种概率模型,用于发现和量化疫情检测的决定因素,并使用该模型预测新疫情的检测性能。
我们使用现有的软件平台模拟不同持续时间和规模的水源性疾病疫情。将模拟数据叠加在蒙特利尔急诊科因肠胃炎就诊的真实数据上。我们使用生物监测算法分析合并后的数据,并在很宽的范围内改变其参数。然后,我们将结构和参数学习算法应用于所得数据集,以构建一个贝叶斯网络模型,用于预测作为疫情特征和监测系统参数函数的检测性能。我们通过五折交叉验证评估了该模型的预测结果。
该模型预测常用疫情检测方法的性能指标,准确率大于0.80。该模型还量化了在实际相关的监测场景中,不同疫情特征和生物监测算法参数对检测性能的影响。除了识别先验预期对检测性能有强烈影响的特征,如警报阈值和疫情峰值大小外,该模型还表明了其他算法特征的重要作用,如对每周模式的调整。
我们开发了一个模型,该模型能够准确预测疾病疫情特征和检测方法将如何影响检测。该模型可用于比较不同监测场景下检测方法的性能,深入了解疫情和生物监测算法的哪些特征驱动检测性能,并指导监测系统的配置。