Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.
J Am Med Inform Assoc. 2011 May 1;18(3):218-24. doi: 10.1136/amiajnl-2011-000137.
Public health surveillance requires outbreak detection algorithms with computational efficiency sufficient to handle the increasing volume of disease surveillance data. In response to this need, the authors propose a spatial clustering algorithm, rank-based spatial clustering (RSC), that detects rapidly infectious but non-contagious disease outbreaks.
The authors compared the outbreak-detection performance of RSC with that of three well established algorithms-the wavelet anomaly detector (WAD), the spatial scan statistic (KSS), and the Bayesian spatial scan statistic (BSS)-using real disease surveillance data on to which they superimposed simulated disease outbreaks.
The following outbreak-detection performance metrics were measured: receiver operating characteristic curve, activity monitoring operating curve curve, cluster positive predictive value, cluster sensitivity, and algorithm run time.
RSC was computationally efficient. It outperformed the other two spatial algorithms in terms of detection timeliness, and outbreak localization. RSC also had overall better timeliness than the time-series algorithm WAD at low false alarm rates.
RSC is an ideal algorithm for analyzing large datasets when the application of other spatial algorithms is not practical. It also allows timely investigation for public health practitioners by providing early detection and well-localized outbreak clusters.
公共卫生监测需要具有足够计算效率的爆发检测算法,以处理不断增加的疾病监测数据量。针对这一需求,作者提出了一种空间聚类算法,即基于排名的空间聚类(RSC),用于检测具有快速传染性但非传染性的疾病爆发。
作者将 RSC 的爆发检测性能与三种成熟的算法——小波异常探测器(WAD)、空间扫描统计量(KSS)和贝叶斯空间扫描统计量(BSS)——进行了比较,使用真实的疾病监测数据,并在其上叠加了模拟的疾病爆发。
测量了以下爆发检测性能指标:接收者操作特征曲线、活动监测操作曲线、簇阳性预测值、簇敏感性和算法运行时间。
RSC 计算效率高。在检测及时性和爆发定位方面,RSC 优于其他两种空间算法。在低误报率下,RSC 的整体及时性也优于时间序列算法 WAD。
当其他空间算法不适用时,RSC 是分析大型数据集的理想算法。它还通过提供早期检测和精确定位的爆发簇,为公共卫生从业者提供及时的调查。