Cauchemez Simon, Ferguson Neil M
MRC Centre for Outbreak Analysis and Modelling, Department of Infectious Disease Epidemiology, Imperial College London, Norfolk Place, London W2 1PG, UK.
J R Soc Interface. 2008 Aug 6;5(25):885-97. doi: 10.1098/rsif.2007.1292.
We present a new statistical approach to analyse epidemic time-series data. A major difficulty for inference is that (i) the latent transmission process is partially observed and (ii) observed quantities are further aggregated temporally. We develop a data augmentation strategy to tackle these problems and introduce a diffusion process that mimics the susceptible-infectious-removed (SIR) epidemic process, but that is more tractable analytically. While methods based on discrete-time models require epidemic and data collection processes to have similar time scales, our approach, based on a continuous-time model, is free of such constraint. Using simulated data, we found that all parameters of the SIR model, including the generation time, were estimated accurately if the observation interval was less than 2.5 times the generation time of the disease. Previous discrete-time TSIR models have been unable to estimate generation times, given that they assume the generation time is equal to the observation interval. However, we were unable to estimate the generation time of measles accurately from historical data. This indicates that simple models assuming homogenous mixing (even with age structure) of the type which are standard in mathematical epidemiology miss key features of epidemics in large populations.
我们提出了一种新的统计方法来分析疫情时间序列数据。推断的一个主要困难在于:(i)潜在传播过程是部分可观测的;(ii)观测数据在时间上进一步聚合。我们开发了一种数据增强策略来解决这些问题,并引入了一种扩散过程,该过程模仿易感-感染-康复(SIR)疫情过程,但在分析上更易于处理。基于离散时间模型的方法要求疫情和数据收集过程具有相似的时间尺度,而我们基于连续时间模型的方法则不受此约束。使用模拟数据,我们发现,如果观测间隔小于疾病代间隔的2.5倍,SIR模型的所有参数(包括代间隔)都能被准确估计。鉴于之前的离散时间TSIR模型假设代间隔等于观测间隔,所以它们无法估计代间隔。然而,我们无法从历史数据中准确估计麻疹的代间隔。这表明,数学流行病学中标准的假设均匀混合(即使具有年龄结构)的简单模型忽略了大群体中疫情的关键特征。