Health Data Research, UK; Swansea University Medical School, UK.
Health Data Research, UK; Swansea University Medical School, UK.
Int J Med Inform. 2021 May;149:104400. doi: 10.1016/j.ijmedinf.2021.104400. Epub 2021 Jan 28.
Introduction The COVID-19 pandemic has highlighted the need for robust data linkage systems and methods for identifying outbreaks of disease in near real-time. Objectives The primary objective of this study was to develop a real-time geospatial surveillance system to monitor the spread of COVID-19 across the UK. Methods Using self-reported app data and the Secure Anonymised Information Linkage (SAIL) Databank, we demonstrate the use of sophisticated spatial modelling for near-real-time prediction of COVID-19 prevalence at small-area resolution to inform strategic government policy areas. Results We demonstrate that using a combination of crowd-sourced app data and sophisticated geo-statistical techniques it is possible to predict hot spots of COVID-19 at fine geographic scales, nationally. We are also able to produce estimates of their precision, which is an important pre-requisite to an effective control strategy to guard against over-reaction to potentially spurious features of 'best guess' predictions. Conclusion In the UK, important emerging risk-factors such as social deprivation or ethnicity vary over small distances, hence risk needs to be modelled at fine spatial resolution to avoid aggregation bias. We demonstrate that existing geospatial statistical methods originally developed for global health applications are well-suited to this task and can be used in an anonymised databank environment, thus preserving the privacy of the individuals who contribute their data.
COVID-19 大流行凸显了建立强大的数据链接系统和方法以实时识别疾病爆发的必要性。
本研究的主要目的是开发实时地理空间监测系统,以监测英国 COVID-19 的传播。
使用自我报告的应用程序数据和安全匿名信息链接 (SAIL) 数据库,我们展示了如何使用复杂的空间建模技术,以接近实时的方式预测 COVID-19 在小区域分辨率下的流行程度,为政府战略政策领域提供信息。
我们证明,通过结合众包应用程序数据和复杂的地理统计技术,可以在全国范围内预测 COVID-19 的热点地区。我们还能够对其精度进行估计,这是有效控制策略的重要前提,可避免对“最佳猜测”预测中潜在虚假特征的过度反应。
在英国,社会贫困或种族等重要新出现的风险因素在小范围内变化,因此需要在精细的空间分辨率下进行风险建模,以避免聚合偏差。我们证明,最初为全球卫生应用开发的现有地理空间统计方法非常适合这项任务,并且可以在匿名数据库环境中使用,从而保护提供数据的个人的隐私。