Jarvis Lab, Department of Medicine and Healthcare, Tencent Technology (Shenzhen) Company, Shenzhen, 518000, China.
Sci Rep. 2020 Dec 3;10(1):21122. doi: 10.1038/s41598-020-78084-w.
The current outbreak of coronavirus disease 2019 (COVID-19) has recently been declared as a pandemic and spread over 200 countries and territories. Forecasting the long-term trend of the COVID-19 epidemic can help health authorities determine the transmission characteristics of the virus and take appropriate prevention and control strategies beforehand. Previous studies that solely applied traditional epidemic models or machine learning models were subject to underfitting or overfitting problems. We propose a new model named Dynamic-Susceptible-Exposed-Infective-Quarantined (D-SEIQ), by making appropriate modifications of the Susceptible-Exposed-Infective-Recovered (SEIR) model and integrating machine learning based parameter optimization under epidemiological rational constraints. We used the model to predict the long-term reported cumulative numbers of COVID-19 cases in China from January 27, 2020. We evaluated our model on officially reported confirmed cases from three different regions in China, and the results proved the effectiveness of our model in terms of simulating and predicting the trend of the COVID-19 outbreak. In China-Excluding-Hubei area within 7 days after the first public report, our model successfully and accurately predicted the long trend up to 40 days and the exact date of the outbreak peak. The predicted cumulative number (12,506) by March 10, 2020, was only 3·8% different from the actual number (13,005). The parameters obtained by our model proved the effectiveness of prevention and intervention strategies on epidemic control in China. The prediction results for five other countries suggested the external validity of our model. The integrated approach of epidemic and machine learning models could accurately forecast the long-term trend of the COVID-19 outbreak. The model parameters also provided insights into the analysis of COVID-19 transmission and the effectiveness of interventions in China.
当前 2019 年冠状病毒病(COVID-19)的爆发最近已被宣布为大流行,并在 200 多个国家和地区蔓延。预测 COVID-19 疫情的长期趋势有助于卫生当局确定病毒的传播特征,并提前采取适当的预防和控制策略。以前仅应用传统流行模型或机器学习模型的研究存在拟合不足或拟合过度的问题。我们通过对易感-暴露-感染-恢复(SEIR)模型进行适当修改,并在流行病学合理约束下集成基于机器学习的参数优化,提出了一种名为动态易感-暴露-感染-隔离(D-SEIQ)的新模型。我们使用该模型来预测 2020 年 1 月 27 日以来中国报告的 COVID-19 长期累计病例数。我们在中国的三个不同地区的官方报告确诊病例中对我们的模型进行了评估,结果证明了我们的模型在模拟和预测 COVID-19 疫情趋势方面的有效性。在中国(不含湖北)首例报告后 7 天内,我们的模型成功准确地预测了长达 40 天的长期趋势和疫情高峰期的确切日期。截至 2020 年 3 月 10 日,我们的模型预测的累计病例数(12506)与实际病例数(13005)相差仅 3.8%。我们的模型得出的参数证明了中国在疫情控制方面的预防和干预策略的有效性。对其他五个国家的预测结果表明了我们模型的外部有效性。流行和机器学习模型的综合方法可以准确预测 COVID-19 疫情的长期趋势。模型参数还提供了对 COVID-19 传播分析和中国干预措施有效性的深入了解。