Mohta Rashi, Prathapani Sravya, Ghosh Palash
Department of Mathematics, Indian Institute of Technology Guwahati, Guwahati, Assam India.
Jyoti and Bhupat Mehta School of Health Sciences and Technology, Indian Institute of Technology Guwahati, Guwahati, Assam India.
Ann Data Sci. 2023 May 15:1-20. doi: 10.1007/s40745-023-00467-3.
Accurate prediction of cumulative COVID-19 infected cases is essential for effectively managing the limited healthcare resources in India. Historically, epidemiological models have helped in controlling such epidemics. Models require accurate historical data to predict future outcomes. In our data, there were days exhibiting erratic, apparently anomalous jumps and drops in the number of daily reported COVID-19 infected cases that did not conform with the overall trend. Including those observations in the training data would most likely worsen model predictive accuracy. However, with existing epidemiological models it is not straightforward to determine, for a specific day, whether or not an outcome should be considered anomalous. In this work, we propose an algorithm to automatically identify anomalous 'jump' and 'drop' days, and then based upon the overall trend, the number of daily infected cases for those days is adjusted and the training data is amended using the adjusted observations. We applied the algorithm in conjunction with a recently proposed, modified Susceptible-Infected-Susceptible (SIS) model to demonstrate that prediction accuracy is improved after adjusting training data counts for apparent erratic anomalous jumps and drops.
准确预测新冠病毒累计感染病例对于有效管理印度有限的医疗资源至关重要。从历史上看,流行病学模型有助于控制此类疫情。模型需要准确的历史数据来预测未来结果。在我们的数据中,存在一些日子,每日报告的新冠病毒感染病例数出现不稳定、明显异常的波动,不符合总体趋势。将这些观测值纳入训练数据很可能会降低模型预测准确性。然而,使用现有的流行病学模型,对于特定的一天,要确定一个结果是否应被视为异常并非易事。在这项工作中,我们提出了一种算法来自动识别异常的“跃升”和“骤降”日,然后根据总体趋势,调整这些日子的每日感染病例数,并使用调整后的观测值修正训练数据。我们将该算法与最近提出的改进的易感-感染-易感(SIS)模型结合应用,以证明在针对明显不稳定的异常跃升和骤降调整训练数据计数后,预测准确性得到了提高。