Massachusetts Institute of Technology, 77 Massachusetts Avenue, MA 02139, USA.
Int J Med Inform. 2020 Nov;143:104262. doi: 10.1016/j.ijmedinf.2020.104262. Epub 2020 Aug 25.
The Coronavirus Disease 2019 (COVID-19) has currently ravaged through the world, resulting in over thirteen million confirmed cases and over five hundred thousand deaths, a complete change in daily life as we know it, worldwide lockdowns, travel restrictions, as well as heightened hygiene measures and physical distancing. Being able to analyse and predict the spread of this epidemic-causing disease is hence of utmost importance now, especially as it would help in the reasoning behind important decisions drastically affecting countries and their people, as well as in ensuring efficient resource and utility management. However, the needs of the people and specific conditions of the spread are varying widely from country to country. Hence, this article has two fold objectives: (i) conduct an in-depth statistical analysis of COVID-19 affected patients in India, (ii) propose a mathematical model for the prediction of spread of COVID-19 cases in India.
There has been limited research in modeling and predicting the spread of COVID-19 in India, owing both to the ongoing nature of the pandemic and limited availability of data. Currently famous SIR and non-SIR based Gauss-error-function and Monte Carlo simulation models do not perform well in the context of COVID-19 spread in India. We propose a 'change-factor' or 'rate-of-change' based mathematical model to predict the spread of the pandemic in India, with data drawn from hundreds of sources.
Average age of affected patients was found to be 38.54 years, with 66.76% males, and 33.24% females. Most patients were in the age range of 18-40 years. Optimal parameter values of the prediction model are identified (α = 1.35, N = 3 and T = 10) by extensive experiments. Over the entire course of time since the outbreak started in India, the model has been 90.36% accurate in predicting the total number of cases the next day, correctly predicting the range in 150 out of the 166 days looked at.
The proposed system showed an accuracy of 90.36% for prediction since the first COVID-19 case in India, and 96.67% accuracy over the month of April. Predicted number of cases for the next day is found to be a function of the numbers over the last 3 days, but with an 'increase' factor influenced by the last 10 days. It is noticed that males are affected more than females. It is also noticed that in India, the number of people in each age bucket is steadily decreasing, with the largest number of adults infected being the youngest ones-a departure from the world trend. The model is self-correcting as it improves its predictions every day, by incorporating the previous day's data into the trend-line for the following days. This model can thus be used dynamically not only to predict the spread of COVID-19 in India, but also to check the effect of various government measures in a short span of time after they are implemented.
目前,2019 年冠状病毒病(COVID-19)在全球肆虐,已确诊病例超过 1300 万例,死亡超过 50 万例,这完全改变了我们所知的日常生活,全球范围内的封锁、旅行限制以及更高的卫生措施和身体距离。因此,分析和预测这种传染病的传播非常重要,尤其是因为这有助于我们理解对国家及其人民产生重大影响的重要决策的背后原因,并确保资源和公用事业的有效管理。然而,人们的需求和传播的具体情况在各国之间差异很大。因此,本文有两个目标:(i)对印度受 COVID-19 影响的患者进行深入的统计分析,(ii)提出一种用于预测印度 COVID-19 病例传播的数学模型。
由于大流行的持续性质和数据的有限可用性,对印度 COVID-19 传播进行建模和预测的研究有限。目前著名的 SIR 和非 SIR 基于高斯误差函数和蒙特卡罗模拟模型在印度 COVID-19 传播的背景下表现不佳。我们提出了一种基于“变化因素”或“变化率”的数学模型,以预测印度大流行的传播,该模型的数据来自数百个来源。
发现受影响患者的平均年龄为 38.54 岁,男性占 66.76%,女性占 33.24%。大多数患者年龄在 18-40 岁之间。通过广泛的实验确定了预测模型的最佳参数值(α=1.35,N=3 和 T=10)。自印度爆发以来的整个时间内,该模型在预测次日的总病例数方面准确率为 90.36%,在 166 天中正确预测了 150 天的范围。
该系统自印度首例 COVID-19 病例以来的预测准确率为 90.36%,4 月份的准确率为 96.67%。预测次日的病例数是过去 3 天的病例数的函数,但受到过去 10 天的“增加”因素的影响。男性比女性更容易受到影响。还注意到,在印度,每个年龄组的人数都在稳步下降,受感染的成年人数量最多的是最年轻的成年人,这与世界趋势不同。该模型是自我修正的,因为它每天通过将前一天的数据纳入后续几天的趋势线来提高预测精度。因此,该模型不仅可以用于动态预测印度 COVID-19 的传播,还可以在实施政府措施后短时间内检查各种措施的效果。