Verma Madhur, Kishore Kamal, Parija Pragyan Paramita, Sahoo Soumya Swaroop, Gambhir Dolly, Gupta Usha, Kakkar Rakesh
Department of Community & Family Medicine, All India Institute of Medical Sciences Bathinda, Punjab, 151001, India.
Department of Biostatistics, Post Graduate Institute of Medical Education and Research, Chandigarh, 160012, India.
BMC Infect Dis. 2025 Mar 28;25(1):431. doi: 10.1186/s12879-025-10801-0.
BACKGROUND: Acute Febrile Illness (AFI) like Malaria, Dengue, Chikungunya, and Enteric fever still remain the most common cause of seeking healthcare in low-middle-income countries and need to be constantly monitored for any impending outbreak. Digital epidemiology promises to assist traditional health surveillance. The health data (including AFI) collated by Google using specialised platforms like Google Trends (GT) is known to correlate with actual disease trends. The present study thus aims to assess the potential of GT to support routine surveillance system and forecast AFI outbreaks reported through the Indian Integrated Disease Surveillance Programme (IDSP). METHODS: We utilised Haryana's IDSP portal to retrieve the weekly data of the most commonly reported infectious diseases causing AFI between 2011 and 2020. Internet search trends were downloaded using GT. Descriptive statistics estimated the burden of the AFI and Bland-Altman's plot depicted statistical agreement between the two. We adopted the Box-Jenkins approach to attain the final SARIMA model and explain the time-dependent weekly incidence of AFI. RESULTS: The time series plot of the reported AFI displayed trends. Martin- Bland plots depicted acceptable agreement between two datasets for all Chikungunya and Dengue. Among the models evaluated, the Malaria model [SARIMA(1,1,1)(1,1,1)] demonstrated the best performance with a balanced fit and reasonable accuracy, while the Enteric Fever model [SARIMA(0,1,0)(1,1,1)] exhibited low prediction error but weak seasonal significance. In contrast, the Dengue [SARIMA(1,1,0)(1,1,0)] and Chikungunya [ARIMA(1,0,0)(0,0,0)] models had high forecast errors, limiting their predictive reliability. Overall, GT supplemented the prediction performance of the SARIMA models with adjusted R of 46%, 50%, 50%, and 52% compared to the original 43%, 49%, 20%, and 48%. CONCLUSIONS: Our study observed modest improvements in GT-based SARIMA forecasting models compared to routine IDSP mechanisms for predicting AFI outbreaks in Haryana, highlighting the potential for further enhancement. As more granular GT data becomes available, its integration with traditional surveillance systems could significantly enhance forecasting accuracy for AFI and other infectious disease outbreaks. At no additional cost to the health system, GT can serve as a valuable, real-time digital epidemiology tool, strengthening public health preparedness and enabling timely interventions for the early containment of emerging diseases.
Healthc Inform Res. 2018-10
BMC Public Health. 2020-6-16
Glob Health Action. 2019
MMWR Surveill Summ. 2024-5-30
Indian J Public Health. 2012
Indian J Community Med. 2023
Biol Lett. 2023-2
Front Res Metr Anal. 2022-9-15
BMC Public Health. 2021-8-21
JMIR Public Health Surveill. 2021-8-30