Long Haoyu, Chen Yilin, Feng Jingru, Chen Jian, Zhang Xue, Han Wenjie, Kang Min, Du Xiangjun
School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou, P.R. China.
School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, P.R. China.
PLoS Negl Trop Dis. 2025 Jun 25;19(6):e0013232. doi: 10.1371/journal.pntd.0013232. eCollection 2025 Jun.
Dengue is a significant global health threat, transmitted by mosquitoes and influenced by multiple factors. A comprehensive analysis of the impact of these factors on dengue at a global scale is helpful for better understanding and effective control of dengue epidemics.
This study employed machine learning techniques to develop a global predictive model for forecasting annual dengue cases. A wide range of multi-source features, including historical cases, population, climate, air travel, forest, anemia, vector, serotype and socioeconomic features, were comprehensively considered. The impact of these features was revealed using the SHAP (Shapley Additive Explanations) framework.
The global multi-variable model outperformed the baseline model, indicating the importance of considering multiple factors. Among the multi-source features, historical cases contribute the most, at about 73.63%. Risk factors associated to dengue were identified, including the occurrence of Aedes mosquitoes, changes in the predominant serotype, and the prevalence of anemia. Feature contribution pattern was different between hyperendemic and non-hyperendemic regions. In hyperendemic regions, historical cases and population were found to contribute more significantly, emphasizing the role of population immunity in dengue dynamics.
Dengue is influenced by a wide range of multi-source factors, and prevention and control measures should be specifically designed while taking into account regional differences for effective control of dengue.
登革热是一种重大的全球健康威胁,通过蚊子传播且受多种因素影响。在全球范围内对这些因素对登革热的影响进行全面分析,有助于更好地理解和有效控制登革热疫情。
本研究采用机器学习技术开发了一个用于预测年度登革热病例的全球预测模型。综合考虑了广泛的多源特征,包括历史病例、人口、气候、航空旅行、森林、贫血、病媒、血清型和社会经济特征。使用SHAP(Shapley值加法解释)框架揭示了这些特征的影响。
全球多变量模型优于基线模型,表明考虑多种因素的重要性。在多源特征中,历史病例贡献最大,约为73.63%。确定了与登革热相关的危险因素,包括伊蚊的出现、优势血清型的变化以及贫血的流行情况。高流行地区和非高流行地区的特征贡献模式不同。在高流行地区,发现历史病例和人口的贡献更为显著,强调了人群免疫力在登革热动态中的作用。
登革热受多种多源因素影响,应在考虑地区差异的同时专门设计预防和控制措施,以有效控制登革热。