Kelly Patrick H, Kwark Rob, Marick Harrison M, Davis Julie, Stark James H, Madhava Harish, Dobler Gerhard, Moïsi Jennifer C
Vaccines and Antivirals Medical Affairs, Pfizer US Commercial Division, Pfizer, Inc., 66 Hudson Yards Blvd E, New York City, NY, USA.
Tiber Solutions, Fairfax, VA, USA.
Int J Health Geogr. 2025 Mar 14;24(1):3. doi: 10.1186/s12942-025-00388-9.
Tick-borne encephalitis (TBE) is the most serious tick-borne viral disease in Europe. Identifying TBE risk areas can be difficult due to hyper focal circulation of the TBE virus (TBEV) between mammals and ticks. To better define TBE hazard risks and elucidate regional-specific environmental factors that drive TBEV circulation, we developed two machine-learning (ML) algorithms to predict the habitat suitability (maximum entropy), and occurrence of TBEV (extreme gradient boosting) within distinct European regions (Central Europe, Nordics, and Baltics) using local variables of climate, habitat, topography, and animal hosts and reservoirs.
Geocoordinates that reported the detection of TBEV in ticks or rodents and anti-TBEV antibodies in rodent reservoirs in 2000 or later were extracted from published and grey literature. Region-specific ML models were defined via K-means clustering and trained according to the distribution of extracted geocoordinates relative to explanatory variables in each region. Final models excluded colinear variables and were evaluated for performance.
521 coordinates (455 ticks; 66 rodent reservoirs) of TBEV occurrence (2000-2022) from 100 records were extracted for model development. The models had high performance across regions (AUC: 0.72-0.92). The strongest predictors of habitat suitability and TBEV occurrence in each region were associated with different variable categories: climate variables were the strongest predictors of habitat suitability in Central Europe; rodent reservoirs and elevation were strongest in the Nordics; and animal hosts and land cover contributed most to the Baltics. The models predicted several areas with few or zero reported TBE incidence as highly suitable (≥ 60%) TBEV habitats or increased probability (≥ 25%) of TBEV occurrence including western Norway coastlines, northern Denmark, northeastern Croatia, eastern France, and northern Italy, suggesting potential capacity for locally-acquired autochthonous TBEV infections or possible underreporting of TBE cases based on reported human surveillance data.
This study shows how varying environmental factors drive the occurrence of TBEV within different European regions and identifies potential new risk areas for TBE. Importantly, we demonstrate the utility of ML models to generate reliable insights into TBE hazard risks when trained with sufficient explanatory variables and to provide high resolution and harmonized risk maps for public use.
蜱传脑炎(TBE)是欧洲最严重的蜱传病毒性疾病。由于TBE病毒(TBEV)在哺乳动物和蜱之间高度局部循环,确定TBE风险区域可能很困难。为了更好地界定TBE危害风险并阐明驱动TBEV循环的区域特定环境因素,我们开发了两种机器学习(ML)算法,利用气候、栖息地、地形以及动物宿主和储存宿主的局部变量,预测欧洲不同区域(中欧、北欧和波罗的海地区)内的栖息地适宜性(最大熵)和TBEV的存在情况(极端梯度提升)。
从已发表文献和灰色文献中提取2000年或之后报告的蜱或啮齿动物中TBEV检测以及啮齿动物储存宿主中抗TBEV抗体的地理坐标。通过K均值聚类定义区域特定的ML模型,并根据提取的地理坐标相对于每个区域解释变量的分布进行训练。最终模型排除了共线变量并对性能进行了评估。
从100条记录中提取了521个(455个蜱;66个啮齿动物储存宿主)2000 - 2022年TBEV存在情况的坐标用于模型开发。这些模型在各区域均具有高性能(AUC:0.72 - 0.92)。每个区域栖息地适宜性和TBEV存在的最强预测因子与不同的变量类别相关:气候变量是中欧栖息地适宜性的最强预测因子;啮齿动物储存宿主和海拔在北欧最强;动物宿主和土地覆盖对波罗的海地区贡献最大。模型预测了几个报告TBE发病率很低或为零的地区为高度适宜(≥60%)的TBEV栖息地或TBEV发生概率增加(≥25%)的地区,包括挪威西海岸、丹麦北部、克罗地亚东北部、法国东部和意大利北部,这表明可能存在本地获得的TBEV本土感染能力,或者基于报告的人类监测数据可能存在TBE病例报告不足的情况。
本研究展示了不同环境因素如何驱动欧洲不同区域内TBEV的发生,并确定了TBE潜在的新风险区域。重要的是,我们证明了ML模型在使用足够的解释变量进行训练时,对于生成有关TBE危害风险的可靠见解以及提供供公众使用的高分辨率和统一风险地图的实用性。