Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN, United States.
Department of Computer Science, University of Minnesota, Minneapolis, MN, United States.
Prev Vet Med. 2021 Nov;196:105449. doi: 10.1016/j.prevetmed.2021.105449. Epub 2021 Jul 29.
Porcine epidemic diarrhea virus (PEDv) was introduced to the U.S. in 2013 and is now considered to be endemic. Like many endemic diseases, it is challenging for producers to estimate and respond to spatial and temporal variation in risk. Utilizing a regional spatio-temporal dataset containing weekly PEDv infection status for ∼15 % of the U.S. sow herd, we present a machine learning platform developed to forecast the probability of PEDv infection in sow farms in the U.S. Participating stakeholders (swine production companies) in a swine-dense region of the U.S. shared weekly information on a) PEDv status of farms and b) animal movements for the past week and scheduled movements for the upcoming week. Environmental (average temperature, humidity, among others) and land use characteristics (hog density, proportion of area with different land uses) in a 5 km radius around each farm were summarized. Using the Extreme Gradient Boosting (XGBoost) machine learning model with Synthetic Minority Over-sampling Technique (SMOTE), we developed a near real-time tool that generates weekly PEDv predictions (pertaining to two-weeks in advance) to farms of participating stakeholders. Based on retrospective data collected between 2014 and 2017, the sensitivity, specificity, positive and negative predictive values of our model were 19.9, 99.9, 70.5 and 99.4 %, respectively. Overall accuracy was 99.3 %, although this metric is heavily biased by imbalance in the data (less than 0.7 % of farms had an outbreak each week). This platform has been used to deliver weekly real-time forecasts since December 2019. The forecast platform has a built-in feature to re-train the predictive model in order to remain as relevant as possible to current epidemiological situations, or to expand to a different disease. These dynamic forecasts, which account for recent animal movements, present disease distribution, and environmental factors, will promote data-informed and targeted disease management and prevention within the U.S. swine industry.
猪流行性腹泻病毒(PEDv)于 2013 年传入美国,现已被认为是地方性疾病。与许多地方性疾病一样,生产者难以估计和应对风险的时空变化。本研究利用包含美国约 15%母猪群每周 PEDv 感染状况的区域性时空数据集,介绍了一种开发的机器学习平台,用于预测美国母猪场 PEDv 感染的概率。美国猪密度较高地区的参与利益相关者(生猪生产公司)每周共享有关农场的以下信息:a)PEDv 状况;b)过去一周的动物流动情况和未来一周的计划流动情况。在每个农场周围 5 公里范围内总结了环境(平均温度、湿度等)和土地利用特征(生猪密度、不同土地利用面积的比例)。使用极端梯度提升(XGBoost)机器学习模型和合成少数过采样技术(SMOTE),我们开发了一种近实时工具,可根据参与利益相关者的农场每周生成 PEDv 预测(提前两周)。基于 2014 年至 2017 年期间收集的回顾性数据,我们的模型的敏感性、特异性、阳性和阴性预测值分别为 19.9%、99.9%、70.5%和 99.4%。总体准确率为 99.3%,尽管由于数据不平衡(每周不到 0.7%的农场爆发疫情),该指标存在严重偏差。自 2019 年 12 月以来,该预测平台一直用于提供每周实时预测。预测平台具有内置功能,可重新训练预测模型,使其尽可能与当前的流行病学情况相关,或扩展到不同的疾病。这些动态预测考虑了最近的动物流动、疾病分布和环境因素,将促进美国生猪行业的数据驱动和有针对性的疾病管理和预防。