Department of Veterinary Diagnostic and Production Animal Medicine, College of Veterinary Medicine, Iowa State University, Ames, IA, USA.
Department of Statistics, College of Liberal Arts and Sciences, Iowa State University, Ames, IA, USA.
Prev Vet Med. 2024 Nov;232:106327. doi: 10.1016/j.prevetmed.2024.106327. Epub 2024 Aug 30.
Mortality during the post-weaning phase is a critical indicator of swine production system performance, influenced by a complex interaction of multiple factors of the epidemiological triad. This study leveraged retrospective data from 1723 groups of pigs marketed within a US swine production system to develop a Wean-Quality Score (WQS) using machine learning techniques. The study evaluated three machine learning models, Random Forest, Support Vector Machine, and Gradient Boosting Machine, to classify groups having high or low 60-day mortality, where high mortality groups represented 25 % of the groups among the study population with the highest mortality values (n=431; 60-day mortality=9.98 %), and the remaining 75 % of the groups were of low mortality (n=1292; 60-day mortality=2.75 %). The best-performing model, Random Forest (RF), outperformed the other ML models in terms of accuracy (0.90), sensitivity (0.84), and specificity (0.92) metrics, and was then selected for further analysis, which consisted of creating the WQS and ranking the most important factors for classifying groups as high or low mortality. The most important factors ranked through the RF model to classify groups with high mortality were pre-weaning mortality, weaning age, average parity of litters in sow farms, and PRRS status. Additionally, stocking conditions such as stocking density and time to fill the barn were important predictors of high mortality. The WQS was developed and correlated (r = 0.74) with the actual 60-day mortality of the groups, offering a valuable tool for assessing post-weaning survivability in swine production systems before weaning. This study highlights the potential of machine learning and comprehensive data utilization to improve the assessment and management of weaned pig quality in commercial swine production, which producers can utilize to identify and intervene in groups, according to the WQS.
断奶后阶段的死亡率是衡量猪生产系统性能的一个关键指标,受到流行病学三角的多种因素的复杂相互作用的影响。本研究利用美国猪生产系统中销售的 1723 组猪的回顾性数据,利用机器学习技术开发了一种断奶质量评分(WQS)。该研究评估了三种机器学习模型,即随机森林、支持向量机和梯度提升机,以对具有高或低 60 天死亡率的猪群进行分类,其中高死亡率组占研究人群中死亡率最高的组的 25%(n=431;60 天死亡率=9.98%),其余 75%的组死亡率较低(n=1292;60 天死亡率=2.75%)。表现最好的模型是随机森林(RF),在准确性(0.90)、敏感性(0.84)和特异性(0.92)方面均优于其他 ML 模型,因此被选中进行进一步分析,包括创建 WQS 和对分类为高或低死亡率的组进行最重要因素排名。通过 RF 模型对高死亡率组进行分类的最重要因素是断奶前死亡率、断奶年龄、母猪农场中每窝的平均产仔数和 PRRS 状态。此外,饲养密度和畜舍填满时间等饲养条件也是高死亡率的重要预测因素。WQS 已经开发出来并与组的实际 60 天死亡率相关(r = 0.74),为评估断奶后猪在商业猪生产系统中的生存能力提供了有价值的工具。本研究强调了机器学习和综合数据利用的潜力,以提高对商业猪生产中断奶猪质量的评估和管理,生产者可以根据 WQS 识别和干预这些猪群。