Megahed Ameer A, Bommineni Y Reddy, Short Michael, Bittar João H J
Department of Large Animal Clinical Sciences, College of Veterinary Medicine, University of Florida, Gainesville, Florida, USA.
Department of Animal Medicine (Internal Medicine), Faculty of Veterinary Medicine, Benha University, Moshtohor-Toukh, Egypt.
J Vet Intern Med. 2025 May-Jun;39(3):e70070. doi: 10.1111/jvim.70070.
Bovine leukemia virus (BLV) infection in beef cattle has received less attention than in dairy herds, despite its potential impact on the beef industry.
To compare six different supervised machine-learning (SML) algorithms used to identify the most important risk factors for predicting BLV seropositivity in beef cattle in Florida.
Retrospective study. We used a dataset of 1511 blood sample records from the Bronson Animal Disease Diagnostic Laboratory, Florida Department of Agriculture & Consumer Services, submitted for BLV antibody testing from 2012 to 2022.
Logistic regression (LR), decision tree (DT), gradient boosting (GB), random forest (RF), neural network (NN), and support vector machine (SVM) were used.
Of the submitted samples, 11.6% were positive for BLV. The RF model best predicted BLV infection with an area under the receiver operating characteristic curve (AUROC) of 0.98, with a misclassification rate of 0.06. The DT model showed comparable performance to RF (AUROC, 0.94; misclassification rate, 0.06). However, the NN model had the poorest performance. The RF model showed that BLV seropositivity can be best predicted by testing beef cows during the dry season, which mostly coincides with the pre-calving processing and calving seasons, particularly for cattle raised in southern Florida.
The RF model shows promise for predicting BLV seropositivity in beef cattle. Key predictive risk factors include the dry season months coinciding with pre-calving and calving seasons and geographic location. These findings could help develop predictive tools for effective screening for BLV infection and targeted interventions.
尽管牛白血病病毒(BLV)感染对肉牛产业有潜在影响,但肉牛群中的感染情况比奶牛群受到的关注更少。
比较六种不同的监督机器学习(SML)算法,以确定预测佛罗里达州肉牛BLV血清阳性最重要的风险因素。
回顾性研究。我们使用了佛罗里达州农业与消费者服务部布朗森动物疾病诊断实验室的1511份血液样本记录数据集,这些样本于2012年至2022年提交用于BLV抗体检测。
使用逻辑回归(LR)、决策树(DT)、梯度提升(GB)、随机森林(RF)、神经网络(NN)和支持向量机(SVM)。
在提交的样本中,11.6%的样本BLV呈阳性。随机森林模型对BLV感染的预测效果最佳,受试者工作特征曲线下面积(AUROC)为0.98,误分类率为0.06。决策树模型表现与随机森林相当(AUROC为0.94;误分类率为0.06)。然而,神经网络模型表现最差。随机森林模型显示,在旱季对肉牛进行检测能最好地预测BLV血清阳性,旱季大多与产犊前处理和产犊季节重合,特别是对于佛罗里达州南部饲养的牛。
随机森林模型在预测肉牛BLV血清阳性方面显示出前景。关键的预测风险因素包括与产犊前和产犊季节重合的旱季月份以及地理位置。这些发现有助于开发有效的BLV感染筛查预测工具和针对性干预措施。