Seminara J A, Callero K R, An S, Salpekar C M, Van Althuis M, Barbano D M, McArt J A A
Department of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY 14853.
Southwest New York Dairy, Livestock, and Field Crops Program, Cornell University Cooperative Extension, Cornell University, Ithaca, NY 14853.
J Dairy Sci. 2025 Sep;108(9):10073-10083. doi: 10.3168/jds.2025-26373. Epub 2025 Jul 8.
Many multiparous cows struggle to adapt to the challenges of the early postpartum period. Dyscalcemia, a condition defined by low blood calcium concentrations at 4 DIM and associated with suboptimal performance across a spectrum of epidemiologically important outcomes (health, productivity, and reproductive success), can be a useful indicator that maladaptive phenotypes are developing in early postpartum dairy cows. Identifying dyscalcemic cows, though theoretically useful from a management perspective, is not logistically viable for commercial dairy farms due to the costs and labor that would be involved in the collection and analysis of samples. Furthermore, timely methods of analysis are lacking. Therefore, our objective in this cross-sectional study was to develop a predictive model for establishing dyscalcemia status by applying machine learning approaches to milk weights and milk constituent data predicted using Fourier-transform mid-infrared spectroscopy (FTIR) from a single milking at 4 DIM. We hypothesized that such a model would have adequate diagnostic characteristics. To test this hypothesis, we collected blood, milk weights, and proportional milk samples from 542 multiparous Holsteins on 5 commercial dairy farms in central New York at 4 DIM. Blood was analyzed for serum total calcium concentration and milk was subjected to FTIR analysis from which constituent data were predicted. Cows were diagnosed as having dyscalcemia if they had serum total calcium concentration ≤2.2 mmol/L at 4 DIM, and as eucalcemic if their serum total calcium concentrations were >2.2 mmol/L at this time. Using milk yield data and the concentrations of anhydrous lactose, true protein, fat, and fatty acid groups, including de novo, mixed, and preformed, all measured in g/100 g milk, as well as milk urea nitrogen (mg/100 g milk), and milk ketone bodies (BHB and acetone; mmol/L) we fit and cross validated random forest models stratified by parity group (2, 3, and ≥4) and farm, for the prediction of dyscalcemia status, our main outcome of interest. We found that on average our models performed favorably with an area under the receiver operating characteristic curve of 0.95 (95% CI: 0.86-1.00), accuracy of 0.90 (95% CI: 0.81-0.98), sensitivity of 0.85 (95% CI: 0.64-1.00), specificity of 0.91 (95% CI: 0.84-1.00), positive predictive value of 0.71 (95% CI: 0.32-1.00) and negative predictive value of 0.96 (95% CI: 0.89-1.00). The data providing the most valuable information to our models were milk weight, and concentrations of lactose and protein. These findings, though limited to a single geographic region, time of day, milking schedule, and season, support the concept that machine learning approaches combined with milk constituent data could become a valuable tool for discriminating between dyscalcemic cows and their eucalcemic counterparts in the early postpartum period.
许多经产奶牛难以适应产后早期的挑战。血钙过少症是一种在产后4天定义为血钙浓度低的病症,并且与一系列在流行病学上重要的结果(健康、生产力和繁殖成功率)的次优表现相关,它可能是产后早期奶牛出现适应不良表型的一个有用指标。从管理角度来看,识别血钙过少的奶牛虽然在理论上有用,但由于采集和分析样本所涉及的成本和劳动力,对于商业奶牛场来说在后勤上并不可行。此外,还缺乏及时的分析方法。因此,在这项横断面研究中,我们的目标是通过将机器学习方法应用于产后4天单次挤奶时使用傅里叶变换中红外光谱(FTIR)预测的牛奶重量和牛奶成分数据,来开发一个用于确定血钙过少症状态的预测模型。我们假设这样一个模型将具有足够的诊断特征。为了检验这个假设,我们在纽约州中部的5个商业奶牛场,在产后4天从542头经产荷斯坦奶牛身上采集了血液、牛奶重量和比例牛奶样本。分析血液中的血清总钙浓度,并对牛奶进行FTIR分析以预测成分数据。如果奶牛在产后4天血清总钙浓度≤2. mmol/L,则诊断为患有血钙过少症;如果此时它们的血清总钙浓度>2.2 mmol/L,则诊断为血钙正常。使用牛奶产量数据以及无水乳糖、真蛋白、脂肪和脂肪酸组(包括从头合成、混合和预先形成的)的浓度,所有这些都以每100克牛奶中的克数来衡量,以及牛奶尿素氮(每100克牛奶中的毫克数)和牛奶酮体(β-羟基丁酸和丙酮;毫摩尔/升),我们拟合并交叉验证了按胎次组(2、3和≥4)和农场分层的随机森林模型,用于预测血钙过少症状态,这是我们主要感兴趣的结果。我们发现,平均而言,我们的模型表现良好,受试者工作特征曲线下面积为0.95(95%置信区间:0.86 - 1.00),准确率为0.90(95%置信区间:0.81 - 0.98),灵敏度为0.85(95%置信区间:0.64 - 1.00),特异性为0.91(95%置信区间:0.84 - 1.00),阳性预测值为0.71(95%置信区间:0.32 - 1.00),阴性预测值为0.96(95%置信区间:0.89 - 1.00)。为我们的模型提供最有价值信息的数据是牛奶重量、乳糖和蛋白质的浓度。这些发现虽然仅限于单个地理区域、一天中的时间、挤奶时间表和季节,但支持了这样一种概念,即机器学习方法与牛奶成分数据相结合可能成为在产后早期区分血钙过少奶牛和血钙正常奶牛的有价值工具。