Department of Medical Sciences, School of Veterinary Medicine, University of Wisconsin-Madison, Madison, WI, USA.
Department of Medical Sciences, School of Veterinary Medicine, University of Wisconsin-Madison, Madison, WI, USA.
Prev Vet Med. 2021 Aug;193:105422. doi: 10.1016/j.prevetmed.2021.105422. Epub 2021 Jun 30.
Dairy cows suffer poor metabolic adaptation syndrome (PMAS) during early post-calving periods caused by negative energy balance. Measurement of blood beta-hydroxy butyric acid (BHBA) and blood non-esterified fatty acids (NEFA) allow early and accurate detection of negative energy balance. Machine learning prediction of blood BHBA and blood NEFA using milk testing samples represents an opportunity to identify at-risk animals, using less labor than direct blood testing methods. Routine milk testing on modern dairies and computer record keeping provide an immense amount of data which can then be used in machine learning models. Previous research for predicting blood metabolites using Fourier-transform infrared spectroscopy (FTIR) milk data has focused mainly on individual models rather than a comparison among the models. Full model selection is the process of comparing different combinations of pre-processing methods, variable selection, and statistical learning algorithms to determine which model results in the lowest prediction error for a given dataset. For this project we used a full model selection approach with regression trees (rtFMS) . rtFMS uses the cross-validated performance of different model configurations to feed a regression tree for selecting a final model. A total of 384 possible model configurations (algorithms, predictors and data preprocessing options) for each outcome (blood BHBA and blood NEFA) were considered in the rtFMS technique. rtFMS allows direct comparison of multiple modeling approaches reducing bias due to empirical knowledge, modeling habits, or preferences, identifying the model with minimal root mean squared prediction error (RMSE) . An elastic net regression model was selected as the best performing model for both biomarkers. The input data for blood BHBA predictions were FTIR milk spectra, with a second derivative pre-processing, and a filter with 212 wave numbers, obtaining RMSE = 0.354 (0.328-0.392). The best performing model for blood NEFA had input data of FTIR milk spectra, with a second derivative pre-processing, and a filter with 212 wave numbers filter along with the time of milking, obtaining RMSE = 0.601 (0.564-0.654). The comparison of multiple modeling strategies, conducted by rtFMS, present an option for improved FTIR prediction models of blood BHBA and blood NEFA by reducing error due to human bias. The implementation of rtFMS to design future prediction models can guide model inputs and features. Our prediction models have the potential to increase early detection of metabolic disorders in dairy cows during the transition period.
奶牛在产后早期会出现代谢适应不良综合征(PMAS),这是由于能量负平衡引起的。血液β-羟丁酸(BHBA)和非酯化脂肪酸(NEFA)的测量可以早期、准确地检测到能量负平衡。使用牛奶检测样本进行血液 BHBA 和血液 NEFA 的机器学习预测为识别高危动物提供了机会,比直接血液检测方法需要的劳动力更少。现代奶牛场的常规牛奶检测和计算机记录保存提供了大量的数据,然后可以在机器学习模型中使用。以前使用傅里叶变换红外光谱(FTIR)牛奶数据预测血液代谢物的研究主要集中在单个模型上,而不是对模型进行比较。全模型选择是一种比较不同预处理方法、变量选择和统计学习算法组合的过程,以确定哪个模型在给定数据集上产生的预测误差最低。在这个项目中,我们使用了全模型选择方法与回归树(rtFMS)。rtFMS 使用不同模型配置的交叉验证性能为回归树提供反馈,以选择最终模型。对于每个结果(血液 BHBA 和血液 NEFA),共考虑了 384 种可能的模型配置(算法、预测因子和数据预处理选项)。rtFMS 允许对多个建模方法进行直接比较,减少了由于经验知识、建模习惯或偏好而产生的偏差,从而确定了具有最小均方根预测误差(RMSE)的模型。弹性网络回归模型被选为这两种生物标志物的最佳表现模型。血液 BHBA 预测的输入数据是 FTIR 牛奶光谱,具有二阶导数预处理,以及 212 个波数的滤波器,得到 RMSE=0.354(0.328-0.392)。血液 NEFA 的最佳表现模型的输入数据是 FTIR 牛奶光谱,具有二阶导数预处理,以及 212 个波数的滤波器和挤奶时间,得到 RMSE=0.601(0.564-0.654)。通过 rtFMS 进行的多种建模策略的比较为减少人为偏差引起的血液 BHBA 和血液 NEFA 的 FTIR 预测模型误差提供了一种选择。rtFMS 的实施可以为设计未来的预测模型提供指导,包括模型输入和特征。我们的预测模型有可能增加奶牛在过渡期早期对代谢紊乱的检测。