Department of Agronomy, Food, Natural Resources, Animals and Environment, University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy.
Department of Agronomy, Food, Natural Resources, Animals and Environment, University of Padova, Viale dell' Università 16, 35020 Legnaro, Italy.
J Dairy Sci. 2021 Jul;104(7):8107-8121. doi: 10.3168/jds.2020-19861. Epub 2021 Apr 15.
Fourier-transform infrared (FTIR) spectroscopy is a powerful high-throughput phenotyping tool for predicting traits that are expensive and difficult to measure in dairy cattle. Calibration equations are often developed using standard methods, such as partial least squares (PLS) regression. Methods that employ penalization, rank-reduction, and variable selection, as well as being able to model the nonlinear relations between phenotype and FTIR, might offer improvements in predictive ability and model robustness. This study aimed to compare the predictive ability of 2 machine learning methods, namely random forest (RF) and gradient boosting machine (GBM), and penalized regression against PLS regression for predicting 3 phenotypes differing in terms of biological meaning and relationships with milk composition (i.e., phenotypes measurable directly and not directly in milk, reflecting different biological processes which can be captured using milk spectra) in Holstein-Friesian cattle under 2 cross-validation scenarios. The data set comprised phenotypic information from 471 Holstein-Friesian cows, and 3 target phenotypes were evaluated: (1) body condition score (BCS), (2) blood β-hydroxybutyrate (BHB, mmol/L), and (3) κ-casein expressed as a percentage of nitrogen (κ-CN, % N). The data set was split considering 2 cross-validation scenarios: samples-out random in which the population was randomly split into 10-folds (8-folds for training and 1-fold for validation and testing); and herd/date-out in which the population was randomly assigned to training (70% herd), validation (10%), and testing (20% herd) based on the herd and date in which the samples were collected. The random grid search was performed using the training subset for the hyperparameter optimization and the validation set was used for the generalization of prediction error. The trained model was then used to assess the final prediction in the testing subset. The grid search for penalized regression evidenced that the elastic net (EN) was the best regularization with increase in predictive ability of 5%. The performance of PLS (standard model) was compared against 2 machine learning techniques and penalized regression using 2 cross-validation scenarios. Machine learning methods showed a greater predictive ability for BCS (0.63 for GBM and 0.61 for RF), BHB (0.80 for GBM and 0.79 for RF), and κ-CN (0.81 for GBM and 0.80 for RF) in samples-out cross-validation. Considering a herd/date-out cross-validation these values were 0.58 (GBM and RF) for BCS, 0.73 (GBM and RF) for BHB, and 0.77 (GBM and RF) for κ-CN. The GBM model tended to outperform other methods in predictive ability around 4%, 1%, and 7% for EN, RF, and PLS, respectively. The prediction accuracies of the GBM and RF models were similar, and differed statistically from the PLS model in samples-out random cross-validation. Although, machine learning techniques outperformed PLS in herd/date-out cross-validation, no significant differences were observed in terms of predictive ability due to the large standard deviation observed for predictions. Overall, GBM achieved the highest accuracy of FTIR-based prediction of the different phenotypic traits across the cross-validation scenarios. These results indicate that GBM is a promising method for obtaining more accurate FTIR-based predictions for different phenotypes in dairy cattle.
傅里叶变换红外(FTIR)光谱学是一种强大的高通量表型分析工具,可用于预测在奶牛中昂贵且难以测量的性状。校准方程通常使用标准方法(如偏最小二乘(PLS)回归)开发。采用惩罚、降秩和变量选择的方法,以及能够模拟表型和 FTIR 之间的非线性关系的方法,可能会提高预测能力和模型稳健性。本研究旨在比较两种机器学习方法(随机森林(RF)和梯度提升机(GBM))和惩罚回归与 PLS 回归在两种交叉验证情况下预测荷斯坦-弗里生奶牛 3 种表型的预测能力,这 3 种表型在生物学意义和与牛奶成分的关系方面存在差异(即,可直接测量和不可直接测量的表型,反映不同的生物学过程,可以使用牛奶光谱来捕获)。数据集包含了 471 头荷斯坦-弗里生奶牛的表型信息,并评估了 3 个目标表型:(1)体况评分(BCS),(2)血液β-羟丁酸(BHB,mmol/L),(3)κ-酪蛋白表示为氮的百分比(κ-CN,% N)。数据集考虑了两种交叉验证情况进行了拆分:样本随机外,群体随机分为 10 折(8 折用于训练,1 折用于验证和测试);以及 herd/date-out,群体根据采集样本的 herd 和 date 随机分配到训练(70% herd)、验证(10%)和测试(20% herd)。使用训练子集进行随机网格搜索以进行超参数优化,并使用验证集进行预测误差的泛化。然后使用训练好的模型在测试子集上评估最终预测。惩罚回归的网格搜索表明,弹性网(EN)是最佳的正则化方法,可提高 5%的预测能力。使用两种交叉验证情况比较了 PLS(标准模型)与两种机器学习技术和惩罚回归的性能。机器学习方法在样本外交叉验证中对 BCS(GBM 为 0.63,RF 为 0.61)、BHB(GBM 为 0.80,RF 为 0.79)和 κ-CN(GBM 为 0.81,RF 为 0.80)具有更高的预测能力。考虑 herd/date-out 交叉验证,这些值分别为 BCS(GBM 和 RF)的 0.58、BHB(GBM 和 RF)的 0.73 和 κ-CN(GBM 和 RF)的 0.77。GBM 模型在预测能力方面的表现优于其他方法,分别提高了 4%、1%和 7%左右,用于 EN、RF 和 PLS。GBM 和 RF 模型的预测准确性相似,在样本外随机交叉验证中与 PLS 模型存在统计学差异。尽管机器学习技术在 herd/date-out 交叉验证中的预测能力优于 PLS,但由于预测的标准偏差较大,因此在预测能力方面没有观察到显著差异。总体而言,GBM 在跨交叉验证情况下实现了不同表型特征的 FTIR 预测的最高准确性。这些结果表明,GBM 是一种很有前途的方法,可以在奶牛中获得更准确的基于 FTIR 的不同表型预测。