Mota Lucio Flavio Macedo, Pegolo Sara, Baba Toshimi, Morota Gota, Peñagaricano Francisco, Bittante Giovanni, Cecchinato Alessio
Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, 35020 Legnaro, Italy.
Department of Animal and Poultry Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA.
Animals (Basel). 2021 Jul 2;11(7):1993. doi: 10.3390/ani11071993.
In general, Fourier-transform infrared (FTIR) predictions are developed using a single-breed population split into a training and a validation set. However, using populations formed of different breeds is an attractive way to design cross-validation scenarios aimed at increasing prediction for difficult-to-measure traits in the dairy industry. This study aimed to evaluate the potential of FTIR prediction using training set combining specialized and dual-purpose dairy breeds to predict different phenotypes divergent in terms of biological meaning, variability, and heritability, such as body condition score (BCS), serum β-hydroxybutyrate (BHB), and kappa casein (k-CN) in the major cattle breed, i.e., Holstein-Friesian. Data were obtained from specialized dairy breeds: Holstein (468 cows) and Brown Swiss (657 cows), and dual-purpose breeds: Simmental (157 cows), Alpine Grey (75 cows), and Rendena (104 cows), giving a total of 1461 cows from 41 multi-breed dairy herds. The FTIR prediction model was developed using a gradient boosting machine (GBM), and predictive ability for the target phenotype in Holstein cows was assessed using different cross-validation (CV) strategies: a within-breed scenario using 10-fold cross-validation, for which the Holstein population was randomly split into 10 folds, one for validation and the remaining nine for training (10-fold_HO); an across-breed scenario (BS_HO) where the Brown Swiss cows were used as the training set and the Holstein cows as the validation set; a specialized multi-breed scenario (BS+HO_10-fold), where the entire Brown Swiss and Holstein populations were combined then split into 10 folds, and a multi-breed scenario (Multi-breed), where the training set comprised specialized (Holstein and Brown Swiss) and dual-purpose (Simmental, Alpine Grey, and Rendena) dairy cows, combined with nine folds of the Holstein cows. Lastly a Multi-breed CV2 scenario was implemented, assuming the same number of records as the reference scenario and using the same proportions as the multi-breed. Within-Holstein, FTIR predictions had a predictive ability of 0.63 for BCS, 0.81 for BHB, and 0.80 for k-CN. Using a specific breed (Brown Swiss) as the training set for prediction in the Holstein population reduced the prediction accuracy by 10% for BCS, 7% for BHB, and 11% for k-CN. Notably, the combination of Holstein and Brown Swiss cows in the training set increased the predictive ability of the model by 6%, which was 0.66 for BCS, 0.85 for BHB, and 0.87 for k-CN. Using multiple specialized and dual-purpose animals in the training set outperforms the 10-fold_HO (standard) approach, with an increase in predictive ability of 8% for BCS, 7% for BHB, and 10% for k-CN. When the Multi-breed CV2 was implemented, no improvement was observed. Our findings suggest that FTIR prediction of different phenotypes in the Holstein breed can be improved by including different specialized and dual-purpose breeds in the training population. Our study also shows that predictive ability is enhanced when the size of the training population and the phenotypic variability are increased.
一般来说,傅里叶变换红外光谱(FTIR)预测模型是利用单一品种群体划分为训练集和验证集来构建的。然而,使用由不同品种组成的群体是设计交叉验证方案的一种有吸引力的方式,旨在提高对乳业中难以测量性状的预测能力。本研究旨在评估使用结合了专门化奶牛品种和兼用型奶牛品种的训练集进行FTIR预测的潜力,以预测在生物学意义、变异性和遗传力方面存在差异的不同表型,例如主要奶牛品种荷斯坦 - 弗里生牛的体况评分(BCS)、血清β - 羟基丁酸(BHB)和κ - 酪蛋白(k - CN)。数据来自专门化奶牛品种:荷斯坦牛(468头)和瑞士褐牛(657头),以及兼用型品种:西门塔尔牛(157头)、阿尔卑斯灰牛(75头)和伦德讷牛(104头),共计来自41个多品种奶牛群的1461头奶牛。FTIR预测模型是使用梯度提升机(GBM)构建的,并使用不同的交叉验证(CV)策略评估其对荷斯坦奶牛目标表型的预测能力:在品种内情景下使用10折交叉验证,即将荷斯坦群体随机分为10份,一份用于验证,其余九份用于训练(10折_HO);跨品种情景(BS_HO),即将瑞士褐牛作为训练集,荷斯坦牛作为验证集;专门化多品种情景(BS + HO_10折),即将整个瑞士褐牛和荷斯坦群体合并后再分为10份;多品种情景(多品种),其中训练集包括专门化品种(荷斯坦和瑞士褐牛)和兼用型品种(西门塔尔、阿尔卑斯灰牛和伦德讷牛)的奶牛,再加上九份荷斯坦奶牛。最后实施了多品种CV2情景,假设记录数量与参考情景相同,并使用与多品种情景相同的比例。在荷斯坦品种内,FTIR对BCS的预测能力为0.63,对BHB为0.81,对k - CN为0.80。使用特定品种(瑞士褐牛)作为训练集来预测荷斯坦群体时,BCS的预测准确率降低了10%,BHB降低了7%,k - CN降低了11%。值得注意的是,训练集中荷斯坦牛和瑞士褐牛的组合使模型的预测能力提高了6%,BCS为0.66,BHB为0.85,k - CN为0.87。在训练集中使用多个专门化和兼用型品种的动物优于10折_HO(标准)方法,BCS的预测能力提高了8%,BHB提高了7%,k - CN提高了10%。当实施多品种CV2情景时,未观察到改进。我们的研究结果表明,通过在训练群体中纳入不同的专门化和兼用型品种,可以提高荷斯坦品种不同表型的FTIR预测能力。我们的研究还表明,当训练群体规模和表型变异性增加时,预测能力会增强。