Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, Jinan 250100, China.
Shandong OX Livestock Breeding Co., Ltd., Jinan 250100, China.
Molecules. 2023 Jan 9;28(2):666. doi: 10.3390/molecules28020666.
Genetic improvement of milk fatty acid content traits in dairy cattle is of great significance. However, chromatography-based methods to measure milk fatty acid content have several disadvantages. Thus, quick and accurate predictions of various milk fatty acid contents based on the mid-infrared spectrum (MIRS) from dairy herd improvement (DHI) data are essential and meaningful to expand the amount of phenotypic data available. In this study, 24 kinds of milk fatty acid concentrations were measured from the milk samples of 336 Holstein cows in Shandong Province, China, using the gas chromatography (GC) technique, which simultaneously produced MIRS values for the prediction of fatty acids. After quantification by the GC technique, milk fatty acid contents expressed as g/100 g of milk (milk-basis) and g/100 g of fat (fat-basis) were processed by five spectral pre-processing algorithms: first-order derivative (DER1), second-order derivative (DER2), multiple scattering correction (MSC), standard normal transform (SNV), and Savitzky-Golsy convolution smoothing (SG), and four regression models: random forest regression (RFR), partial least square regression (PLSR), least absolute shrinkage and selection operator regression (LassoR), and ridge regression (RidgeR). Two ranges of wavebands (4000400 cm and 30172823 cm/1805~1734 cm) were also used in the above analysis. The prediction accuracy was evaluated using a 10-fold cross validation procedure, with the ratio of the training set and the test set as 3:1, where the determination coefficient (R) and residual predictive deviation (RPD) were used for evaluations. The results showed that 17 out of 31 milk fatty acids were accurately predicted using MIRS, with RPD values higher than 2 and R values higher than 0.75. In addition, 16 out of 31 fatty acids were accurately predicted by RFR, indicating that the ensemble learning model potentially resulted in a higher prediction accuracy. Meanwhile, DER1, DER2 and SG pre-processing algorithms led to high prediction accuracy for most fatty acids. In summary, these results imply that the application of MIRS to predict the fatty acid contents of milk is feasible.
遗传改良奶牛乳脂肪酸含量性状具有重要意义。然而,基于色谱的方法测量乳脂肪酸含量有几个缺点。因此,基于奶牛群改良(DHI)数据的中红外光谱(MIRS)快速准确地预测各种乳脂肪酸含量对于扩展可用表型数据的数量具有重要意义。本研究采用气相色谱(GC)技术,对中国山东省 336 头荷斯坦奶牛的牛奶样本进行了 24 种乳脂肪酸浓度的测量,同时产生了用于预测脂肪酸的 MIRS 值。GC 技术定量后,以 g/100g 牛奶(乳基)和 g/100g 脂肪(脂基)表示的乳脂肪酸含量经过五种光谱预处理算法(一阶导数(DER1)、二阶导数(DER2)、多次散射校正(MSC)、标准正态变换(SNV)和 Savitzky-Golsy 卷积平滑(SG))和四种回归模型(随机森林回归(RFR)、偏最小二乘回归(PLSR)、最小绝对收缩和选择算子回归(LassoR)和岭回归(RidgeR))进行处理。上述分析还使用了两个波段范围(4000400cm 和 30172823cm/1805~1734cm)。采用 10 折交叉验证程序评估预测精度,训练集和测试集的比例为 3:1,其中采用决定系数(R)和残差预测偏差(RPD)进行评估。结果表明,使用 MIRS 准确预测了 31 种乳脂肪酸中的 17 种,RPD 值高于 2,R 值高于 0.75。此外,16 种乳脂肪酸被 RFR 准确预测,表明集成学习模型可能具有更高的预测精度。同时,DER1、DER2 和 SG 预处理算法导致大多数脂肪酸具有较高的预测精度。总之,这些结果表明,MIRS 应用于预测乳脂肪酸含量是可行的。