Gruber Stefan, Rienesl Lisa, Köck Astrid, Egger-Danner Christa, Sölkner Johann
Institute of Livestock Sciences, University of Natural Resources and Life Sciences, Vienna (BOKU), Gregor-Mendel-Straße 33, 1180 Vienna, Austria.
ZuchtData EDV-Dienstleistungen GmbH, Dresdner Straße 89/19, 1200 Vienna, Austria.
Animals (Basel). 2023 Mar 29;13(7):1193. doi: 10.3390/ani13071193.
Mid-infrared (MIR) spectroscopy is routinely applied to determine major milk components, such as fat and protein. Moreover, it is used to predict fine milk composition and various traits pertinent to animal health. MIR spectra indicate an absorbance value of infrared light at 1060 specific wavenumbers from 926 to 5010 cm. According to research, certain parts of the spectrum do not contain sufficient information on traits of dairy cows. Hence, the objective of the present study was to identify specific regions of the MIR spectra of particular importance for the prediction of mastitis and ketosis, performing variable selection analysis. Partial least squares discriminant analysis (PLS-DA) along with three other statistical methods, support vector machine (SVM), least absolute shrinkage and selection operator (LASSO), and random forest (RF), were compared. Data originated from the Austrian milk recording and associated health monitoring system (GMON). Test-day data and corresponding MIR spectra were linked to respective clinical mastitis and ketosis diagnoses. Certain wavenumbers were identified as particularly relevant for the prediction models of clinical mastitis (23) and ketosis (61). Wavenumbers varied across four distinct statistical methods as well as concerning different traits. The results indicate that variable selection analysis could potentially be beneficial in the process of modeling.
中红外(MIR)光谱法通常用于测定牛奶中的主要成分,如脂肪和蛋白质。此外,它还用于预测牛奶的精细成分以及与动物健康相关的各种特征。MIR光谱表示在926至5010厘米的1060个特定波数处红外光的吸光度值。根据研究,光谱的某些部分不包含关于奶牛特征的足够信息。因此,本研究的目的是通过进行变量选择分析,确定对于乳腺炎和酮病预测特别重要的MIR光谱的特定区域。比较了偏最小二乘判别分析(PLS-DA)以及其他三种统计方法,即支持向量机(SVM)、最小绝对收缩和选择算子(LASSO)以及随机森林(RF)。数据来源于奥地利牛奶记录和相关健康监测系统(GMON)。测试日数据和相应的MIR光谱与各自的临床乳腺炎和酮病诊断相关联。某些波数被确定为与临床乳腺炎(23个)和酮病(61个)的预测模型特别相关。波数在四种不同的统计方法以及不同特征方面有所不同。结果表明,变量选择分析在建模过程中可能是有益的。