Department of Veterinary Science, University of Parma, 43126 Parma, Italy.
Department of Veterinary Science, University of Parma, 43126 Parma, Italy.
J Dairy Sci. 2024 Apr;107(4):1967-1979. doi: 10.3168/jds.2023-23827. Epub 2023 Oct 19.
The prediction of the cheese yield (%CY) traits for curd, solids, and retained water and the amount of fat, protein, solids, and energy recovered from the milk into the curd (%REC) by Bayesian models, using Fourier-transform infrared spectroscopy (FTIR), can be of significant economic interest to the dairy industry and can contribute to the improvement of the cheese process efficiency. The yields give a quantitative measure of the ratio between weights of the input and output of the process, whereas the nutrient recovery allows to assess the quantitative transfer of a component from milk to cheese (expressed in % of the initial weight). The aims of this study were: (1) to investigate the feasibility of using bulk milk spectra to predict %CY and %REC traits, and (2) to quantify the effect of the dairy industry and the contribution of single-spectrum wavelengths on the prediction accuracy of these traits using vat milk samples destined to the production of Grana Padano Protected Designation of Origin cheese. Information from 72 cheesemaking days (in total, 216 vats) from 3 dairy industries were collected. For each vat, the milk was weighed and analyzed for composition (total solids [TS], lactose, protein, and fat). After 48 h from cheesemaking, each cheese was weighed, and the resulting whey was sampled for composition as well (TS, lactose, protein, and fat). Two spectra from each milk sample were collected in the range between 5,011 and 925 cm and averaged before the data analysis. The calibration models were developed via a Bayesian approach by using the BGLR (Bayesian Generalized Linear Regression) package of R software. The performance of the models was assessed by the coefficient of determination (R) and the root mean squared error (RMSE) of validation. Random cross-validation (CVL) was applied [80% calibration and 20% validation set] with 10 replicates. Then, a stratified cross-validation (SCV) was performed to assess the effect of the dairy industry on prediction accuracy. The study was repeated using a selection of informative wavelengths to assess the necessity of using whole spectra to optimize prediction accuracy. Results showed the feasibility of using FTIR spectra and Bayesian models to predict cheesemaking traits. The R values obtained with the CVL procedure were promising in particular for the %CY and %REC for protein, ranging from 0.44 to 0.66 with very low RMSE (from 0.16 to 0.53). Prediction accuracy obtained with the SCV was strongly influenced by the dairy factory industry. The general low values gained with the SCV do not permit a practical application of this approach, but they highlight the importance of building calibration models with a dataset covering the largest possible sample variability. This study also demonstrated that the use of the full FTIR spectra may be redundant for the prediction of the cheesemaking traits and that a specific selection of the most informative wavelengths led to improved prediction accuracy. This could lead to the development of dedicated spectrometers using selected wavelengths with built-in calibrations for the online prediction of these innovative traits.
利用贝叶斯模型,使用傅里叶变换红外光谱(FTIR)预测凝乳、固体和保留水分的干物质产量(%CY)以及从牛奶中回收的脂肪、蛋白质、固体和能量的量(%REC),对乳品加工业具有重要的经济意义,并有助于提高奶酪加工效率。产率提供了输入和过程输出重量之间的定量衡量标准,而营养物回收率可以评估从牛奶到奶酪的定量转移(以初始重量的%表示)。本研究的目的是:(1)研究使用批量牛奶光谱预测%CY 和%REC 特性的可行性,以及(2)使用生产 Grana Padano 受保护原产地名称奶酪的 vat 牛奶样本,量化乳制品行业的影响和单个光谱波长对这些特性预测精度的贡献。从 3 家乳制品厂收集了 72 个奶酪制作日(总共 216 个 vat)的信息。对于每个 vat,称重牛奶并分析其成分(总固体[TS]、乳糖、蛋白质和脂肪)。奶酪制作后 48 小时,对每个奶酪进行称重,并对所得乳清进行成分分析(TS、乳糖、蛋白质和脂肪)。从每个牛奶样本中采集了两个光谱,范围在 5,011 到 925 cm 之间,并在数据分析之前对其进行平均。使用 R 软件的 BGLR(贝叶斯广义线性回归)包通过贝叶斯方法开发校准模型。通过验证的决定系数(R)和均方根误差(RMSE)评估模型的性能。应用随机交叉验证(CVL)[80%的校准和 20%的验证集],重复 10 次。然后,进行分层交叉验证(SCV)以评估乳制品行业对预测精度的影响。使用有选择的信息波长重复该研究,以评估优化预测精度是否需要使用全光谱。结果表明,使用 FTIR 光谱和贝叶斯模型预测奶酪制作特性是可行的。CVL 过程获得的 R 值特别有希望,特别是对于蛋白质的%CY 和%REC,范围从 0.44 到 0.66,RMSE 非常低(从 0.16 到 0.53)。SCV 获得的预测精度受到乳制品工厂行业的强烈影响。SCV 获得的一般低值不允许实际应用这种方法,但它们突出了使用涵盖最大样本可变性的数据集构建校准模型的重要性。本研究还表明,使用全 FTIR 光谱可能对预测奶酪制作特性是多余的,并且对最具信息量的波长的特定选择可以提高预测精度。这可能导致开发使用内置校准的专用光谱仪,以在线预测这些创新特性。