School of Mechanical Engineering, Sichuan University of Science and Engineering, Yibin, China.
Key Laboratory of Brewing Biotechnology and Application of Sichuan Province, Yibin, China.
J Food Sci. 2024 Jun;89(6):3540-3553. doi: 10.1111/1750-3841.17102. Epub 2024 May 8.
Starch and alcohol serve as pivotal indicators in assessing the quality of lees fermentation. In this paper, two hyperspectral imaging (HSI) techniques (visible-near-infrared (Vis-NIR) and NIR) were utilized to acquire separate HSI data, which were then fused and analyzed toforecast the starch and alcohol contents during the fermentation of lees. Five preprocessing methods were first used to preprocess the Vis-NIR, NIR, and the fused Vis-NIR and NIR data, after which partial least squares regression models were established to determine the best preprocessing method. Following, competitive adaptive reweighted sampling, successive projection algorithm, and principal component analysis algorithms were used to extract the characteristic wavelengths to accurately predict the starch and alcohol levels. Finally, support vector machine (SVM)-AdaBoost and XGBoost models were built based on the low-level fusion (LLF) and intermediate-level fusion (ILF) of single Vis-NIR and NIR as well as the fused data. The results showed that the SVM-AdaBoost model built using the LLF data afterpreprocessing by standard normalized variable was most accurate for predicting the starch content, with an of 0.9976 and a root mean square error of prediction (RMSEP) of 0.0992. The XGBoost model built using ILF data was most accurate for predicting the alcohol content, with an of 0.9969 and an RMSEP of 0.0605. In conclusion, the analysis of fused data from distinct HSI technologies facilitates rapid and precise determination of the starch and alcohol contents in fermented grains.
淀粉和酒精是评估酒曲发酵质量的重要指标。本研究采用两种高光谱成像(HSI)技术(可见-近红外(Vis-NIR)和近红外(NIR))分别获取 HSI 数据,然后融合并分析以预测酒曲发酵过程中的淀粉和酒精含量。首先,使用五种预处理方法对 Vis-NIR、NIR 和融合的 Vis-NIR 和 NIR 数据进行预处理,然后建立偏最小二乘回归模型以确定最佳预处理方法。接着,采用竞争自适应重加权采样、连续投影算法和主成分分析算法提取特征波长,以准确预测淀粉和酒精含量。最后,基于单 Vis-NIR 和 NIR 以及融合数据的低水平融合(LLF)和中间水平融合(ILF)构建支持向量机(SVM)-AdaBoost 和 XGBoost 模型。结果表明,经标准归一化变量预处理后的 LLF 数据构建的 SVM-AdaBoost 模型预测淀粉含量最准确,其 的值为 0.9976,预测均方根误差(RMSEP)为 0.0992。基于 ILF 数据构建的 XGBoost 模型预测酒精含量最准确,其 的值为 0.9969,RMSEP 为 0.0605。总之,融合来自不同 HSI 技术的数据可实现对发酵谷物中淀粉和酒精含量的快速准确测定。