Yang Haili, Xia Hao, Liu Sai, Chen Shan, Li Lan, Liao Xilong, Fei Lei, Xie Liangliang, Tian Jianping, Hu Xinjun
School of Mechanical Engineering, Sichuan University of Science and Engineering, Yibin, 644000, China.
Anhui Province Key Laboratory of Intelligent Solid-State Fermentation Technology, Bozhou, 236800, China.
Bioprocess Biosyst Eng. 2025 Jun 14. doi: 10.1007/s00449-025-03187-5.
Daqu is a traditional Chinese brewing ingredient that serves dual functions of saccharification and fermentation during the brewing process. The acidity content during the Daqu fermentation process directly affects the quality of the Daqu. Traditional methods for measuring Daqu acidity are complex and exhibit lag, making it difficult to monitor fermentation acidity in real time. Given the strong correlation between Daqu acidity and environmental variables, this paper proposes a time series prediction model for Daqu acidity based on the KNN-Attention-LSTM-XGBoost model. Upon collecting and analyzing the microenvironmental parameters of Daqu, the XGBoost model was used to select two optimal imputation methods (LFBI and KNN). Partial Least Squares Regression (PLSR) was employed to extract key parameters, and feature extraction using the lag and rolling window methods was performed to capture temporal trends and fluctuations. Comparative analysis revealed that KNN preprocessing combined with the Attention-LSTM-XGBoost model performed best in predicting Daqu acidity, with R values reaching 0.9790, 0.9768, and 0.9636 for the upper, middle, and lower Daqu layers, respectively. This combination outperformed the LSTM-XGBoost and XGBoost models, with improvements of 3.87%, 1.11%, and 2.84% compared to LSTM-XGBoost, and 4.70%, 4.37%, and 8.46% compared to XGBoost. This study addresses the challenge of predicting Daqu acidity during fermentation and provides insights into the optimization of the Daqu fermentation process.
大曲是中国传统酿造原料,在酿造过程中具有糖化和发酵双重功能。大曲发酵过程中的酸度直接影响大曲的品质。传统的大曲酸度测量方法复杂且具有滞后性,难以实时监测发酵酸度。鉴于大曲酸度与环境变量之间的强相关性,本文提出了一种基于KNN-Attention-LSTM-XGBoost模型的大曲酸度时间序列预测模型。在收集和分析大曲微环境参数后,使用XGBoost模型选择了两种最优插补方法(LFBI和KNN)。采用偏最小二乘回归(PLSR)提取关键参数,并使用滞后和滚动窗口方法进行特征提取,以捕捉时间趋势和波动。对比分析表明,KNN预处理结合Attention-LSTM-XGBoost模型在预测大曲酸度方面表现最佳,大曲上层、中层和下层的R值分别达到0.9790、0.9768和0.9636。该组合优于LSTM-XGBoost和XGBoost模型,与LSTM-XGBoost相比分别提高了3.87%、1.11%和2.84%,与XGBoost相比分别提高了4.70%、4.37%和8.46%。本研究解决了发酵过程中大曲酸度预测的挑战,并为大曲发酵过程的优化提供了见解。