Suppr超能文献

基于多源数据融合与KNN-Attention-LSTM-XGBoost模型的大曲酸度时间序列预测与分析研究

A study on time-series prediction and analysis of acidity of Daqu based on multivariate data fusion and KNN-Attention-LSTM-XGBoost modeling.

作者信息

Yang Haili, Xia Hao, Liu Sai, Chen Shan, Li Lan, Liao Xilong, Fei Lei, Xie Liangliang, Tian Jianping, Hu Xinjun

机构信息

School of Mechanical Engineering, Sichuan University of Science and Engineering, Yibin, 644000, China.

Anhui Province Key Laboratory of Intelligent Solid-State Fermentation Technology, Bozhou, 236800, China.

出版信息

Bioprocess Biosyst Eng. 2025 Jun 14. doi: 10.1007/s00449-025-03187-5.

Abstract

Daqu is a traditional Chinese brewing ingredient that serves dual functions of saccharification and fermentation during the brewing process. The acidity content during the Daqu fermentation process directly affects the quality of the Daqu. Traditional methods for measuring Daqu acidity are complex and exhibit lag, making it difficult to monitor fermentation acidity in real time. Given the strong correlation between Daqu acidity and environmental variables, this paper proposes a time series prediction model for Daqu acidity based on the KNN-Attention-LSTM-XGBoost model. Upon collecting and analyzing the microenvironmental parameters of Daqu, the XGBoost model was used to select two optimal imputation methods (LFBI and KNN). Partial Least Squares Regression (PLSR) was employed to extract key parameters, and feature extraction using the lag and rolling window methods was performed to capture temporal trends and fluctuations. Comparative analysis revealed that KNN preprocessing combined with the Attention-LSTM-XGBoost model performed best in predicting Daqu acidity, with R values reaching 0.9790, 0.9768, and 0.9636 for the upper, middle, and lower Daqu layers, respectively. This combination outperformed the LSTM-XGBoost and XGBoost models, with improvements of 3.87%, 1.11%, and 2.84% compared to LSTM-XGBoost, and 4.70%, 4.37%, and 8.46% compared to XGBoost. This study addresses the challenge of predicting Daqu acidity during fermentation and provides insights into the optimization of the Daqu fermentation process.

摘要

大曲是中国传统酿造原料,在酿造过程中具有糖化和发酵双重功能。大曲发酵过程中的酸度直接影响大曲的品质。传统的大曲酸度测量方法复杂且具有滞后性,难以实时监测发酵酸度。鉴于大曲酸度与环境变量之间的强相关性,本文提出了一种基于KNN-Attention-LSTM-XGBoost模型的大曲酸度时间序列预测模型。在收集和分析大曲微环境参数后,使用XGBoost模型选择了两种最优插补方法(LFBI和KNN)。采用偏最小二乘回归(PLSR)提取关键参数,并使用滞后和滚动窗口方法进行特征提取,以捕捉时间趋势和波动。对比分析表明,KNN预处理结合Attention-LSTM-XGBoost模型在预测大曲酸度方面表现最佳,大曲上层、中层和下层的R值分别达到0.9790、0.9768和0.9636。该组合优于LSTM-XGBoost和XGBoost模型,与LSTM-XGBoost相比分别提高了3.87%、1.11%和2.84%,与XGBoost相比分别提高了4.70%、4.37%和8.46%。本研究解决了发酵过程中大曲酸度预测的挑战,并为大曲发酵过程的优化提供了见解。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验