Xing Zhuoran, Shi Yaqi, Zhang Kai, Ding Songshuang, Shi Xiangdong
College of Tobacco Science, National Tobacco Cultivation and Physiology and Biochemistry Research Center, Key Laboratory for Tobacco Cultivation of Tobacco Industry, Henan Agricultural University, Zhengzhou, China.
Anhui Fermented Food Engineering Research Center, School of Food and Biological Engineering, Hefei University of Technology, Hefei, Anhui, China.
Front Plant Sci. 2025 Mar 26;16:1553110. doi: 10.3389/fpls.2025.1553110. eCollection 2025.
Accurately determining the moisture content of cigar leaves during the air-curing process is crucial for quality preservation. Traditional measurement techniques are often subjective and destructive, limiting their practical application.
In this study, we propose a stacking ensemble learning model for non-destructive moisture prediction, leveraging image-based analysis of naturally suspended cigar leaves. In this study, front and rear surface images of cigar leaves were collected throughout the air-curing process. Color and texture features were extracted from these images, and a filtering method was applied to remove redundant variables. To ensure optimal model selection, the entropy weight method was employed to comprehensively evaluate candidate machine learning models, leading to the construction of a stacking ensemble model. Furthermore, we applied the SHAP method to quantify the contribution of each input feature to the prediction results.
The stacking ensemble model, comprising MLP, RF, and GBDT as base learners and LR as the meta-learner, achieved superior prediction accuracy ( =0.989) and outperforms than traditional machine learning models ( ranged from 0.961 to 0.982). SHAP analysis revealed that front surface features (45.5%) and leaf features (38.5%) were the most influential predictors, with airing period (), , , and identified as key predictors.
This study provides a feasible and scalable solution for real-time and non-destructive monitoring of cigar leaf moisture content, offering effective technical support for similar agricultural and food drying applications.
在晾制过程中准确测定雪茄烟叶的水分含量对于品质保存至关重要。传统测量技术往往主观且具有破坏性,限制了它们的实际应用。
在本研究中,我们提出了一种用于无损水分预测的堆叠集成学习模型,利用对自然悬挂的雪茄烟叶进行基于图像的分析。在本研究中,在整个晾制过程中收集了雪茄烟叶的正面和背面图像。从这些图像中提取颜色和纹理特征,并应用一种过滤方法去除冗余变量。为确保选择最优模型,采用熵权法对候选机器学习模型进行综合评估,从而构建了一个堆叠集成模型。此外,我们应用SHAP方法来量化每个输入特征对预测结果的贡献。
由多层感知器(MLP)、随机森林(RF)和梯度提升决策树(GBDT)作为基学习器以及逻辑回归(LR)作为元学习器组成的堆叠集成模型实现了卓越的预测精度( =0.989),并且比传统机器学习模型表现更优( 范围从0.961至0.982)。SHAP分析表明,正面特征(45.5%)和叶片特征(38.5%)是最具影响力的预测因子,晾制期()、 、 以及 被确定为关键预测因子。
本研究为雪茄烟叶水分含量的实时无损监测提供了一种可行且可扩展的解决方案,为类似的农业和食品干燥应用提供了有效的技术支持。