Yang Zhongpan, Jin Wuyin, Du Jinsong
Appl Opt. 2025 Jun 20;64(18):5087-5098. doi: 10.1364/AO.565221.
The process of manufacturing tobacco casings constitutes a critical step in the production of tobacco leaves, exerting a substantial influence on the improvement of their physical and chemical characteristics, and consequently, the quality of the final product. Nevertheless, the prevailing approach to casing accuracy detection is predominantly focused on dosage monitoring, with a paucity of attention being paid to real-time effect evaluation. Current hyperspectral-based detection systems encounter difficulties in extracting trace additive features and managing high-dimensional data under limited sample conditions. A multi-basis continuous wavelet transform (CWT) and machine-learning-integrated framework for non-destructive propylene glycol (PG) content prediction were proposed in this paper, addressing precision limitations in tobacco quality monitoring. The preprocessing of hyperspectral imaging data from six PG concentration levels were undertaken via Savitzky-Golay filtering, followed by multiscale decomposition using three CWT basis functions with morlet, Mexican hat, and Gaussian wavelets. A dual optimization mechanism combining correlation threshold filtering and wavelength frequency statistics was developed to enable efficient feature wavelength selection. Furthermore, a stacking regression model was constructed and compared with standalone algorithms. The results demonstrated that the multiscale combined MMG strategy achieved 79.86% dimensionality reduction by selecting 58 feature wavelengths covering adjacent regions in the near-infrared and short-wave infrared (NIR-SWIR) range, significantly enhancing model generalization compared to full-spectrum inputs. Additionally, the stacking regression model attained optimal performance, with a testing set coefficient of determination of 0.9704, under combined MMG input through synergistic complementarity of heterogeneous base learners, with a root mean square error of 0.3188. It is confirmed in this paper that spectral feature interpretability is improved by multi-basis CWT decomposition through complementary wavelet-scale responses, and a novel, to our knowledge, non-destructive PG detection paradigm for industrial tobacco processing is established by the framework. Transferable insights for the hyperspectral analysis of trace components in agricultural products are provided by the methodology, and it could be applied to the non-destructive detection of trace additives for tobacco quality control.
烟草包衣制造过程是烟叶生产中的关键步骤,对改善烟叶的物理和化学特性进而对最终产品质量有重大影响。然而,目前包衣精度检测的主要方法主要集中在剂量监测上,而对实时效果评估关注较少。当前基于高光谱的检测系统在有限样本条件下提取痕量添加剂特征和处理高维数据时存在困难。本文提出了一种基于多基连续小波变换(CWT)和机器学习的无损丙二醇(PG)含量预测框架,以解决烟草质量监测中的精度限制问题。对六个PG浓度水平的高光谱成像数据进行预处理,先通过Savitzky-Golay滤波,然后使用具有莫雷、墨西哥帽和高斯小波的三种CWT基函数进行多尺度分解。开发了一种结合相关阈值滤波和波长频率统计的双重优化机制,以实现高效的特征波长选择。此外,构建了堆叠回归模型并与独立算法进行比较。结果表明,多尺度组合MMG策略通过选择覆盖近红外和短波红外(NIR-SWIR)范围内相邻区域的58个特征波长实现了79.86%的降维,与全光谱输入相比,显著提高了模型的泛化能力。此外,堆叠回归模型通过异构基学习器的协同互补在组合MMG输入下达到了最佳性能,测试集决定系数为0.9704,均方根误差为0.3188。本文证实,通过互补小波尺度响应的多基CWT分解提高了光谱特征的可解释性,该框架建立了一种据我们所知的用于工业烟草加工的新型无损PG检测范式。该方法为农产品痕量成分的高光谱分析提供了可转移的见解,可应用于烟草质量控制中痕量添加剂的无损检测。