Liang Youyan, Zhao Le, Guo Junwei, Wang Hongbo, Liu Shaofeng, Wang Luoping, Chen Li, Chen Mantang, Zhang Nuohan, Liu Huimin, Nie Cong
Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou, Henan450001, China.
Technology Center of China Tobacco Yunnan Industrial Co. Ltd., Kunming650231, China.
ACS Omega. 2022 Oct 20;7(43):38650-38659. doi: 10.1021/acsomega.2c04139. eCollection 2022 Nov 1.
Near-infrared spectroscopy has been widely used to characterize the chemical composition of tobacco because it is fast, economical, and nondestructive. However, few predictive models perform ideally when applied to large spectral libraries of tobacco and its various chemical indicators. In this study, the just-in-time learning-integrated partial least-squares (JIT-PLS) modeling strategy was applied for the first time to quantitatively analyze 71 chemical components in Chinese tobacco. Approximately 18000 tobacco samples from China were analyzed to find appropriately similar measurements and propose suitable and flexible similar subsets from the calibration for each test sample. In total, 879 representative aged tobacco leaf samples and 816 cigarette samples were used as external instances to evaluate the practical predicting ability of the proposed method. The most suitable similar subsets for each test sample could be selected by limiting the Euclidean distance and number of similar subsets to 0-3.0 × 10 and 10-300, respectively. The majority of the JIT-PLS models performed significantly better than traditional PLS models. Specifically, using JIT-PLS instead of traditional PLS models increased the values from 0.347-0.984 to 0.763-0.996, and from 0.179-0.981 to 0.506-0.989 for the prediction of 67 and 71 components in aged tobacco leaf and cigarette samples, respectively. Good prediction ability was demonstrated for routine chemical components, polyphenolic compounds, organic acids, and other compounds, with the mean ratios of prediction to deviation (RPD) being 7.74, 4.39, 4.05, and 5.48, respectively). The proposed methodology could simultaneously determine 67 major components in large and complicated tobacco spectral libraries with high precision and accuracy, which will assist tobacco and cigarette quality control in collecting as well as processing stages.
近红外光谱法因其快速、经济且无损,已被广泛用于表征烟草的化学成分。然而,当应用于大型烟草光谱库及其各种化学指标时,很少有预测模型能表现得很理想。在本研究中,即时学习集成偏最小二乘法(JIT-PLS)建模策略首次被用于定量分析中国烟草中的71种化学成分。分析了来自中国的约18000个烟草样本,以找到适当相似的测量值,并从校准中为每个测试样本提出合适且灵活的相似子集。总共879个代表性陈化烟叶样本和816个卷烟样本被用作外部实例,以评估所提出方法的实际预测能力。通过将欧几里得距离和相似子集数量分别限制在0 - 3.0×10和10 - 300,可以为每个测试样本选择最合适的相似子集。大多数JIT-PLS模型的表现明显优于传统PLS模型。具体而言,对于陈化烟叶和卷烟样本中67种和71种成分的预测,使用JIT-PLS而非传统PLS模型时, 值分别从0.347 - 0.984提高到0.763 - 0.996,以及从0.179 - 0.981提高到0.506 - 0.989。对于常规化学成分、多酚类化合物、有机酸和其他化合物,均表现出良好的预测能力,预测与偏差的平均比值(RPD)分别为7.74、4.39、4.05和5.48。所提出的方法能够高精度、准确地同时测定大型复杂烟草光谱库中的67种主要成分,这将有助于烟草和卷烟在采集以及加工阶段的质量控制。