• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于可解释机器学习的烟叶部位判别模型构建与解读

Construction and interpretation of tobacco leaf position discrimination model based on interpretable machine learning.

作者信息

Kou Ranran, Wang Cong, Liu Jinxia, Wan Ran, Jin Zhe, Zhao Le, Liu Youjie, Guo Junwei, Li Feng, Wang Hongbo, Yang Song, Nie Cong

机构信息

Key Laboratory of Tobacco Chemistry, Zhengzhou Tobacco Research Institute of China National Tobacco Corporation (CNTC), Zhengzhou, China.

Technology Center, China Tobacco Jilin Industrial Co., Ltd., Changchun, China.

出版信息

Front Plant Sci. 2025 Jul 25;16:1619380. doi: 10.3389/fpls.2025.1619380. eCollection 2025.

DOI:10.3389/fpls.2025.1619380
PMID:40786945
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12331666/
Abstract

Tobacco leaf position is closely associated with its quality whose material basis is the chemical components of tobacco leaf. In recent years, near-infrared (NIR) spectroscopy combined with algorithmic models has emerged as a popular method for identifying the tobacco leaf position. However, when applied to leaf position discrimination, these models often rely on principal components derived from dimensionality-reduced spectral signals, resulting in limited interpretability and difficulty in identifying key chemical components. Chemical composition data combined with algorithmic models can also be used to discriminate tobacco leaf positions. However, the acquisition of chemical components relies on traditional instrumental analytical methods. As a result, the acquisition of chemical composition data is time-consuming and labor-intensive, involving only a limited number of compounds. The study proposes a novel approach that integrates machine learning with advanced interpretability techniques for both tobacco leaf position discrimination and analysis. Based on the 70 tobacco leaf chemical components obtained using near-infrared rapid analysis technology, tobacco leaf position discrimination models were built using Support Vector Machine (SVM), Back Propagation Neural Network (BPNN), and Random Forest (RF). Particle swarm optimization (PSO) was used to optimize parameters of each model. Chemical components were analyzed for statistical significance across leaf positions, and their influence on model predictions was interpreted using SHapley Additive exPlanations (SHAP). The experimental results showed that among all models, the SVM- hybrid kernel demonstrated the most robust and accurate performance, achieving discrimination accuracies of 98.17% and 96.33% on the training and test sets, respectively. SHAP analysis provided a clear ranking of feature importance and revealed the positive and negative contributions of individual chemical components. The proposed method can be useful for position traceability and chemical feature analysis of various crops.

摘要

烟叶部位与其品质密切相关,其物质基础是烟叶的化学成分。近年来,近红外(NIR)光谱结合算法模型已成为识别烟叶部位的常用方法。然而,在应用于叶片部位判别时,这些模型通常依赖于从降维光谱信号中导出的主成分,导致解释性有限,难以识别关键化学成分。化学成分数据结合算法模型也可用于判别烟叶部位。然而,化学成分的获取依赖于传统的仪器分析方法。因此,化学成分数据的获取既耗时又费力,且涉及的化合物数量有限。本研究提出了一种将机器学习与先进的可解释性技术相结合的新方法,用于烟叶部位判别和分析。基于近红外快速分析技术获得的70种烟叶化学成分,使用支持向量机(SVM)、反向传播神经网络(BPNN)和随机森林(RF)建立了烟叶部位判别模型。采用粒子群优化(PSO)对各模型的参数进行优化。分析了不同叶位化学成分的统计显著性,并使用SHapley Additive exPlanations(SHAP)解释了它们对模型预测的影响。实验结果表明,在所有模型中,SVM混合核表现出最强健和准确的性能,在训练集和测试集上的判别准确率分别达到98.17%和96.33%。SHAP分析提供了特征重要性的清晰排序,并揭示了单个化学成分的正负贡献。该方法可用于各种作物的部位溯源和化学特征分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/689cf43e0bef/fpls-16-1619380-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/d1d4b7b3c110/fpls-16-1619380-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/b0905da08376/fpls-16-1619380-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/6cf0fca63385/fpls-16-1619380-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/d7f69ec4c89b/fpls-16-1619380-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/e77b7c82cda3/fpls-16-1619380-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/ffd32c63f96b/fpls-16-1619380-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/97900c46b8d4/fpls-16-1619380-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/689cf43e0bef/fpls-16-1619380-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/d1d4b7b3c110/fpls-16-1619380-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/b0905da08376/fpls-16-1619380-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/6cf0fca63385/fpls-16-1619380-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/d7f69ec4c89b/fpls-16-1619380-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/e77b7c82cda3/fpls-16-1619380-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/ffd32c63f96b/fpls-16-1619380-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/97900c46b8d4/fpls-16-1619380-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0e/12331666/689cf43e0bef/fpls-16-1619380-g008.jpg

相似文献

1
Construction and interpretation of tobacco leaf position discrimination model based on interpretable machine learning.基于可解释机器学习的烟叶部位判别模型构建与解读
Front Plant Sci. 2025 Jul 25;16:1619380. doi: 10.3389/fpls.2025.1619380. eCollection 2025.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
4
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
5
Sexual Harassment and Prevention Training性骚扰与预防培训
6
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
7
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
8
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
9
Short-Term Memory Impairment短期记忆障碍
10
Machine learning framework for oxytetracycline removal using nanostructured cupric oxide supported on magnetic chitosan alginate biocomposite.基于磁性壳聚糖海藻酸盐生物复合材料负载纳米结构氧化铜去除土霉素的机器学习框架
Sci Rep. 2025 Jul 18;15(1):26124. doi: 10.1038/s41598-025-11424-w.

本文引用的文献

1
Outlier Removal with Weight Penalization and Aggregation: A Robust Variable Selection Method for Enhancing Near-Infrared Spectral Analysis Performance.
Anal Chem. 2025 Apr 8;97(13):7325-7332. doi: 10.1021/acs.analchem.4c07007. Epub 2025 Feb 19.
2
Machine learning-enhanced flavoromics: Identifying key aroma compounds and predicting sensory quality in sauce-flavor baijiu.机器学习增强的风味组学:识别酱香型白酒中的关键香气化合物并预测感官品质
Food Chem. 2025 May 30;475:143328. doi: 10.1016/j.foodchem.2025.143328. Epub 2025 Feb 10.
3
Enhancing Multi-species Liver Microsomal Stability Prediction through Artificial Intelligence.通过人工智能增强多物种肝微粒体稳定性预测。
J Chem Inf Model. 2024 Apr 22;64(8):3222-3236. doi: 10.1021/acs.jcim.4c00159. Epub 2024 Mar 18.
4
Machine Learning Models Using SHapley Additive exPlanation for Fire Risk Assessment Mode and Effects Analysis of Stadiums.基于 SHapley Additive exPlanation 的机器学习模型在体育场火灾风险评估模型中的应用及效果分析。
Sensors (Basel). 2023 Feb 14;23(4):2151. doi: 10.3390/s23042151.
5
Just-in-Time Learning-Integrated Partial Least-Squares Strategy for Accurately Predicting 71 Chemical Constituents in Chinese Tobacco by Near-Infrared Spectroscopy.用于通过近红外光谱法准确预测中国烟草中71种化学成分的即时学习集成偏最小二乘法策略
ACS Omega. 2022 Oct 20;7(43):38650-38659. doi: 10.1021/acsomega.2c04139. eCollection 2022 Nov 1.
6
New SHapley Additive ExPlanations (SHAP) Approach to Evaluate the Raw Materials Interactions of Steel-Fiber-Reinforced Concrete.用于评估钢纤维增强混凝土原材料相互作用的新夏普利值加性解释(SHAP)方法。
Materials (Basel). 2022 Sep 9;15(18):6261. doi: 10.3390/ma15186261.
7
A Study on China's Tobacco Taxation and Its Influencing Factor on Economic Growth.中国烟草税收及其对经济增长的影响因素研究
Front Psychol. 2022 Feb 24;13:832040. doi: 10.3389/fpsyg.2022.832040. eCollection 2022.
8
Perseverance is innovation: the journey to successful tobacco tax reform.坚持不懈就是创新:成功进行烟草税改革的历程。
Tob Control. 2022 Mar;31(2):241-242. doi: 10.1136/tobaccocontrol-2021-057088.
9
Quantitative Analysis and Discrimination of Partially Fermented Teas from Different Origins Using Visible/Near-Infrared Spectroscopy Coupled with Chemometrics.采用可见/近红外光谱结合化学计量学对不同产地部分发酵茶的定量分析与鉴别。
Sensors (Basel). 2020 Sep 23;20(19):5451. doi: 10.3390/s20195451.
10
Application of Convolutional Neural Network-Based Feature Extraction and Data Fusion for Geographical Origin Identification of Radix Astragali by Visible/Short-Wave Near-Infrared and Near Infrared Hyperspectral Imaging.基于卷积神经网络的特征提取和数据融合在可见/短波近红外和近红外高光谱成像中用于黄芪产地鉴别
Sensors (Basel). 2020 Sep 1;20(17):4940. doi: 10.3390/s20174940.