• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合通用数据分析工具(UDT)和极端梯度提升(XGBoost),通过近红外光谱法识别黑豆的地理来源。

Combining UDT with XGBoost to identify the geographical origin of black beans by near-infrared spectroscopy.

作者信息

Cao Zhihang, Wu Xiaohong, Wu Bin, Zhang Zexi, Sun Jun

机构信息

Mengxi Honors College, Jiangsu University, Zhenjiang, 212013, China.

School of Electrical and Information Engineering, Jiangsu University, Zhenjiang, 212013, China.

出版信息

Curr Res Food Sci. 2025 Jun 29;11:101131. doi: 10.1016/j.crfs.2025.101131. eCollection 2025.

DOI:10.1016/j.crfs.2025.101131
PMID:40689301
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12271794/
Abstract

This study aims to rapidly and non-destructively identify the geographical origin of black beans () using a portable near-infrared (NIR) spectrometer, addressing the challenge of distinguishing black beans due to significant regional variations in quality. A total of 400 black bean samples were collected from five regions in China. To improve classification accuracy, a novel model combining uncorrelated discriminant transform (UDT) with extreme gradient boosting (XGBoost) was proposed for feature extraction and classification. When evaluated with k-nearest neighbor (KNN), naive Bayes (NB), and support vector machine (SVM) classifiers, UDT achieved accuracies of 96.25 %, 93.75 %, and 96.25 %, respectively, outperforming Foley-Sammon transform (FST) and discriminant principal component analysis (DPCA). The UDT + XGBoost combination achieved the highest classification accuracy of 100 %. For robust validation, a 5-fold cross-validation strategy was applied to the UDT + XGBoost model, achieving an average accuracy of 96.00 %. This study provides a reliable method for black bean origin traceability and authenticity.

摘要

本研究旨在使用便携式近红外(NIR)光谱仪快速、无损地识别黑豆的地理来源,以应对因品质存在显著区域差异而导致难以区分黑豆的挑战。共从中国五个地区收集了400份黑豆样本。为提高分类准确率,提出了一种将不相关判别变换(UDT)与极端梯度提升(XGBoost)相结合的新型模型用于特征提取和分类。当使用k近邻(KNN)、朴素贝叶斯(NB)和支持向量机(SVM)分类器进行评估时,UDT分别达到了96.25%、93.75%和96.25%的准确率,优于福利-萨蒙变换(FST)和判别主成分分析(DPCA)。UDT + XGBoost组合实现了最高的100%分类准确率。为进行稳健验证,对UDT + XGBoost模型应用了5折交叉验证策略,平均准确率达到96.00%。本研究为黑豆产地溯源和真伪鉴别提供了一种可靠方法。

相似文献

1
Combining UDT with XGBoost to identify the geographical origin of black beans by near-infrared spectroscopy.结合通用数据分析工具(UDT)和极端梯度提升(XGBoost),通过近红外光谱法识别黑豆的地理来源。
Curr Res Food Sci. 2025 Jun 29;11:101131. doi: 10.1016/j.crfs.2025.101131. eCollection 2025.
2
Can a robust machine learning-based integrated framework benefit universal water quality evaluation applications? Comparative view in the whole coastal regions of China.一个强大的基于机器学习的综合框架能否有益于通用水质评估应用?中国沿海全域的比较视角。
Mar Pollut Bull. 2025 Nov;220:118399. doi: 10.1016/j.marpolbul.2025.118399. Epub 2025 Jul 16.
3
Development and validation of a machine learning model for predicting co-infection of in pediatric patients.用于预测儿科患者合并感染的机器学习模型的开发与验证。
Transl Pediatr. 2025 Jun 27;14(6):1201-1212. doi: 10.21037/tp-2024-562. Epub 2025 Jun 25.
4
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
5
Construction and validation of HBV-ACLF bacterial infection diagnosis model based on machine learning.基于机器学习的HBV-ACLF细菌感染诊断模型的构建与验证
BMC Infect Dis. 2025 Jul 1;25(1):847. doi: 10.1186/s12879-025-11199-5.
6
Constructing a Classification Model for Cervical Cancer Tumor Tissue and Normal Tissue Based on CT Radiomics.基于 CT 影像组学构建宫颈癌肿瘤组织与正常组织分类模型
Technol Cancer Res Treat. 2024 Jan-Dec;23:15330338241298554. doi: 10.1177/15330338241298554.
7
Mortality Risk Prediction in Patients With Antimelanoma Differentiation-Associated, Gene 5 Antibody-Positive, Dermatomyositis-Associated Interstitial Lung Disease: Algorithm Development and Validation.抗黑色素瘤分化相关基因5抗体阳性、皮肌炎相关间质性肺疾病患者的死亡风险预测:算法开发与验证
J Med Internet Res. 2025 Feb 5;27:e62836. doi: 10.2196/62836.
8
A comparative analysis of emotion recognition from EEG signals using temporal features and hyperparameter-tuned machine learning techniques.使用时间特征和超参数调整的机器学习技术对脑电图信号进行情感识别的比较分析。
MethodsX. 2025 Jun 25;15:103468. doi: 10.1016/j.mex.2025.103468. eCollection 2025 Dec.
9
A three-classification machine learning model for non-invasive prediction of molecular subtypes in diffuse glioma: a two-center study.用于弥漫性胶质瘤分子亚型无创预测的三分类机器学习模型:一项双中心研究
Quant Imaging Med Surg. 2025 Jun 6;15(6):5752-5768. doi: 10.21037/qims-24-2461. Epub 2025 May 29.
10
Multiclass classification of thalassemia types using complete blood count and HPLC data with machine learning.利用全血细胞计数和高效液相色谱数据,通过机器学习对地中海贫血类型进行多类别分类。
Sci Rep. 2025 Jul 21;15(1):26379. doi: 10.1038/s41598-025-06594-6.

本文引用的文献

1
Analyzing TVB-N in snakehead by Bayesian-optimized 1D-CNN using molecular vibrational spectroscopic techniques: Near-infrared and Raman spectroscopy.利用分子振动光谱技术(近红外和拉曼光谱)通过贝叶斯优化 1D-CNN 分析乌头中的 TVB-N。
Food Chem. 2025 Feb 1;464(Pt 2):141701. doi: 10.1016/j.foodchem.2024.141701. Epub 2024 Oct 19.
2
Near-Infrared Spectroscopy Combined with Fuzzy Improved Direct Linear Discriminant Analysis for Nondestructive Discrimination of Chrysanthemum Tea Varieties.近红外光谱结合模糊改进直接线性判别分析用于菊花茶品种的无损鉴别
Foods. 2024 May 7;13(10):1439. doi: 10.3390/foods13101439.
3
Nutritional Modulation of Hepcidin in the Treatment of Various Anemic States.
营养调控铁调素在各种贫血状态治疗中的作用。
Nutrients. 2023 Dec 12;15(24):5081. doi: 10.3390/nu15245081.
4
Uncovering the differences in flavor volatiles of different colored foxtail millets based on gas chromatography-ion migration spectrometry and chemometrics.基于气相色谱-离子迁移谱和化学计量学揭示不同颜色谷子风味挥发物的差异
Curr Res Food Sci. 2023 Sep 9;7:100585. doi: 10.1016/j.crfs.2023.100585. eCollection 2023.
5
Effects of digestion on protein degradation, phenolic compound release, and bioactivity of black bean tempeh.消化对黑豆豆豉蛋白质降解、酚类化合物释放及生物活性的影响。
Front Nutr. 2022 Oct 12;9:1017765. doi: 10.3389/fnut.2022.1017765. eCollection 2022.
6
Classification of Tea Quality Levels Using Near-Infrared Spectroscopy Based on CLPSO-SVM.基于CLPSO-SVM的近红外光谱法茶叶品质等级分类
Foods. 2022 Jun 5;11(11):1658. doi: 10.3390/foods11111658.
7
Discrimination of the Red Jujube Varieties Using a Portable NIR Spectrometer and Fuzzy Improved Linear Discriminant Analysis.基于便携式近红外光谱仪和模糊改进线性判别分析的红枣品种鉴别
Foods. 2022 Mar 7;11(5):763. doi: 10.3390/foods11050763.
8
Characterization of the aroma-active compounds in black soybean sauce, a distinctive soy sauce.黑豆瓣酱特征香气成分分析,一种独特的酱油。
Food Chem. 2021 Dec 1;364:130334. doi: 10.1016/j.foodchem.2021.130334. Epub 2021 Jun 9.
9
Rapid detection of mercury in food via rhodamine 6G signal using surface-enhanced Raman scattering coupled multivariate calibration.利用表面增强拉曼散射结合多元校准技术,通过罗丹明 6G 信号快速检测食品中的汞。
Food Chem. 2021 Oct 1;358:129844. doi: 10.1016/j.foodchem.2021.129844. Epub 2021 Apr 20.
10
Qualitative identification of the edible oil storage period using a homemade portable electronic nose combined with multivariate analysis.利用自制便携式电子鼻结合多元分析对食用油脂储存期进行定性鉴别。
J Sci Food Agric. 2021 Jun;101(8):3448-3456. doi: 10.1002/jsfa.10975. Epub 2020 Dec 21.