• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的观赏南瓜种子形态分类与多样性分析

Machine Learning-Based Morphological Classification and Diversity Analysis of Ornamental Pumpkin Seeds.

作者信息

Ermiş Sıtkı, Ercan Uğur, Kabaş Aylin, Kabaş Önder, Moiceanu Georgiana

机构信息

Department of Horticulture, Faculty of Agriculture, Eskişehir Osmangazi University, Eskişehir 26040, Türkiye.

Department of Informatics, Akdeniz University, Antalya 07070, Türkiye.

出版信息

Foods. 2025 Apr 25;14(9):1498. doi: 10.3390/foods14091498.

DOI:10.3390/foods14091498
PMID:40361582
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12071245/
Abstract

Ornamental pumpkin ( L. var. ) seeds are highly morphologically variable, and their classification is hence a complex task for the seed industry. Efficient and accurate classification is critical for agricultural production, breeding programs, and seed sorting for commerce. This study employs machine learning models-Random Forest (RF), LightGBM, and k-Nearest Neighbors (KNN)-to classify ornamental pumpkin seeds based on their morphological (mass, elongation, width, thickness) and colorimetric characteristics (L*, a*, b* values from CIELAB color space). Prior to model training, the data set was preprocessed through normalization and balancing to enhance classification performance. In this study, six different types of ornamental pumpkin seeds were used, with a total of 900 (150 each of SDE0619, SDE1020, SDE1620, SDE2621, SDE4521, and SDE7721). The classification performance of the models was evaluated using different metrics, such as Accuracy, Balanced Accuracy, Precision, Recall, F1 Score, Matthews Correlation Coefficient (MCC), and Cohen's Kappa. Among the tested models, the RF model performed best, with Accuracy of 0.959, Balanced Accuracy of 0.961, Precision (Macro) of 0.962, Recall (Macro) of 0.961, F1 Score (Macro) of 0.961, MCC of 0.951, and Cohen's Kappa of 0.951. In contrast, the worst classification performance of the tested models was with the KNN model across all the evaluation metrics. These outcomes reflect the potential of machine learning-based approaches for seed classification automation, error minimization in seed classification, and maximization of efficiency in the seed industry. The high classification performance of the Random Forest model with 95.9% accuracy and 0.951 MCC value shows that artificial intelligence-based automatic classification of ornamental pumpkin seeds according to their morphological and colorimetric characteristics can make significant contributions to the seed industry, while the integration of this approach into seed sorting and quality determination processes can enable the creation of effective breeding schemes for optimum seed selection by maximizing the accuracy of agricultural processes.

摘要

观赏南瓜(L. var.)种子在形态上具有高度变异性,因此对种子行业来说,其分类是一项复杂的任务。高效且准确的分类对于农业生产、育种计划以及商业种子分选至关重要。本研究采用机器学习模型——随机森林(RF)、LightGBM和k近邻(KNN)——基于观赏南瓜种子的形态特征(质量、长度、宽度、厚度)和比色特征(CIELAB颜色空间中的L*、a*、b*值)对其进行分类。在模型训练之前,通过归一化和平衡对数据集进行预处理,以提高分类性能。在本研究中,使用了六种不同类型的观赏南瓜种子,共900颗(SDE0619、SDE1020、SDE1620、SDE2621、SDE4521和SDE7721各150颗)。使用不同的指标评估模型的分类性能,如准确率、平衡准确率、精确率、召回率、F1分数、马修斯相关系数(MCC)和科恩卡方系数。在测试的模型中,RF模型表现最佳,准确率为0.959,平衡准确率为0.961,精确率(宏)为0.962,召回率(宏)为0.961,F1分数(宏)为0.961,MCC为0.951,科恩卡方系数为0.951。相比之下,在所有评估指标中,测试模型中KNN模型的分类性能最差。这些结果反映了基于机器学习的方法在种子分类自动化、种子分类误差最小化以及种子行业效率最大化方面的潜力。随机森林模型95.9%的准确率和0.951的MCC值表明,基于人工智能根据观赏南瓜种子的形态和比色特征进行自动分类可为种子行业做出重大贡献,而将这种方法整合到种子分选和质量测定过程中,可以通过最大化农业过程的准确性来制定有效的育种方案,以实现最佳种子选择。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/7cec5a727260/foods-14-01498-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/d7894ff6b220/foods-14-01498-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/07d357479886/foods-14-01498-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/2431564afd87/foods-14-01498-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/96240fd9a7c2/foods-14-01498-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/8463507e7b55/foods-14-01498-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/9602503d024c/foods-14-01498-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/977c1186b713/foods-14-01498-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/efedee3ee701/foods-14-01498-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/7cec5a727260/foods-14-01498-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/d7894ff6b220/foods-14-01498-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/07d357479886/foods-14-01498-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/2431564afd87/foods-14-01498-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/96240fd9a7c2/foods-14-01498-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/8463507e7b55/foods-14-01498-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/9602503d024c/foods-14-01498-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/977c1186b713/foods-14-01498-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/efedee3ee701/foods-14-01498-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a9c/12071245/7cec5a727260/foods-14-01498-g009.jpg

相似文献

1
Machine Learning-Based Morphological Classification and Diversity Analysis of Ornamental Pumpkin Seeds.基于机器学习的观赏南瓜种子形态分类与多样性分析
Foods. 2025 Apr 25;14(9):1498. doi: 10.3390/foods14091498.
2
Identifying determinants of malnutrition in under-five children in Bangladesh: insights from the BDHS-2022 cross-sectional study.确定孟加拉国五岁以下儿童营养不良的决定因素:来自2022年孟加拉国人口与健康调查横断面研究的见解
Sci Rep. 2025 Apr 24;15(1):14336. doi: 10.1038/s41598-025-99288-y.
3
Voice Disorder Classification Using Wav2vec 2.0 Feature Extraction.使用Wav2vec 2.0特征提取的语音障碍分类
J Voice. 2024 Sep 25. doi: 10.1016/j.jvoice.2024.09.002.
4
Classification of multi-year and multi-variety pumpkin seeds using hyperspectral imaging technology and three-dimensional convolutional neural network.利用高光谱成像技术和三维卷积神经网络对多年和多品种南瓜籽进行分类。
Plant Methods. 2023 Aug 10;19(1):82. doi: 10.1186/s13007-023-01057-3.
5
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
6
Suitability of elemental fingerprinting for assessing the geographic origin of pumpkin (Cucurbita pepo var. styriaca) seed oil.元素指纹图谱分析用于评估南瓜籽油(Cucurbita pepo var. styriaca)地理来源的适宜性。
Food Chem. 2013 Feb 15;136(3-4):1533-42. doi: 10.1016/j.foodchem.2012.06.040. Epub 2012 Jun 29.
7
A Comparative Analysis of Machine-Learning Algorithms for Automated International Classification of Diseases (ICD)-10 Coding in Malaysian Death Records.马来西亚死亡记录中用于自动国际疾病分类(ICD)-10编码的机器学习算法的比较分析
Cureus. 2025 Jan 12;17(1):e77342. doi: 10.7759/cureus.77342. eCollection 2025 Jan.
8
Addressing Imbalanced Classification Problems in Drug Discovery and Development Using Random Forest, Support Vector Machine, AutoGluon-Tabular, and H2O AutoML.使用随机森林、支持向量机、AutoGluon-Tabular和H2O自动机器学习解决药物发现与开发中的不平衡分类问题。
J Chem Inf Model. 2025 Apr 28;65(8):3976-3989. doi: 10.1021/acs.jcim.5c00023. Epub 2025 Apr 15.
9
A retrospective study using machine learning to develop predictive model to identify rotavirus-associated acute gastroenteritis in children.一项使用机器学习开发预测模型以识别儿童轮状病毒相关性急性胃肠炎的回顾性研究。
PeerJ. 2025 Apr 14;13:e19025. doi: 10.7717/peerj.19025. eCollection 2025.
10
Characteristics of antioxidant activity and composition of pumpkin seed oils in 12 cultivars.12 个品种南瓜籽油的抗氧化活性及组成特征。
Food Chem. 2013 Aug 15;139(1-4):155-61. doi: 10.1016/j.foodchem.2013.02.009. Epub 2013 Feb 16.

本文引用的文献

1
Quantitative Assessment of Brix in Grafted Melon Cultivars: A Machine Learning and Regression-Based Approach.嫁接甜瓜品种中糖度的定量评估:一种基于机器学习和回归的方法。
Foods. 2024 Nov 29;13(23):3858. doi: 10.3390/foods13233858.
2
Applications of Machine Learning (ML) and Mathematical Modeling (MM) in Healthcare with Special Focus on Cancer Prognosis and Anticancer Therapy: Current Status and Challenges.机器学习(ML)和数学建模(MM)在医疗保健中的应用,特别关注癌症预后和抗癌治疗:现状与挑战
Pharmaceutics. 2024 Feb 9;16(2):260. doi: 10.3390/pharmaceutics16020260.
3
Classification of multi-year and multi-variety pumpkin seeds using hyperspectral imaging technology and three-dimensional convolutional neural network.
利用高光谱成像技术和三维卷积神经网络对多年和多品种南瓜籽进行分类。
Plant Methods. 2023 Aug 10;19(1):82. doi: 10.1186/s13007-023-01057-3.
4
Characterization of the USDA , , and collections.美国农业部、[此处可能缺失部分内容]及[此处可能缺失部分内容]馆藏的特征描述。
Front Plant Sci. 2023 Mar 13;14:1130814. doi: 10.3389/fpls.2023.1130814. eCollection 2023.
5
Weed and Corn Seedling Detection in Field Based on Multi Feature Fusion and Support Vector Machine.基于多特征融合和支持向量机的田间杂草和玉米幼苗检测。
Sensors (Basel). 2020 Dec 31;21(1):212. doi: 10.3390/s21010212.
6
Interactive machine learning for soybean seed and seedling quality classification.交互式机器学习在大豆种子和幼苗质量分类中的应用。
Sci Rep. 2020 Jul 9;10(1):11267. doi: 10.1038/s41598-020-68273-y.
7
The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.马修斯相关系数(MCC)在二分类评估中优于 F1 得分和准确率的优势。
BMC Genomics. 2020 Jan 2;21(1):6. doi: 10.1186/s12864-019-6413-7.
8
Whole-genome resequencing of morphotypes to discover genomic variants associated with morphology and horticulturally valuable traits.对形态型进行全基因组重测序,以发现与形态和园艺有价值性状相关的基因组变异。
Hortic Res. 2019 Aug 11;6:94. doi: 10.1038/s41438-019-0176-9. eCollection 2019.
9
A comparison of MCC and CEN error measures in multi-class prediction.多类预测中 MCC 和 CEN 误差度量的比较。
PLoS One. 2012;7(8):e41882. doi: 10.1371/journal.pone.0041882. Epub 2012 Aug 8.
10
Gourds afloat: a dated phylogeny reveals an Asian origin of the gourd family (Cucurbitaceae) and numerous oversea dispersal events.葫芦漂浮:一份过时的系统发育学研究揭示了葫芦科起源于亚洲以及众多海外扩散事件。
Proc Biol Sci. 2009 Mar 7;276(1658):843-51. doi: 10.1098/rspb.2008.1447.