基于红外光谱和机器学习的杏品种分类

Classification of Apricot Varieties by Infrared Spectroscopy and Machine Learning.

作者信息

Béjar-Grimalt Jaume, Pérez-Guaita David, Sánchez-Illana Ángel, García-Contreras Rodolfo, Kataria Rashmi, Bureau Sylvie, de la Guardia Miguel, Cadet Frédéric

机构信息

Department of Analytical Chemistry, University of Valencia, 46100 Burjassot, Spain.

Departamento de Microbiología y Parasitología, Facultad de Medicina, Universidad Nacional Autonoma de Mexico, 04510 Mexico City, Mexico.

出版信息

ACS Agric Sci Technol. 2025 Jul 8;5(7):1373-1381. doi: 10.1021/acsagscitech.5c00068. eCollection 2025 Jul 21.

DOI:10.1021/acsagscitech.5c00068

PMID:40740977

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12309246/

Abstract

This work aimed to investigate using ATR-FTIR spectroscopy combined with machine learning to classify eight apricot varieties. Traditionally, variety identification relies on physicochemical property measurements, which are time-consuming and require laboratory analysis. Instead, we used the ATR-FTIR spectra from 731 apricots divided into calibration (512) and test (219) sets and three machine learning models (i.e., partial least-squares-discriminant analysis (PLS-DA), support vector machine (SVM), and random forest (RF)) to accurately predict 97% of the test samples. Additionally, careful inspection of the PLS-DA regression vectors revealed a strong correlation between the spectra and biochemical composition in sugar and organic acids, validating ATR-FTIR spectroscopy as a viable alternative for variety identification. Finally, to validate the results, additional models were constructed using the physicochemical data from the apricots. These reference models were then tested using the same data splits as the spectroscopic data used as a reference method, obtaining similar results with both approaches.

摘要

这项工作旨在研究使用衰减全反射傅里叶变换红外光谱（ATR-FTIR）结合机器学习对八个杏子品种进行分类。传统上，品种鉴定依赖于物理化学性质测量，这既耗时又需要实验室分析。相反，我们使用了来自731个杏子的ATR-FTIR光谱，这些杏子被分为校准集（512个）和测试集（219个），并使用三种机器学习模型（即偏最小二乘判别分析（PLS-DA）、支持向量机（SVM）和随机森林（RF））来准确预测97%的测试样本。此外，对PLS-DA回归向量的仔细检查揭示了光谱与糖和有机酸中的生化成分之间的强相关性，验证了ATR-FTIR光谱作为品种鉴定的可行替代方法。最后，为了验证结果，使用杏子的物理化学数据构建了额外的模型。然后使用与用作参考方法的光谱数据相同的数据划分对这些参考模型进行测试，两种方法都获得了相似的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e77/12309246/a3078f02c3bc/as5c00068_0001.jpg

相似文献

Classification of Apricot Varieties by Infrared Spectroscopy and Machine Learning.

ACS Agric Sci Technol. 2025 Jul 8;5(7):1373-1381. doi: 10.1021/acsagscitech.5c00068. eCollection 2025 Jul 21.

Using ATR-FTIR spectroscopy and machine learning for forensic hair identification.

J Forensic Sci. 2025 Jul;70(4):1537-1543. doi: 10.1111/1556-4029.70062. Epub 2025 Jun 9.

Use of infrared spectroscopy ATR-FTIR as a rapid alternative for Colombian cocoa liquors: classification varieties and estimation of volatile compounds concentration.

Food Res Int. 2025 Oct;217:116828. doi: 10.1016/j.foodres.2025.116828. Epub 2025 Jun 16.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.

Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.

Predicting reticuloruminal pH and subacute ruminal acidosis of individual cows using machine learning and Fourier-transform infrared spectroscopy milk analysis.

J Dairy Sci. 2025 Aug;108(8):8606-8618. doi: 10.3168/jds.2024-25970. Epub 2025 Jun 9.

Carbon dioxide detection for diagnosis of inadvertent respiratory tract placement of enterogastric tubes in children.

Cochrane Database Syst Rev. 2025 Feb 19;2(2):CD011196. doi: 10.1002/14651858.CD011196.pub2.

FTIR-based molecular fingerprinting for the rapid classification of dengue and chikungunya from human sera using machine learning: an observational study.

Lancet Reg Health Southeast Asia. 2025 Jul 9;40:100630. doi: 10.1016/j.lansea.2025.100630. eCollection 2025 Sep.

Classification of finger movements through optimal EEG channel and feature selection.

Front Hum Neurosci. 2025 Jul 16;19:1633910. doi: 10.3389/fnhum.2025.1633910. eCollection 2025.

Machine Learning-Based Predictive Modeling of Infrared Spectroscopic Data from Thermal Conversion of Athabasca Bitumen.

ACS Omega. 2025 Jul 2;10(27):29836-29855. doi: 10.1021/acsomega.5c04463. eCollection 2025 Jul 15.

本文引用的文献

Visualizing the structural and chemical heterogeneity of fruit and vegetables using advanced imaging techniques: fundamentals, instrumental aspects, applications and future perspectives.

Crit Rev Food Sci Nutr. 2025;65(21):4147-4171. doi: 10.1080/10408398.2024.2384650. Epub 2024 Jul 30.

Exploring the Steps of Infrared (IR) Spectral Analysis: Pre-Processing, (Classical) Data Modelling, and Deep Learning.

Molecules. 2023 Sep 30;28(19):6886. doi: 10.3390/molecules28196886.

From bench to worktop: Rapid evaluation of nutritional parameters in liquid foodstuffs by IR spectroscopy.

Food Chem. 2021 Dec 15;365:130442. doi: 10.1016/j.foodchem.2021.130442. Epub 2021 Jun 22.

Novel Application of Near-infrared Spectroscopy and Chemometrics Approach for Detection of Lime Juice Adulteration.

Iran J Pharm Res. 2020 Spring;19(2):34-44. doi: 10.22037/ijpr.2019.112328.13686.

Fresh, freeze-dried or cell wall samples: Which is the most appropriate to determine chemical, structural and rheological variations during apple processing using ATR-FTIR spectroscopy?

Food Chem. 2020 Nov 15;330:127357. doi: 10.1016/j.foodchem.2020.127357. Epub 2020 Jun 16.

Use of Machine Learning and Infrared Spectra for Rheological Characterization and Application to the Apricot.

Sci Rep. 2019 Dec 16;9(1):19197. doi: 10.1038/s41598-019-55543-7.

SVM-RFE: selection and visualization of the most relevant features through non-linear kernels.

BMC Bioinformatics. 2018 Nov 19;19(1):432. doi: 10.1186/s12859-018-2451-4.

Variable Selection for Support Vector Machines in Moderately High Dimensions.

J R Stat Soc Series B Stat Methodol. 2016 Jan;78(1):53-76. doi: 10.1111/rssb.12100. Epub 2015 Jan 5.

Prediction of banana quality indices from color features using support vector regression.

Talanta. 2016;148:54-61. doi: 10.1016/j.talanta.2015.10.073. Epub 2015 Oct 26.

Direct determination of major components in human diets and baby foods.

Anal Bioanal Chem. 2015 Mar;407(7):1961-72. doi: 10.1007/s00216-015-8461-4. Epub 2015 Jan 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于红外光谱和机器学习的杏品种分类

Classification of Apricot Varieties by Infrared Spectroscopy and Machine Learning.

作者信息

Béjar-Grimalt Jaume, Pérez-Guaita David, Sánchez-Illana Ángel, García-Contreras Rodolfo, Kataria Rashmi, Bureau Sylvie, de la Guardia Miguel, Cadet Frédéric

机构信息

Department of Analytical Chemistry, University of Valencia, 46100 Burjassot, Spain.

Departamento de Microbiología y Parasitología, Facultad de Medicina, Universidad Nacional Autonoma de Mexico, 04510 Mexico City, Mexico.

出版信息

ACS Agric Sci Technol. 2025 Jul 8;5(7):1373-1381. doi: 10.1021/acsagscitech.5c00068. eCollection 2025 Jul 21.

DOI:10.1021/acsagscitech.5c00068

PMID:40740977

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12309246/

Abstract

摘要

基于红外光谱和机器学习的杏品种分类

Classification of Apricot Varieties by Infrared Spectroscopy and Machine Learning.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于红外光谱和机器学习的杏品种分类

Classification of Apricot Varieties by Infrared Spectroscopy and Machine Learning.

作者信息

机构信息

出版信息

相似文献

本文引用的文献