Gumus Ozgur, Yasar Erkan, Gumus Z Pinar, Ertas Hasan
1Department of Computer Engineering, Faculty of Engineering, Ege University, 35100 Bornova, Izmir, Turkey.
2Central Research Testing and Analysis Laboratory Research and Application Center (EGE-MATAL), Ege University, 35100 Bornova, Izmir, Turkey.
J Food Sci Technol. 2020 Apr;57(4):1535-1543. doi: 10.1007/s13197-019-04189-4. Epub 2019 Nov 26.
Research on investigation and determination of geographic origins of olive oils is increased by consumers' demand to authenticated olive oils. Classification algorithms which are machine learning methods can be employed for the authentication of olive oils. In this study, different classification algorithms were evaluated to reveal the most accurate one for authentication of Turkish olive oils. BayesNet, Naive Bayes, Multilayer Perception, IBK, Kstar, SMO, Random Forest, J48, LWL, Logistic Regression, Simple Logistic, LogitBoost algorithms were implemented on 61 chemical analysis parameters of 49 olive oil samples from 6 different locations at Western Turkey. These 61 parameters were obtained from five different chemical analyses which are stable carbon isotope ratio, trace elements, sterol compositions, FAMEs and TAGs. This study is the most comprehensive study to determine the geographical origin of Turkish olive oils in terms of these mentioned features. Classification performances of the algorithms were compared using accuracy, specificity and sensitivity metrics. Random Forest, BayesNet, and LogitBoost algorithms were found as the best classification algorithms for authentication of Turkish olive oils. Using the classification model in this study, geographic origin of an unknown olive oil can be predicted with high accuracy. Besides, similar models can be developed to obtain useful information for authentication of other food products.
消费者对正宗橄榄油的需求推动了橄榄油地理来源调查与测定的研究。作为机器学习方法的分类算法可用于橄榄油的认证。在本研究中,对不同的分类算法进行了评估,以找出用于土耳其橄榄油认证的最准确算法。在来自土耳其西部6个不同地点的49个橄榄油样品的61个化学分析参数上实施了贝叶斯网络、朴素贝叶斯、多层感知器、IBK、Kstar、SMO、随机森林、J48、局部加权学习、逻辑回归、简单逻辑回归、LogitBoost算法。这61个参数来自五种不同的化学分析,即稳定碳同位素比率、微量元素、甾醇成分、脂肪酸甲酯和甘油三酯。就上述特征而言,本研究是确定土耳其橄榄油地理来源的最全面研究。使用准确率、特异性和灵敏度指标比较了算法的分类性能。随机森林、贝叶斯网络和LogitBoost算法被发现是土耳其橄榄油认证的最佳分类算法。使用本研究中的分类模型,可以高精度预测未知橄榄油的地理来源。此外,可以开发类似的模型以获取用于其他食品认证的有用信息。