Institute of Organic Chemistry , University of Hamburg , Martin-Luther-King-Platz 6 , 20146 Hamburg , Germany.
Hamburg School of Food Science, Institute of Food Chemistry , University of Hamburg , Grindelallee 117 , 20146 Hamburg , Germany.
J Agric Food Chem. 2018 Nov 7;66(44):11873-11879. doi: 10.1021/acs.jafc.8b03724. Epub 2018 Oct 24.
A total of 262 authentic samples was analyzed by H NMR spectroscopy for the geographical discrimination of hazelnuts ( Corylus avellana L.) covering samples from five countries (Germany, France, Georgia, Italy, and Turkey) and the harvest years 2013-2016. This article describes method development starting with an extraction protocol suitable for separation of polar and nonpolar metabolites in addition to reduction of macromolecular components. Using the polar fraction for data analysis, principle component analysis was applied and used to monitor sample preparation and measurement. Several machine learning algorithms were tested to build a classification model. The best results were obtained by a linear discrimination analysis applying a random subspace algorithm. The division of the samples in a trainings set and a test set yielded a cross validation accuracy of 91% for the training set and an accuracy of 96% for the test set. The identification of key features was carried out by Kruskal-Wallis test and t test. A feature assigned to betaine exhibits a significant level for the classification of all five countries and is considered a possible candidate for the development of targeted approaches. Further, the results were compared to a previously published study based on LC-MS analysis of nonpolar metabolites. In summary, this study shows the robustness and high accuracy of a discrimination model based on NMR analysis of polar metabolites.
对来自五个国家(德国、法国、格鲁吉亚、意大利和土耳其)和 2013-2016 年收获年份的榛子(Corylus avellana L.)的 262 个真实样本进行了 1 H NMR 波谱分析,以进行地理判别。本文描述了从适合分离极性和非极性代谢物以及减少大分子成分的提取方案开始的方法开发。使用极性部分进行数据分析,应用主成分分析并用于监测样品制备和测量。测试了几种机器学习算法来构建分类模型。通过应用随机子空间算法的线性判别分析获得了最佳结果。将样品分为训练集和测试集,训练集的交叉验证准确率为 91%,测试集的准确率为 96%。通过 Kruskal-Wallis 检验和 t 检验进行了关键特征的识别。被分配给甜菜碱的特征对于所有五个国家的分类均具有显著水平,被认为是开发有针对性方法的可能候选者。此外,还将结果与先前基于非极性代谢物 LC-MS 分析的研究进行了比较。总之,这项研究表明基于极性代谢物 NMR 分析的判别模型具有稳健性和高精度。