Willenberg Ina, Parma Alessandra, Bonte Anja, Matthäus Bertrand
Working Group for Lipid Research, Department of Safety and Quality of Cereals, Max Rubner-Institut (MRI), 32756 Detmold, Germany.
Foods. 2021 Feb 23;10(2):479. doi: 10.3390/foods10020479.
In the presented study a non-targeted approach using high-performance liquid chromatography coupled to electrospray ionization quadrupole time-of-flight mass spectrometry (HPLC-ESI-qToF-MS) combined with chemometric techniques was used to build a statistical model to verify the geographic origin of virgin olive oils. The sample preparation by means of liquid/liquid extraction of polar compounds was optimized regarding the number of multiple extractions, application of ultrasonic treatment and temperature during concentration of the analytes. The presented workflow for data processing aimed to identify the most predictive features and was applied to a set of 95 olive oils from Spain, Italy, Portugal and Greece. Different strategies for data reduction and multivariate analysis were compared. Stepwise variable selection showed for both applied multivariate models-linear discriminant analysis (LDA) and logit regression (LR)-to be the most suitable variable selection strategy. The 10-fold cross validation of the LDA showed a classification rate of 83.1% for the test set. For the LR models the prediction accuracy of the test set was even higher with values of 90.4% (Portugal), 86.2% (Italy), 93.8% (Greece) and 88.3% (Spain). Moreover, the reduction of features allows an easier following up strategy for identification of the unknowns and defining marker substances.
在本研究中,采用了一种非靶向方法,即利用高效液相色谱与电喷雾电离四极杆飞行时间质谱联用技术(HPLC-ESI-qToF-MS)并结合化学计量学技术,构建统计模型以验证初榨橄榄油的地理来源。通过液/液萃取极性化合物进行样品制备,在多次萃取次数、超声处理应用以及分析物浓缩过程中的温度等方面进行了优化。所呈现的数据处理工作流程旨在识别最具预测性的特征,并应用于一组来自西班牙、意大利、葡萄牙和希腊的95种橄榄油。比较了不同的数据降维和多元分析策略。逐步变量选择显示,对于所应用的两种多元模型——线性判别分析(LDA)和逻辑回归(LR)——而言,是最合适的变量选择策略。LDA的10倍交叉验证显示,测试集的分类率为83.1%。对于LR模型,测试集的预测准确率更高,葡萄牙为90.4%、意大利为86.2%、希腊为93.8%、西班牙为88.3%。此外,特征的减少使得对未知物的识别和定义标记物质的后续追踪策略更加容易。