Cheng Kaiqi, Xiao Jingzhe, He Jingyuan, Yang Rongguang, Pei Jinjin, Jin Wengang, Abd El-Aty A M
Qinba State Key Laboratory of Biological Resource and Ecological Environment (Incubation), Collaborative Innovation Center of Bio-Resource in Qinba Mountain Area, Shaanxi University of Technology, Hanzhong, China.
Key Laboratory of Bio-Resources of Shaanxi Province, School of Bioscience and Engineering, Shaanxi University of Technology, Hanzhong, China.
Front Nutr. 2025 Apr 22;12:1582576. doi: 10.3389/fnut.2025.1582576. eCollection 2025.
Colored onions are favored by consumers due to their distinctive aroma, rich phytochemical content, and diverse biological activities. However, comprehensive analyses of their phytochemical profiles and volatile metabolites remain limited.
In this study, total phenols, flavonoids, anthocyanins, carotenoids, and antioxidant activities of three colored onion bulbs were evaluated. Volatile metabolites were identified using headspace solid-phase microextraction combined with gas chromatography-mass spectrometry (HS-SPME/GC-MS). Multivariate statistical analyses, feature selection techniques (SelectKBest, LASSO), and machine learning models were applied to further analyze and classify the metabolite profiles.
Significant differences in phytochemical composition and antioxidant activities were observed among the three onion types. A total of 243 volatile metabolites were detected, with sulfur compounds accounting for 51-64%, followed by organic acids and their derivatives (4-19%). Multivariate analysis revealed distinct volatile profiles, and 19 key metabolites were identified as biomarkers. Additionally, 33 and 38 feature metabolites were selected by SelectKBest and LASSO, respectively. The 38 features selected by LASSO enabled clear differentiation of onion types via PCA, UMAP, and k-means clustering. Among the four machine learning models tested, the random forest model achieved the highest classification accuracy (1.00). SHAP analysis further confirmed 20 metabolites as potential key markers.
The findings suggest that the combination of HS-SPME/GC-MS and machine learning, particularly the random forest algorithm, is a powerful approach for characterizing and classifying volatile metabolite profiles in colored onions. This method holds potential for quality assessment and breeding applications.
彩色洋葱因其独特的香气、丰富的植物化学成分和多样的生物活性而受到消费者青睐。然而,对其植物化学特征和挥发性代谢物的全面分析仍然有限。
在本研究中,评估了三种彩色洋葱鳞茎的总酚、黄酮类化合物、花青素、类胡萝卜素和抗氧化活性。使用顶空固相微萃取结合气相色谱 - 质谱联用(HS-SPME/GC-MS)鉴定挥发性代谢物。应用多元统计分析、特征选择技术(SelectKBest、LASSO)和机器学习模型进一步分析和分类代谢物谱。
在三种洋葱类型中观察到植物化学成分和抗氧化活性存在显著差异。共检测到243种挥发性代谢物,其中硫化合物占51 - 64%,其次是有机酸及其衍生物(4 - 19%)。多元分析揭示了不同的挥发性谱,并鉴定出19种关键代谢物作为生物标志物。此外,SelectKBest和LASSO分别选择了33种和38种特征代谢物。LASSO选择的38个特征通过主成分分析(PCA)、均匀流形近似和投影(UMAP)以及k均值聚类能够清晰区分洋葱类型。在测试的四种机器学习模型中,随机森林模型达到了最高的分类准确率(1.00)。SHAP分析进一步确认了20种代谢物为潜在的关键标志物。
研究结果表明,HS-SPME/GC-MS与机器学习(特别是随机森林算法)相结合是表征和分类彩色洋葱挥发性代谢物谱的有力方法。该方法在质量评估和育种应用方面具有潜力。