Lee Soo Yee, Mediani Ahmed, Maulidiani Maulidiani, Khatib Alfi, Ismail Intan Safinar, Zawawi Norhasnida, Abas Faridah
Laboratory of Natural Products, Institute of Bioscience, Universiti Putra Malaysia, Serdang, Selangor, Malaysia.
Department of Food Science, Faculty of Food Science and Technology, Universiti Putra Malaysia, Serdang, Selangor, Malaysia.
J Sci Food Agric. 2018 Jan;98(1):240-252. doi: 10.1002/jsfa.8462. Epub 2017 Jul 28.
Neptunia oleracea is a plant consumed as a vegetable and which has been used as a folk remedy for several diseases. Herein, two regression models (partial least squares, PLS; and random forest, RF) in a metabolomics approach were compared and applied to the evaluation of the relationship between phenolics and bioactivities of N. oleracea. In addition, the effects of different extraction conditions on the phenolic constituents were assessed by pattern recognition analysis.
Comparison of the PLS and RF showed that RF exhibited poorer generalization and hence poorer predictive performance. Both the regression coefficient of PLS and the variable importance of RF revealed that quercetin and kaempferol derivatives, caffeic acid and vitexin-2-O-rhamnoside were significant towards the tested bioactivities. Furthermore, principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA) results showed that sonication and absolute ethanol are the preferable extraction method and ethanol ratio, respectively, to produce N. oleracea extracts with high phenolic levels and therefore high DPPH scavenging and α-glucosidase inhibitory activities.
Both PLS and RF are useful regression models in metabolomics studies. This work provides insight into the performance of different multivariate data analysis tools and the effects of different extraction conditions on the extraction of desired phenolics from plants. © 2017 Society of Chemical Industry.
水蕹菜是一种可作为蔬菜食用的植物,并且已被用作多种疾病的民间疗法。在此,对代谢组学方法中的两种回归模型(偏最小二乘法,PLS;和随机森林,RF)进行了比较,并将其应用于评估水蕹菜酚类物质与生物活性之间的关系。此外,通过模式识别分析评估了不同提取条件对酚类成分的影响。
PLS和RF的比较表明,RF表现出较差的泛化能力,因此预测性能较差。PLS的回归系数和RF的变量重要性均表明,槲皮素和山奈酚衍生物、咖啡酸和牡荆素 - 2 - O - 鼠李糖苷对所测试的生物活性具有显著影响。此外,主成分分析(PCA)和偏最小二乘判别分析(PLS - DA)结果表明,超声处理和无水乙醇分别是制备具有高酚含量以及因此具有高DPPH清除和α - 葡萄糖苷酶抑制活性的水蕹菜提取物的优选提取方法和乙醇比例。
PLS和RF都是代谢组学研究中有用的回归模型。这项工作深入了解了不同多变量数据分析工具的性能以及不同提取条件对从植物中提取所需酚类物质的影响。© 2017化学工业协会。