Men Hong, Shi Yan, Fu Songlin, Jiao Yanan, Qiao Yu, Liu Jingjing
College of Automation Engineering, Northeast Electric Power University, Jilin 132012, China.
Sensors (Basel). 2017 Jul 19;17(7):1656. doi: 10.3390/s17071656.
Multi-sensor data fusion can provide more comprehensive and more accurate analysis results. However, it also brings some redundant information, which is an important issue with respect to finding a feature-mining method for intuitive and efficient analysis. This paper demonstrates a feature-mining method based on variable accumulation to find the best expression form and variables' behavior affecting beer flavor. First, e-tongue and e-nose were used to gather the taste and olfactory information of beer, respectively. Second, principal component analysis (PCA), genetic algorithm-partial least squares (GA-PLS), and variable importance of projection (VIP) scores were applied to select feature variables of the original fusion set. Finally, the classification models based on support vector machine (SVM), random forests (RF), and extreme learning machine (ELM) were established to evaluate the efficiency of the feature-mining method. The result shows that the feature-mining method based on variable accumulation obtains the main feature affecting beer flavor information, and the best classification performance for the SVM, RF, and ELM models with 96.67%, 94.44%, and 98.33% prediction accuracy, respectively.
多传感器数据融合能够提供更全面、更准确的分析结果。然而,它也带来了一些冗余信息,这对于寻找一种直观且高效分析的特征挖掘方法而言是一个重要问题。本文展示了一种基于变量累积的特征挖掘方法,以找出最佳表达形式以及影响啤酒风味的变量行为。首先,分别使用电子舌和电子鼻来收集啤酒的味觉和嗅觉信息。其次,应用主成分分析(PCA)、遗传算法 - 偏最小二乘法(GA - PLS)以及投影变量重要性(VIP)得分来选择原始融合集的特征变量。最后,建立基于支持向量机(SVM)、随机森林(RF)和极限学习机(ELM)的分类模型,以评估该特征挖掘方法的效率。结果表明,基于变量累积的特征挖掘方法获得了影响啤酒风味信息的主要特征,并且对于SVM、RF和ELM模型分别具有96.67%、94.44%和98.33%的预测准确率,展现出最佳的分类性能。