Wang Yu, Xing Jianguo, Qian Shu
School of Computer Science & Information Engineering, Zhejiang Gongshang University, Hangzhou 310018, China.
Sensors (Basel). 2017 Oct 16;17(10):2356. doi: 10.3390/s17102356.
In order to enhance the selectivity of metal oxide gas sensors, we use a flow modulation method to exploit transient sensor information. The method is based on modulating the flow of the carrier gas that brings the species to be measured into the sensor chamber. We present an active perception strategy by using a DQN which can optimize the flow modulation online. The advantage of DQN is not only that the classification accuracy is higher than traditional methods such as PCA, but also that it has a good adaptability under small samples and labeled data. From observed values of the sensors array and its past experiences, the DQN learns an action policy to change the flow speed dynamically that maximizes the total rewards (or minimizes the classification error). Meanwhile, a CNN is trained to predict sample class and reward according to current actions and observation of sensors. We demonstrate our proposed methods on a gases classification problem in a real time environment. The results show that the DQN learns to modulate flow to classify different gas and the correct rates of gases are: sesame oil 100%, lactic acid 80%, acetaldehyde 80%, acetic acid 80%, and ethyl acetate 100%, the average correct rate is 88%. Compared with the traditional method, the results of PCA are: sesame oil 100%, acetic acid 24%, acetaldehyde 100%, lactic acid 56%, ethyl acetate 68%, the average accuracy rate is 69.6%. DQN uses fewer steps to achieve higher recognition accuracy and improve the recognition speed, and to reduce the training and testing costs.
为了提高金属氧化物气体传感器的选择性,我们采用流量调制方法来利用瞬态传感器信息。该方法基于调制载气的流量,载气将待测物质带入传感器腔室。我们提出了一种主动感知策略,通过使用深度Q网络(DQN)来在线优化流量调制。DQN的优势不仅在于分类准确率高于主成分分析(PCA)等传统方法,还在于它在小样本和有标签数据下具有良好的适应性。基于传感器阵列的观测值及其过往经验,DQN学习一种动作策略,以动态改变流速,从而使总奖励最大化(或使分类误差最小化)。同时,训练一个卷积神经网络(CNN)根据当前动作和传感器观测来预测样本类别和奖励。我们在实时环境中的气体分类问题上展示了我们提出的方法。结果表明,DQN学会了调制流量以对不同气体进行分类,各种气体的正确识别率分别为:芝麻油100%、乳酸80%、乙醛80%、乙酸80%、乙酸乙酯100%,平均正确识别率为88%。与传统方法相比,PCA的结果为:芝麻油100%、乙酸24%、乙醛100%、乳酸56%、乙酸乙酯68%,平均准确率为69.6%。DQN用更少的步骤实现了更高的识别准确率,提高了识别速度,并降低了训练和测试成本。