Bureau of Economic Geology, Jackson School of Geosciences, The University of Texas at Austin, Austin, Texas 78758-4445, United States.
Environ Sci Technol. 2021 Nov 16;55(22):15531-15541. doi: 10.1021/acs.est.1c04048. Epub 2021 Oct 25.
Driven by the collection of enormous amounts of streaming data from sensors, and with the emergence of the internet of things, the need for developing robust detection techniques to identify data anomalies has increased recently. The algorithms for anomaly detection are required to be selected based on the type of data. In this study, we propose a predictive anomaly detection technique, DeepSense, which is applied to soil gas concentration data acquired from sensors being used for environmental characterization at a prospective CO storage site in Queensland, Australia. DeepSense takes advantage of deep-learning algorithms as its predictor module and uses a process-based soil gas method as the basis of its anomaly detector module. The proposed predictor framework leverages the power of convolutional neural network algorithms for feature extraction and simultaneously captures the long-term temporal dependency through long short-term memory algorithms. The proposed process-based anomaly detection method is a cost-effective alternative to the conventional concentration-based soil gas methodologies which rely on long-term baseline surveys for defining the threshold level. The results indicate that the proposed framework performs well in diagnosing anomalous data in soil gas concentration data streams. The robustness and efficacy of the DeepSense were verified against data sets acquired from different monitoring stations of the storage site.
受传感器采集大量流式数据的推动,以及物联网的出现,最近对开发强大的检测技术以识别数据异常的需求有所增加。异常检测算法需要根据数据类型进行选择。在这项研究中,我们提出了一种预测异常检测技术 DeepSense,它应用于从澳大利亚昆士兰州一个潜在的 CO2 储存场地的传感器获取的土壤气体浓度数据,用于环境特征描述。DeepSense 利用深度学习算法作为其预测器模块,并使用基于过程的土壤气体方法作为其异常检测器模块的基础。所提出的预测器框架利用卷积神经网络算法的强大功能进行特征提取,并通过长短时记忆算法同时捕捉长期时间依赖性。所提出的基于过程的异常检测方法是对传统基于浓度的土壤气体方法的一种具有成本效益的替代方法,后者依赖于长期基线调查来定义阈值水平。结果表明,所提出的框架在诊断土壤气体浓度数据流中的异常数据方面表现良好。DeepSense 的稳健性和有效性已针对来自储存场地不同监测站的数据集进行了验证。