Shi Haoxin, Guo Jian, Deng Yuandong, Qin Zixuan
State Key Laboratory of Geohazard Prevention and Geoenviromment Protection, Chengdu University of Technology, Chengdu, 610059, China.
College of Construction Engineering, Jilin University, Changchun, 130026, China.
Sci Rep. 2023 Sep 7;13(1):14718. doi: 10.1038/s41598-023-38447-5.
Detection of subsurface hydrodynamic anomalies plays a significant role in groundwater resource management and environmental monitoring. In this paper, based on data from the groundwater level, atmospheric pressure, and precipitation in the Chengdu area of China, a method for detecting outliers considering the factors affecting groundwater levels is proposed. By analyzing the factors affecting groundwater levels in the monitoring site and eliminating them, simplified groundwater data is obtained. Applying sl-Pauta (self-learning-based Pauta), iForest (Isolated Forest), OCSVM (One-Class SVM), and KNN to synthetic data with known outliers, testing and evaluating the effectiveness of 4 technologies. Finally, the four methods are applied to the detection of outliers in simplified groundwater levels. The results show that in the detection of outliers in synthesized data, the OCSVM method has the best detection performance, with a precision rate of 88.89%, a recall rate of 91.43%, an F1 score of 90.14%, and an AUC value of 95.66%. In the detection of outliers in simplified groundwater levels, a qualitative analysis of the displacement data within the field of view indicates that the outlier detection performance of iForest and OCSVM is better than that of KNN. The proposed method for considering the factors affecting groundwater levels can improve the efficiency and accuracy of detecting outliers in groundwater level data.
地下流体动力异常的探测在地下水资源管理和环境监测中具有重要作用。本文基于中国成都地区地下水位、大气压力和降水数据,提出一种考虑影响地下水位因素的异常值检测方法。通过分析监测点影响地下水位的因素并予以消除,得到简化的地下水数据。将滑动式帕陶(基于自学习的帕陶)、孤立森林、一类支持向量机和K近邻算法应用于带有已知异常值的合成数据,测试和评估这4种技术的有效性。最后,将这4种方法应用于简化地下水位异常值的检测。结果表明,在合成数据异常值检测中,一类支持向量机方法检测性能最佳,精确率为88.89%,召回率为91.43%,F1分数为90.14%,AUC值为95.66%。在简化地下水位异常值检测中,对视野内位移数据进行定性分析表明,孤立森林和一类支持向量机的异常值检测性能优于K近邻算法。所提出的考虑影响地下水位因素的方法能够提高地下水位数据异常值检测的效率和准确性。