Improving prediction of groundwater quality in situations of limited monitoring data based on virtual sample generation and Gaussian process regression.

Affiliations

Key Laboratory of Groundwater Resources and Environment, Ministry of Education, Jilin University, Changchun 130021, China; Jilin Provincial Key Laboratory of Water Resources and Environment, Jilin University, Changchun 130021, China; College of New Energy and Environment, Jilin University, Changchun 130021, China; National-Local Joint Engineering Laboratory of In-Situ Conversion, Drilling and Exploitation Technology for Oil Shale, Changchun 130021, China.

Publication information

Water Res. 2024 Dec 1;267:122498. doi: 10.1016/j.watres.2024.122498. Epub 2024 Sep 21.

DOI: 10.1016/j.watres.2024.122498
PMID: 39332348
Abstract

The increasing pollution of aquifers by human activities over recent decades poses a threat to drinking water safety. While Gaussian Process Regression (GPR) is a robust tool for predicting and monitoring water quality, its effectiveness is hindered by the limited data available for model training and validation, known as the "small sample problem". Virtual sample generation (VSG) is one of several approaches proposed to address this problem. This study aimed to increase the accuracy of GPR for predicting water quality when datasets are limited. Three VSG methods, namely Multi Distribution Mega-Trend Diffusion (MD-MTD), Generative Adversarial Network (GAN), and t-distributed stochastic neighbor embedding (t-SNE), were compared for their ability to enhance the accuracy of GPR predictions of strontium (Sr) in the shallow aquifer system in Songyuan, Jilin Province. t-SNE provided the largest improvement in GPR accuracy, with R increasing from 0.86 to 0.99 (12.98%), followed by MD-MTD (R of 0.95, 9.39%), with the smallest improvement obtained by GAN (R of 0.92, 5.98%). Boxplots show that MD-MTD-GPR predictions do not fully capture the observed data distribution. GAN accurately replicates the data distribution, while t-SNE-GPR achieves the highest prediction accuracy and handles data fluctuations. GPR accuracy improves as the number of virtual samples increases but tends to decrease once the number exceeds 258 in this study. These results can guide efforts to improve GPR accuracy with limited datasets and can help improve water quality management and drinking water safety in regions with sparse monitoring data.
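The general workflow described in the abstract (augment a small training set with virtual samples, fit a GPR model, score it with R) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the synthetic sine data, the RBF kernel and its fixed hyperparameters, and in particular `generate_virtual_samples`, which is a simple nearest-neighbour interpolation stand-in and not MD-MTD, GAN, or t-SNE as used in the study. The sketch makes no claim about whether augmentation helps on any given dataset.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=1.0, variance=1.0):
    """Squared-exponential kernel between sample sets a (n, d) and b (m, d)."""
    sq_dist = (
        np.sum(a**2, axis=1)[:, None]
        + np.sum(b**2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    return variance * np.exp(-0.5 * np.maximum(sq_dist, 0.0) / length_scale**2)


def gpr_predict(x_train, y_train, x_test, noise=1e-2, length_scale=1.0):
    """Posterior mean of a GP with an RBF kernel (targets are centred first)."""
    y_mean = y_train.mean()
    k_train = rbf_kernel(x_train, x_train, length_scale)
    k_train = k_train + noise * np.eye(len(x_train))  # observation noise term
    chol = np.linalg.cholesky(k_train)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_train - y_mean))
    return rbf_kernel(x_test, x_train, length_scale) @ alpha + y_mean


def generate_virtual_samples(x, y, n_virtual, rng, jitter=0.01):
    """Hypothetical stand-in VSG: interpolate between a real sample and its
    nearest neighbour, then add small Gaussian noise to the target."""
    dists = np.linalg.norm(x[:, None, :] - x[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # a sample is not its own neighbour
    nearest = dists.argmin(axis=1)
    i = rng.integers(0, len(x), size=n_virtual)
    j = nearest[i]
    w = rng.uniform(0.0, 1.0, size=(n_virtual, 1))
    x_new = w * x[i] + (1.0 - w) * x[j]
    y_new = w[:, 0] * y[i] + (1.0 - w[:, 0]) * y[j]
    return x_new, y_new + rng.normal(0.0, jitter, n_virtual)


def pearson_r(y_true, y_pred):
    """Pearson correlation coefficient between observations and predictions."""
    return float(np.corrcoef(y_true, y_pred)[0, 1])


# Synthetic small-sample demo (assumed data, unrelated to the Songyuan dataset).
rng = np.random.default_rng(42)
x_small = rng.uniform(0.0, 6.0, size=(15, 1))
y_small = np.sin(x_small[:, 0]) + rng.normal(0.0, 0.05, 15)
x_test = np.linspace(0.0, 6.0, 50)[:, None]
y_test = np.sin(x_test[:, 0])

x_virt, y_virt = generate_virtual_samples(x_small, y_small, 30, rng)
x_aug = np.vstack([x_small, x_virt])
y_aug = np.concatenate([y_small, y_virt])

print("R, real samples only:", pearson_r(y_test, gpr_predict(x_small, y_small, x_test)))
print("R, real + virtual:   ", pearson_r(y_test, gpr_predict(x_aug, y_aug, x_test)))
```

Varying `n_virtual` in a loop and recording R is one way to explore the kind of accuracy-versus-sample-count trade-off the abstract reports, though the threshold of 258 is specific to the study's data.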

Similar articles

1
Improving prediction of groundwater quality in situations of limited monitoring data based on virtual sample generation and Gaussian process regression.
Water Res. 2024 Dec 1;267:122498. doi: 10.1016/j.watres.2024.122498. Epub 2024 Sep 21.
2
Optimizing coastal groundwater quality predictions: A novel data mining framework with cross-validation, bootstrapping, and entropy analysis.
J Contam Hydrol. 2025 Feb;269:104480. doi: 10.1016/j.jconhyd.2024.104480. Epub 2024 Dec 10.
3
Monitoring the evolution and migration of a methane gas plume in an unconfined sandy aquifer using time-lapse GPR and ERT.
J Contam Hydrol. 2017 Oct;205:12-24. doi: 10.1016/j.jconhyd.2017.08.011. Epub 2017 Aug 30.
4
An approach based on multivariate distribution and Gaussian copulas to predict groundwater quality using DNN models in a data scarce environment.
MethodsX. 2023 Feb 2;10:102034. doi: 10.1016/j.mex.2023.102034. eCollection 2023.
5
Enhanced virtual sample generation based on manifold features: Applications to developing soft sensor using small data.
ISA Trans. 2022 Jul;126:398-406. doi: 10.1016/j.isatra.2021.07.033. Epub 2021 Jul 23.
6
Groundwater health risk assessment and its temporal and spatial evolution based on trapezoidal fuzzy number-Monte Carlo stochastic simulation: A case study in western Jilin province.
Ecotoxicol Environ Saf. 2024 Sep 1;282:116736. doi: 10.1016/j.ecoenv.2024.116736. Epub 2024 Jul 17.
7
Interpolation of extensive routine water pollution monitoring datasets: methodology and discussion of implications for aquifer management.
Environ Sci Process Impacts. 2014 Aug;16(8):2007-17. doi: 10.1039/c4em00190g.
8
The nitrogen cycle in highly urbanized tropical regions and the effect of river-aquifer interactions: The case of Jakarta and the Ciliwung River.
J Contam Hydrol. 2016 Sep;192:87-100. doi: 10.1016/j.jconhyd.2016.06.004. Epub 2016 Jul 2.
9
Time-lapse dielectric properties monitoring of the flow cell during DNAPL contamination and remediation processes by full-waveform inversion of GPR data using particle swarm optimization: A laboratory study.
J Contam Hydrol. 2024 Nov;267:104443. doi: 10.1016/j.jconhyd.2024.104443. Epub 2024 Oct 10.
10
Machine learning predictive insight of water pollution and groundwater quality in the Eastern Province of Saudi Arabia.
Sci Rep. 2024 Aug 28;14(1):20031. doi: 10.1038/s41598-024-70610-4.

Cited by

1
Evaluation of ecological geological environment carrying capacity and analysis of driving mechanisms based on normal cloud model and geodetector model.
Sci Rep. 2025 Jan 22;15(1):2800. doi: 10.1038/s41598-025-85761-1.