Spatial heterogeneity modeling of water quality based on random forest regression and model interpretation.

Affiliations

College of Environmental & Resource Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China.

Department of Civil and Environmental Engineering, Case Western Reserve University, Cleveland, OH, 44106, United States.

Publication information

Environ Res. 2021 Nov;202:111660. doi: 10.1016/j.envres.2021.111660. Epub 2021 Jul 12.

DOI: 10.1016/j.envres.2021.111660
PMID: 34265353

Abstract

A systematic understanding of the spatial distribution of water quality is critical for successful watershed management; however, the limited number of physical monitoring stations has restricted the evaluation of spatial water quality distribution and the identification of features impacting the water quality. To fill this gap, we developed a modeling process that employed the random forest regression (RFR) to model the water quality distribution for the Taihu Lake basin in Zhejiang Province, China, and adopted the Shapley Additive exPlanations (SHAP) method to interpret the underlying driving forces. We first used RFR to model three water quality parameters: permanganate index (COD), total phosphorus (TP), and total nitrogen (TN), based on 16 watershed features. We then applied the built models to generate water quality distribution maps for the basin, with the COD ranging from 1.39 to 6.40 mg/L, TP from 0.02 to 0.23 mg/L, and TN from 1.43 to 4.27 mg/L. These maps showed generally consistent patterns among the COD, TN, and TP with minor differences in the spatial distribution. The SHAP analysis showed that the TN was mainly affected by agricultural non-point sources, while the COD and TP were affected by agricultural and domestic sources. Due to differences in sewage collection and treatment between urban and rural areas, the water quality in highly populated urban areas was better than that in rural areas, which led to an unexpected positive relationship between water quality and population density. Overall, with the RFR models and SHAP interpretation, we obtained a continuous distribution pattern of the water quality and identified its driving forces in the basin. These findings provided important information to assist water quality restoration projects.
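The interpretation step in the abstract relies on Shapley-value attributions, which split a single prediction into additive per-feature contributions. The sketch below is not the authors' code and does not use the SHAP library; it computes exact Shapley values by brute force for a hypothetical three-feature stand-in model. The function `toy_tn_model`, its coefficients, the feature names, and the all-zero baseline are invented for illustration only and are not taken from the study.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley attributions of predict(x) relative to predict(baseline).

    Features outside a coalition are held at their baseline value.
    The cost grows as 2**n, so this is only practical for a few features.
    """
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight for coalitions of this size
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for coalition in combinations(others, size):
                without_i = [x[j] if j in coalition else baseline[j]
                             for j in range(n)]
                with_i = list(without_i)
                with_i[i] = x[i]
                # Marginal contribution of feature i to this coalition
                phi[i] += weight * (predict(with_i) - predict(without_i))
    return phi


def toy_tn_model(features):
    """Hypothetical stand-in for a fitted regressor (made-up coefficients)."""
    cropland, population, forest = features
    return 1.5 + 2.0 * cropland - 0.5 * population + 0.8 * cropland * forest


x = [0.6, 0.3, 0.2]          # hypothetical feature values for one location
baseline = [0.0, 0.0, 0.0]   # hypothetical reference point
phi = shapley_values(toy_tn_model, x, baseline)

# Attributions are additive: they sum to prediction minus baseline prediction.
gap = toy_tn_model(x) - toy_tn_model(baseline)
print(phi, sum(phi), gap)
```

With 16 features, as in the study, exact enumeration of this kind would be expensive; tree-specific approximations such as those in the SHAP library are the usual choice for random forests.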

