• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

特征选择方法在地下水硝酸盐污染预测模型中的应用:筛选法、嵌入法和包裹法的评估。

Feature selection approaches for predictive modelling of groundwater nitrate pollution: An evaluation of filters, embedded and wrapper methods.

机构信息

Physical Geography and Regional Geographic Analysis, University of Seville, Seville 41004, Spain; Geography and Environment, School of Geography, University of Southampton, Southampton SO17 1BJ, United Kingdom.

Unidad del IGME en Granada, Urbanización Alcazar del Genil, 4, 18006 Granada, Spain.

出版信息

Sci Total Environ. 2018 May 15;624:661-672. doi: 10.1016/j.scitotenv.2017.12.152. Epub 2017 Dec 27.

DOI:10.1016/j.scitotenv.2017.12.152
PMID:29272835
Abstract

Recognising the various sources of nitrate pollution and understanding system dynamics are fundamental to tackle groundwater quality problems. A comprehensive GIS database of twenty parameters regarding hydrogeological and hydrological features and driving forces were used as inputs for predictive models of nitrate pollution. Additionally, key variables extracted from remotely sensed Normalised Difference Vegetation Index time-series (NDVI) were included in database to provide indications of agroecosystem dynamics. Many approaches can be used to evaluate feature importance related to groundwater pollution caused by nitrates. Filters, wrappers and embedded methods are used to rank feature importance according to the probability of occurrence of nitrates above a threshold value in groundwater. Machine learning algorithms (MLA) such as Classification and Regression Trees (CART), Random Forest (RF) and Support Vector Machines (SVM) are used as wrappers considering four different sequential search approaches: the sequential backward selection (SBS), the sequential forward selection (SFS), the sequential forward floating selection (SFFS) and sequential backward floating selection (SBFS). Feature importance obtained from RF and CART was used as an embedded approach. RF with SFFS had the best performance (mmce=0.12 and AUC=0.92) and good interpretability, where three features related to groundwater polluted areas were selected: i) industries and facilities rating according to their production capacity and total nitrogen emissions to water within a 3km buffer, ii) livestock farms rating by manure production within a 5km buffer and, iii) cumulated NDVI for the post-maximum month, being used as a proxy of vegetation productivity and crop yield.

摘要

认识到硝酸盐污染的各种来源并了解系统动态是解决地下水质量问题的基础。使用了一个包含二十个与水文地质和水文特征及驱动力有关的参数的综合 GIS 数据库作为硝酸盐污染预测模型的输入。此外,从遥感归一化差异植被指数时间序列(NDVI)中提取的关键变量也被包含在数据库中,以提供农业生态系统动态的指示。有许多方法可用于评估与硝酸盐引起的地下水污染有关的特征重要性。过滤器、包装器和嵌入式方法可根据地下水硝酸盐含量超过阈值的概率对特征重要性进行排序。机器学习算法(MLA),如分类和回归树(CART)、随机森林(RF)和支持向量机(SVM),可作为包装器考虑四种不同的顺序搜索方法:顺序后向选择(SBS)、顺序前向选择(SFS)、顺序前向浮动选择(SFFS)和顺序后向浮动选择(SBFS)。从 RF 和 CART 获得的特征重要性被用作嵌入式方法。RF 与 SFFS 的性能最佳(mmce=0.12 和 AUC=0.92),且具有良好的可解释性,其中选择了三个与地下水污染区域有关的特征:i)根据其生产能力和 3km 缓冲区内向水排放的总氮量对工业和设施进行评级,ii)根据 5km 缓冲区的粪便产量对牲畜养殖场进行评级,iii)累积 NDVI 用于后最大值月份,用作植被生产力和作物产量的代理。

相似文献

1
Feature selection approaches for predictive modelling of groundwater nitrate pollution: An evaluation of filters, embedded and wrapper methods.特征选择方法在地下水硝酸盐污染预测模型中的应用:筛选法、嵌入法和包裹法的评估。
Sci Total Environ. 2018 May 15;624:661-672. doi: 10.1016/j.scitotenv.2017.12.152. Epub 2017 Dec 27.
2
Predictive modeling of groundwater nitrate pollution using Random Forest and multisource variables related to intrinsic and specific vulnerability: a case study in an agricultural setting (Southern Spain).利用随机森林和与内在和特定脆弱性相关的多源变量对地下水硝酸盐污染进行预测建模:以西班牙南部农业区为例。
Sci Total Environ. 2014 Apr 1;476-477:189-206. doi: 10.1016/j.scitotenv.2014.01.001. Epub 2014 Jan 24.
3
Machine Learning and Feature Selection for soil spectroscopy. An evaluation of Random Forest wrappers to predict soil organic matter, clay, and carbonates.用于土壤光谱学的机器学习与特征选择。对随机森林包装器预测土壤有机质、黏土和碳酸盐的评估。
Heliyon. 2024 Apr 25;10(9):e30228. doi: 10.1016/j.heliyon.2024.e30228. eCollection 2024 May 15.
4
Two-stage hybrid feature selection algorithms for diagnosing erythemato-squamous diseases.两阶段混合特征选择算法在红斑鳞屑性疾病诊断中的应用。
Health Inf Sci Syst. 2013 May 30;1:10. doi: 10.1186/2047-2501-1-10. eCollection 2013.
5
Predicting uncertainty of machine learning models for modelling nitrate pollution of groundwater using quantile regression and UNEEC methods.使用分位数回归和UNEEC方法预测用于模拟地下水硝酸盐污染的机器学习模型的不确定性。
Sci Total Environ. 2019 Oct 20;688:855-866. doi: 10.1016/j.scitotenv.2019.06.320. Epub 2019 Jun 21.
6
A machine learning framework for spatio-temporal vulnerability mapping of groundwaters to nitrate in a data scarce region in Lenjanat Plain, Iran.一个针对伊朗 Lenjanat 平原数据匮乏地区地下水硝酸盐时空脆弱性的机器学习框架。
Environ Sci Pollut Res Int. 2024 Jun;31(29):42088-42110. doi: 10.1007/s11356-024-33920-8. Epub 2024 Jun 11.
7
Large scale prediction of groundwater nitrate concentrations from spatial data using machine learning.利用机器学习从空间数据大规模预测地下水中硝酸盐浓度
Sci Total Environ. 2019 Jun 10;668:1317-1327. doi: 10.1016/j.scitotenv.2019.03.045. Epub 2019 Mar 6.
8
A novel machine learning-based approach for the risk assessment of nitrate groundwater contamination.一种基于新型机器学习的地下水硝酸盐污染风险评估方法。
Sci Total Environ. 2018 Dec 10;644:954-962. doi: 10.1016/j.scitotenv.2018.07.054. Epub 2018 Jul 11.
9
A novel feature selection approach for biomedical data classification.一种用于生物医学数据分类的新特征选择方法。
J Biomed Inform. 2010 Feb;43(1):15-23. doi: 10.1016/j.jbi.2009.07.008. Epub 2009 Jul 30.
10
A random forest classifier for lymph diseases.用于淋巴疾病的随机森林分类器。
Comput Methods Programs Biomed. 2014 Feb;113(2):465-73. doi: 10.1016/j.cmpb.2013.11.004. Epub 2013 Nov 14.

引用本文的文献

1
Cervical Cancer Detection Using Deep Neural Network and Hybrid Waterwheel Plant Optimization Algorithm.基于深度神经网络和混合水车植物优化算法的宫颈癌检测
Bioengineering (Basel). 2025 Apr 30;12(5):478. doi: 10.3390/bioengineering12050478.
2
Enhancing DDoS detection in SDIoT through effective feature selection with SMOTE-ENN.通过使用 SMOTE-ENN 进行有效特征选择来增强 SDIoT 中的 DDoS 检测。
PLoS One. 2024 Oct 17;19(10):e0309682. doi: 10.1371/journal.pone.0309682. eCollection 2024.
3
Machine Learning and Feature Selection for soil spectroscopy. An evaluation of Random Forest wrappers to predict soil organic matter, clay, and carbonates.
用于土壤光谱学的机器学习与特征选择。对随机森林包装器预测土壤有机质、黏土和碳酸盐的评估。
Heliyon. 2024 Apr 25;10(9):e30228. doi: 10.1016/j.heliyon.2024.e30228. eCollection 2024 May 15.
4
Mapping specific groundwater nitrate concentrations from spatial data using machine learning: A case study of chongqing, China.利用机器学习从空间数据映射特定的地下水硝酸盐浓度:以中国重庆为例
Heliyon. 2024 Mar 13;10(6):e27867. doi: 10.1016/j.heliyon.2024.e27867. eCollection 2024 Mar 30.
5
An intelligent decision support system for groundwater supply management and electromechanical infrastructure controls.一种用于地下水供应管理和机电基础设施控制的智能决策支持系统。
Heliyon. 2024 Jan 20;10(3):e25036. doi: 10.1016/j.heliyon.2024.e25036. eCollection 2024 Feb 15.
6
Estimation of Shape Error with Monitoring Signals.利用监测信号估计形状误差
Sensors (Basel). 2023 Nov 26;23(23):9416. doi: 10.3390/s23239416.
7
Automatic Variable Selection Algorithms in Prognostic Factor Research in Neck Pain.颈部疼痛预后因素研究中的自动变量选择算法
J Clin Med. 2023 Sep 27;12(19):6232. doi: 10.3390/jcm12196232.
8
Unraveling the mechanisms underlying drug-induced cholestatic liver injury: identifying key genes using machine learning techniques on human in vitro data sets.阐明药物性胆汁淤积性肝损伤的机制:使用人类体外数据集上的机器学习技术鉴定关键基因。
Arch Toxicol. 2023 Nov;97(11):2969-2981. doi: 10.1007/s00204-023-03583-4. Epub 2023 Aug 21.
9
Data quantity governance for machine learning in materials science.材料科学中机器学习的数据量治理
Natl Sci Rev. 2023 May 1;10(7):nwad125. doi: 10.1093/nsr/nwad125. eCollection 2023 Jul.
10
ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species.ATTIC 是一种综合方法,可用于预测三个物种中的 A 到 I RNA 编辑位点。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad170.