Suppr超能文献

美国大陆私人水井中砷的机器学习模型作为人类健康研究中暴露评估的工具。

Machine Learning Models of Arsenic in Private Wells Throughout the Conterminous United States As a Tool for Exposure Assessment in Human Health Studies.

机构信息

New England Water Science Center, U.S. Geological Survey, 331 Commerce Way, Pembroke, New Hampshire 03275, United States.

Institute for Minority Health Research, University of Illinois at Chicago, 1819 W. Polk, Chicago, Illinois 60612, United States.

出版信息

Environ Sci Technol. 2021 Apr 20;55(8):5012-5023. doi: 10.1021/acs.est.0c05239. Epub 2021 Mar 17.

Abstract

Arsenic from geologic sources is widespread in groundwater within the United States (U.S.). In several areas, groundwater arsenic concentrations exceed the U.S. Environmental Protection Agency maximum contaminant level of 10 μg per liter (μg/L). However, this standard applies only to public-supply drinking water and not to private-supply, which is not federally regulated and is rarely monitored. As a result, arsenic exposure from private wells is a potentially substantial, but largely hidden, public health concern. Machine learning models using boosted regression trees (BRT) and random forest classification (RFC) techniques were developed to estimate probabilities and concentration ranges of arsenic in private wells throughout the conterminous U.S. Three BRT models were fit separately to estimate the probability of private well arsenic concentrations exceeding 1, 5, or 10 μg/L whereas the RFC model estimates the most probable category (≤5, >5 to ≤10, or >10 μg/L). Overall, the models perform best at identifying areas with low concentrations of arsenic in private wells. The BRT 10 μg/L model estimates for testing data have an overall accuracy of 91.2%, sensitivity of 33.9%, and specificity of 98.2%. Influential variables identified across all models included average annual precipitation and soil geochemistry. Models were developed in collaboration with public health experts to support U.S.-based studies focused on health effects from arsenic exposure.

摘要

美国(美国)地下水普遍存在地质来源的砷。在一些地区,地下水砷浓度超过了美国环境保护署规定的每升 10 微克(μg/L)的最大污染物水平。然而,该标准仅适用于公共供水饮用水,不适用于私人供水,私人供水不受联邦监管,很少受到监测。因此,私人水井中的砷暴露是一个潜在的、但在很大程度上被忽视的公共卫生问题。

使用提升回归树 (BRT) 和随机森林分类 (RFC) 技术的机器学习模型被开发用于估计整个美国大陆私人水井中砷的概率和浓度范围。三个 BRT 模型分别拟合以估计私人水井砷浓度超过 1、5 或 10μg/L 的概率,而 RFC 模型则估计最可能的类别(≤5、>5 至≤10 或>10μg/L)。总体而言,这些模型在识别私人水井中砷浓度较低的区域方面表现最佳。BRT 10μg/L 模型对测试数据的估计总体准确率为 91.2%,灵敏度为 33.9%,特异性为 98.2%。所有模型都确定了平均年降水量和土壤地球化学等具有影响力的变量。该模型是与公共卫生专家合作开发的,旨在支持以美国为基础的研究,重点关注砷暴露对健康的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae3/8852770/7d257eafa873/nihms-1774987-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验