• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于卫星数据的中国历史 PM 浓度集合机器学习模型预测

An Ensemble Machine-Learning Model To Predict Historical PM Concentrations in China from Satellite Data.

出版信息

Environ Sci Technol. 2018 Nov 20;52(22):13260-13269. doi: 10.1021/acs.est.8b02917. Epub 2018 Nov 1.

DOI:10.1021/acs.est.8b02917
PMID:30354085
Abstract

The long satellite aerosol data record enables assessments of historical PM level in regions where routine PM monitoring began only recently. However, most previous models reported decreased prediction accuracy when predicting PM levels outside the model-training period. In this study, we proposed an ensemble machine learning approach that provided reliable PM hindcast capabilities. The missing satellite data were first filled by multiple imputation. Then the modeling domain, China, was divided into seven regions using a spatial clustering method to control for unobserved spatial heterogeneity. A set of machine learning models including random forest, generalized additive model, and extreme gradient boosting were trained in each region separately. Finally, a generalized additive ensemble model was developed to combine predictions from different algorithms. The ensemble prediction characterized the spatiotemporal distribution of daily PM well with the cross-validation (CV) R (RMSE) of 0.79 (21 μg/m). The cluster-based subregion models outperformed national models and improved the CV R by ∼0.05. Compared with previous studies, our model provided more accurate out-of-range predictions at the daily level ( R = 0.58, RMSE = 29 μg/m) and monthly level ( R = 0.76, RMSE = 16 μg/m). Our hindcast modeling system allows for the construction of unbiased historical PM levels.

摘要

长卫星气溶胶数据记录使人们能够评估最近才开始进行常规 PM 监测的地区的历史 PM 水平。然而,大多数先前的模型在预测模型训练期之外的 PM 水平时报告了预测精度降低。在这项研究中,我们提出了一种集成机器学习方法,该方法提供了可靠的 PM 回溯能力。首先通过多次插补来填补缺失的卫星数据。然后,使用空间聚类方法将建模区域中国划分为七个区域,以控制未观察到的空间异质性。在每个区域中分别训练了一组包括随机森林、广义加性模型和极端梯度增强在内的机器学习模型。最后,开发了一个广义加性集成模型来组合来自不同算法的预测。该集成预测很好地描述了每日 PM 的时空分布,其交叉验证(CV)R(RMSE)为 0.79(21μg/m)。基于聚类的子区域模型优于全国模型,并将 CV R 提高了约 0.05。与先前的研究相比,我们的模型在每日水平(R=0.58,RMSE=29μg/m)和每月水平(R=0.76,RMSE=16μg/m)上提供了更准确的超范围预测。我们的回溯建模系统允许构建无偏的历史 PM 水平。

相似文献

1
An Ensemble Machine-Learning Model To Predict Historical PM Concentrations in China from Satellite Data.基于卫星数据的中国历史 PM 浓度集合机器学习模型预测
Environ Sci Technol. 2018 Nov 20;52(22):13260-13269. doi: 10.1021/acs.est.8b02917. Epub 2018 Nov 1.
2
Spatiotemporal continuous estimates of PM concentrations in China, 2000-2016: A machine learning method with inputs from satellites, chemical transport model, and ground observations.2000-2016 年中国 PM 浓度的时空连续估算:卫星、化学输送模型和地面观测输入的机器学习方法。
Environ Int. 2019 Feb;123:345-357. doi: 10.1016/j.envint.2018.11.075. Epub 2018 Dec 18.
3
A land use regression model using machine learning and locally developed low cost particulate matter sensors in Uganda.乌干达使用机器学习和本地开发的低成本颗粒物传感器的土地利用回归模型。
Environ Res. 2021 Aug;199:111352. doi: 10.1016/j.envres.2021.111352. Epub 2021 May 24.
4
A machine learning method to estimate PM concentrations across China with remote sensing, meteorological and land use information.一种利用遥感、气象和土地利用信息估算中国 PM 浓度的机器学习方法。
Sci Total Environ. 2018 Sep 15;636:52-60. doi: 10.1016/j.scitotenv.2018.04.251. Epub 2018 Apr 25.
5
Spatiotemporally continuous estimates of daily 1-km PM concentrations and their long-term exposure in China from 2000 to 2020.2000 年至 2020 年中国每日 1 公里 PM 浓度及其长期暴露的时空连续估计。
J Environ Manage. 2023 Sep 15;342:118145. doi: 10.1016/j.jenvman.2023.118145. Epub 2023 May 19.
6
Construction of a virtual PM observation network in China based on high-density surface meteorological observations using the Extreme Gradient Boosting model.基于极端梯度提升模型利用高密度地面气象观测资料构建中国虚拟 PM 观测网络。
Environ Int. 2020 Aug;141:105801. doi: 10.1016/j.envint.2020.105801. Epub 2020 May 29.
7
Predicting monthly high-resolution PM concentrations with random forest model in the North China Plain.利用随机森林模型预测华北平原逐月高分辨率 PM 浓度。
Environ Pollut. 2018 Nov;242(Pt A):675-683. doi: 10.1016/j.envpol.2018.07.016. Epub 2018 Jul 11.
8
Evaluation of machine learning techniques with multiple remote sensing datasets in estimating monthly concentrations of ground-level PM.利用多种遥感数据集评估机器学习技术估算地面 PM 月浓度
Environ Pollut. 2018 Nov;242(Pt B):1417-1426. doi: 10.1016/j.envpol.2018.08.029. Epub 2018 Aug 11.
9
Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.美国东部地区遥感气溶胶光学厚度与PM2.5之间关系的评估及统计建模
Res Rep Health Eff Inst. 2012 May(167):5-83; discussion 85-91.
10
An ensemble-based model of PM concentration across the contiguous United States with high spatiotemporal resolution.一种具有高时空分辨率的美国大陆地区 PM 浓度的集合模型。
Environ Int. 2019 Sep;130:104909. doi: 10.1016/j.envint.2019.104909. Epub 2019 Jul 1.

引用本文的文献

1
Filling gaps in PM2.5 time series: A broad evaluation from statistical to advanced neural network models.填补细颗粒物(PM2.5)时间序列中的空白:从统计模型到先进神经网络模型的全面评估
PLoS One. 2025 Aug 14;20(8):e0330211. doi: 10.1371/journal.pone.0330211. eCollection 2025.
2
Understanding ozone variability in spatial responses to emissions and meteorology in China using interpretable machine learning.利用可解释机器学习理解中国臭氧在对排放和气象的空间响应中的变异性。
iScience. 2025 Jun 28;28(8):113036. doi: 10.1016/j.isci.2025.113036. eCollection 2025 Aug 15.
3
Using ML techniques to predict extubation outcomes for patients with central nervous system injuries in the Yun-Gui Plateau.
运用机器学习技术预测云贵高原中枢神经系统损伤患者的拔管结局。
Sci Rep. 2025 May 22;15(1):17773. doi: 10.1038/s41598-025-98861-9.
4
Application of multi-angle spaceborne observations in characterizing the long-term particulate organic carbon pollution in China.多角度星载观测在中国长期颗粒有机碳污染特征研究中的应用。
Sci Total Environ. 2025 Jan 1;958:177883. doi: 10.1016/j.scitotenv.2024.177883. Epub 2024 Dec 7.
5
National PM spatiotemporal model integrating intensive monitoring data and land use regression in a likelihood-based universal kriging framework in the United States: 2000-2019.在美国基于似然性的通用克里金框架下整合密集监测数据和土地利用回归的国家细颗粒物时空模型:2000 - 2019年
Environ Pollut. 2025 Feb 1;366:125405. doi: 10.1016/j.envpol.2024.125405. Epub 2024 Nov 28.
6
Methods for Quantifying Source-Specific Air Pollution Exposure to Serve Epidemiology, Risk Assessment, and Environmental Justice.量化特定来源空气污染暴露以服务于流行病学、风险评估和环境正义的方法。
Geohealth. 2024 Nov 5;8(11):e2024GH001188. doi: 10.1029/2024GH001188. eCollection 2024 Nov.
7
Enhancing Global Estimation of Fine Particulate Matter Concentrations by Including Geophysical Information in Deep Learning.通过在深度学习中纳入地球物理信息来加强全球细颗粒物浓度估计
ACS EST Air. 2024 Mar 27;1(5):332-345. doi: 10.1021/acsestair.3c00054. eCollection 2024 May 10.
8
Accountability analysis of health benefits related to National Action Plan on Air Pollution Prevention and Control in China.中国《大气污染防治行动计划》健康效益的责任分析
PNAS Nexus. 2024 Apr 30;3(4):pgae142. doi: 10.1093/pnasnexus/pgae142. eCollection 2024 Apr.
9
Nationwide estimation of daily ambient PM from 2008 to 2020 at 1 km in India using an ensemble approach.2008年至2020年期间,采用综合方法对印度1公里分辨率下的每日环境细颗粒物进行全国范围估算。
PNAS Nexus. 2024 Feb 27;3(3):pgae088. doi: 10.1093/pnasnexus/pgae088. eCollection 2024 Mar.
10
Remote Sensing Data as a Tool for Studying Environmental Aspects of Parkinson's Disease.遥感数据作为研究帕金森病环境因素的工具
J Parkinsons Dis. 2024;14(3):619-622. doi: 10.3233/JPD-230382.