Zhong Junting, Zhang Xiaoye, Gui Ke, Wang Yaqiang, Che Huizheng, Shen Xiaojing, Zhang Lei, Zhang Yangmei, Sun Junying, Zhang Wenjie
State Key Laboratory of Severe Weather and Key Laboratory of Atmospheric Chemistry of China Meteorological Administration, Chinese Academy of Meteorological Sciences, Beijing 100081, China.
Natl Sci Rev. 2021 Jan 5;8(10):nwaa307. doi: 10.1093/nsr/nwaa307. eCollection 2021 Oct.
Retrieving historical fine particulate matter (PM) data is key for evaluating the long-term impacts of PM on the environment, human health and climate change. Satellite-based aerosol optical depth has been used to estimate PM, but estimations have largely been undermined by massive missing values, low sampling frequency and weak predictive capability. Here, using a novel feature engineering approach to incorporate spatial effects from meteorological data, we developed a robust LightGBM model that predicts PM at an unprecedented predictive capacity on hourly (R= 0.75), daily (R= 0.84), monthly (R= 0.88) and annual (R= 0.87) timescales. By taking advantage of spatial features, our model can also construct hourly gridded networks of PM. This capability would be further enhanced if meteorological observations from regional stations were incorporated. Our results show that this model has great potential in reconstructing historical PM datasets and real-time gridded networks at high spatial-temporal resolutions. The resulting datasets can be assimilated into models to produce long-term re-analysis that incorporates interactions between aerosols and physical processes.
获取历史细颗粒物(PM)数据是评估PM对环境、人类健康和气候变化长期影响的关键。基于卫星的气溶胶光学厚度已被用于估算PM,但由于大量缺失值、低采样频率和弱预测能力,估算结果在很大程度上受到了影响。在此,我们采用一种新颖的特征工程方法,将气象数据的空间效应纳入其中,开发了一个强大的LightGBM模型,该模型在小时(R = 0.75)、每日(R = 0.84)、每月(R = 0.88)和每年(R = 0.87)时间尺度上以前所未有的预测能力预测PM。通过利用空间特征,我们的模型还可以构建PM的每小时网格化网络。如果纳入区域站点的气象观测数据,这种能力将得到进一步增强。我们的结果表明,该模型在重建高时空分辨率的历史PM数据集和实时网格化网络方面具有巨大潜力。所得数据集可被同化到模型中,以产生包含气溶胶与物理过程相互作用的长期再分析结果。