Department of Physical Geography, Faculty of Geosciences, Utrecht University, Utrecht, the Netherlands.
Department of Physical Geography, Faculty of Geosciences, Utrecht University, Utrecht, the Netherlands.
Environ Int. 2020 Sep;142:105856. doi: 10.1016/j.envint.2020.105856. Epub 2020 Jun 25.
In countries where air pollution stations are unavailable or scarce, station measurements from other countries and atmospheric remote sensing could jointly provide information to estimate ambient air quality at a sufficiently fine resolution to study the relationship between air pollution exposure and health. Predicting NO concentration globally with sufficient spatial and temporal resolution and accuracy for health studies is, however, not a trivial task. Challenges are data deficiency, in terms of NO measurements and NO predictors, and the development of a statistical model that can typify the regional and continental differences, such as traffic regulations, energy sources, and local weather.
We investigated the feasibility of mapping daytime and nighttime NO globally at a high spatial resolution (25 m), by including TROPOMI (TROPOspheric Monitoring Instrument) data and comparing various statistical learning techniques.
We separated daytime (7:00 am - 9:59 pm) and nighttime (10:00 pm - 6:59 am) based on the local times. To study if one should build models for each country separately, national models in 4 selected countries (the US, China, Germany, Spain) were developed. We build the models for 2017 and used 3636 stations. Seven statistical learning techniques were applied and the impact of the predictors, model fitting, and predicting accuracy was compared between different techniques, national models, national and global models, and models with and without including the NO vertical column density retrieved from TROPOMI.
The ensemble tree-based methods obtained higher accuracy compared to the linear regression-based methods in national and global models. The global tree-based methods obtained similar accuracy to national models. Different spatial prediction patterns are observed even when the prediction accuracy is very similar. Separating between day and night can be important for more accurate air pollution exposure assessment. The TROPOMI variable is ranked as one of the most important variables in the statistical learning techniques but adding it to global models that contain other precedent remote sensing products does not improve the prediction accuracy.
在空气污染监测站缺乏或稀少的国家,可以利用其他国家的监测站数据和大气遥感数据共同提供信息,以足够精细的分辨率估算环境空气质量,从而研究空气污染暴露与健康之间的关系。然而,要在全球范围内以足够的时空分辨率和准确性预测 NO 浓度,以支持健康研究,并非易事。挑战在于缺乏 NO 测量数据和预测因子,以及开发能够体现区域和大陆差异(如交通规则、能源和当地天气)的统计模型。
我们通过纳入 TROPOMI(对流层监测仪器)数据并比较各种统计学习技术,研究是否可以在高空间分辨率(25 米)下绘制全球范围内白天和夜间的 NO 地图。
我们根据当地时间将白天(7:00 am-9:59 pm)和夜间(10:00 pm-6:59 am)分开。为了研究是否应该分别为每个国家构建模型,我们在 4 个选定的国家(美国、中国、德国和西班牙)中构建了国家模型。我们针对 2017 年的数据进行了模型构建,使用了 3636 个站点。应用了 7 种统计学习技术,并比较了不同技术、国家模型、国家和全球模型以及是否包含 TROPOMI 反演的 NO 垂直柱密度之间的预测因子、模型拟合和预测准确性的影响。
与基于线性回归的方法相比,基于集合树的方法在国家和全球模型中具有更高的准确性。全球基于树的方法获得的预测准确性与国家模型相似。即使预测精度非常相似,也会观察到不同的空间预测模式。白天和夜间的区分对于更准确的空气污染暴露评估可能很重要。TROPOMI 变量被列为统计学习技术中最重要的变量之一,但将其添加到包含其他先前遥感产品的全球模型中并不能提高预测精度。