Ahn Seoyeong, Kim Ayoung, Chung Yeonseung, Kang Cinoo, Kim Sooyoung, Kwon Dohoon, Park Jiwoo, Oh Jieun, Park Jinah, Moon Jeongmin, Song Insung, Min Jieun, Lee Hyung Joo, Kim Ho, Lee Whanhee
Department of Information Convergence Engineering, Pusan National University, Busan 46241, South Korea.
Department of Public Health Science, Graduate School of Public Health, Seoul National University, Seoul 08826, South Korea.
Environ Health (Wash). 2025 Apr 23;3(8):878-887. doi: 10.1021/envhealth.4c00201. eCollection 2025 Aug 15.
Several tudies developed machine learning-based PM prediction models; however, nationwide models addressing both mapping prediction and forecasting were limited. Further, although the prediction accuracy is different from PM-related health risk estimation, previous studies solely examined the prediction accuracy. This study suggests a method to assess the statistical properties of PM-health risk estimation, which also can be used as a model selection. We used three machine learning algorithms and an ensemble method to construct PM mapping prediction (1 km) and two-day forecasting models majorly using satellite-driven data in South Korea (2015-2022). We performed a simulation study to examine the statistical properties of short-term PM risk estimation using prediction models. Our ensemble spatial prediction model showed better performance than single algorithms (0.956 test ). The range of the values was 0.78-0.98 across the monitoring sites. The average % bias was from 1.403%-1.787% when our mapping models for PM-mortality risk estimation, compared to the estimates from monitored PM. The best of our forecasting models was 0.904. This study developed machine learning models for spatial PM predictions and forecasting in Korea. This study also suggested a method to address risk estimation and model selection concurrently when multiple prediction models were used.
多项研究开发了基于机器学习的颗粒物(PM)预测模型;然而,涵盖映射预测和预报的全国性模型却很有限。此外,尽管预测准确性与PM相关的健康风险估计不同,但先前的研究仅考察了预测准确性。本研究提出了一种评估PM健康风险估计统计特性的方法,该方法也可用于模型选择。我们使用三种机器学习算法和一种集成方法,主要利用韩国(2015 - 2022年)的卫星驱动数据构建了PM映射预测(1公里)和两天预报模型。我们进行了一项模拟研究,以检验使用预测模型进行短期PM风险估计的统计特性。我们的集成空间预测模型表现优于单一算法(测试值为0.956)。各监测点的 值范围为0.78 - 0.98。与监测到的PM估计值相比,我们用于PM死亡率风险估计的映射模型的平均偏差百分比为1.403% - 1.787%。我们预报模型的最佳 值为0.904。本研究为韩国的空间PM预测和预报开发了机器学习模型。本研究还提出了一种在使用多个预测模型时同时处理风险估计和模型选择的方法。