Das Priyanko, Zhang Zhenke, Ghosh Suravi, Hang Ren
Institute of African Studies, School of Geography and Ocean Sciences, Nanjing University, Nanjing, China.
Institute of Atmospheric Physics, University of Chinese Academy of Sciences, Beijing, China.
Sci Rep. 2024 Jun 15;14(1):13870. doi: 10.1038/s41598-024-61520-6.
This study introduces a novel Hybrid Ensemble Machine-Learning (HEML) algorithm to merge long-term satellite-based reanalysis precipitation products (SRPPs), enabling the estimation of super drought events in the Lake Victoria Basin (LVB) during the period of 1984 to 2019. This study considers three widely used Machine learning (ML) models, including RF (Random Forest), GBM (Gradient Boosting Machine), and KNN (k-nearest Neighbors), for the emerging HEML approach. The three SRPPs, including CHIRPS (Climate Hazards Group Infra-Red Precipitation with Station), ERA5-Land, and PERSIANN-CDR (Precipitation Estimation from Remotely Sensed Information using Artificial Neural Network-Climate Data Record), were used to merge for developing new precipitation estimates from HEML model. Additionally, classification and regression models were employed as base learners in developing this algorithm. The newly developed HEML datasets were compared with other ML and SRPP products for super-drought monitoring. The Standardized precipitation evapotranspiration index (SPEI) was used to estimate super drought characteristics, including Drought frequency (DF), Drought Duration (DD), and Drought Intensity (DI) from machine learning and SRPPs products in LVB and compared with RG observation. The results revealed that the HEML algorithm shows excellent performance (CC = 0.93) compared to the single ML merging method and SRPPs against observation. Furthermore, the HEML merging product adeptly captures the spatiotemporal patterns of super drought characteristics during both training (1984-2009) and testing (2010-2019) periods. This research offers crucial insights for near-real-time drought monitoring, water resource management, and informed policy decisions.
本研究引入了一种新颖的混合集成机器学习(HEML)算法,用于合并基于卫星的长期再分析降水产品(SRPPs),从而能够估算1984年至2019年期间维多利亚湖盆地(LVB)的超级干旱事件。本研究针对新兴的HEML方法考虑了三种广泛使用的机器学习(ML)模型,包括随机森林(RF)、梯度提升机(GBM)和k近邻(KNN)。使用三种SRPPs,即气候灾害组红外降水与地面观测数据结合(CHIRPS)、ERA5-Land和人工神经网络气候数据记录遥感信息降水估计(PERSIANN-CDR),进行合并以开发来自HEML模型的新降水估计。此外,分类和回归模型被用作开发该算法的基础学习器。将新开发的HEML数据集与其他ML和SRPP产品进行比较,以进行超级干旱监测。标准化降水蒸发散指数(SPEI)用于估计超级干旱特征,包括干旱频率(DF)、干旱持续时间(DD)和干旱强度(DI),这些特征来自LVB的机器学习和SRPPs产品,并与地面观测进行比较。结果表明,与单一ML合并方法和SRPPs与观测值相比,HEML算法表现出优异的性能(相关系数CC = 0.93)。此外,HEML合并产品在训练期(1984 - 2009年)和测试期(2010 - 2019年)都能巧妙地捕捉超级干旱特征的时空模式。这项研究为近实时干旱监测、水资源管理和明智的政策决策提供了重要见解。