Estefania-Salazar Enrique, Iglesias Eva
CEIGRAM and Department of Agricultural Economics, Statistics and Business, Universidad Politécnica de Madrid, Madrid, 28040, Spain.
Heliyon. 2024 Jul 16;10(14):e34711. doi: 10.1016/j.heliyon.2024.e34711. eCollection 2024 Jul 30.
The progressive evolution of the spatial and temporal resolutions of Earth observation satellites has brought multiple benefits to scientific research. The increasing volume of data with higher frequencies and spatial resolutions offers precise and timely information, making it an invaluable tool for environmental analysis and enhanced decision-making. However, this presents a formidable challenge for large-scale environmental analyses and socioeconomic applications based on spatial time series, often compelling researchers to resort to lower-resolution imagery, which can introduce uncertainty and impact results. In response to this, our key contribution is a novel machine learning approach for dense geospatial time series rooted in superpixel segmentation, which serves as a preliminary step in mitigating the high dimensionality of data in large-scale applications. This approach, while effectively reducing dimensionality, preserves valuable information to the maximum extent, thereby substantially enhancing data accuracy and subsequent environmental analyses. This method was empirically applied within the context of a comprehensive case study encompassing the 2002-2022 period with 8--frequency-normalized difference vegetation index data at 250-m resolution in an area spanning 43,470 km. The efficacy of this methodology was assessed through a comparative analysis, comparing our results with those derived from 1000-m-resolution satellite data and an existing superpixel algorithm for time series data. An evaluation of the time-series deviations revealed that using coarser-resolution pixels introduced an error that exceeded that of the proposed algorithm by 25 % and that the proposed methodology outperformed other algorithms by more than 9 %. Notably, this methodological innovation concurrently facilitates the aggregation of pixels sharing similar land-cover classifications, thus mitigating subpixel heterogeneity within the dataset. Further, the proposed methodology, which is used as a preprocessing step, improves the clustering of pixels according to their time series and can enhance large-scale environmental analyses across a wide range of applications.
地球观测卫星的空间和时间分辨率的不断进步给科学研究带来了诸多益处。频率和空间分辨率更高的数据量不断增加,提供了精确且及时的信息,使其成为环境分析和强化决策的宝贵工具。然而,这给基于空间时间序列的大规模环境分析和社会经济应用带来了巨大挑战,常常迫使研究人员采用分辨率较低的图像,这可能会引入不确定性并影响结果。对此,我们的关键贡献是一种基于超像素分割的针对密集地理空间时间序列的新型机器学习方法,它作为减轻大规模应用中数据高维度的初步步骤。这种方法在有效降低维度的同时,最大程度地保留了有价值的信息,从而大幅提高了数据准确性和后续的环境分析。该方法在一个涵盖2002 - 2022年期间、分辨率为250米的8个频率归一化植被指数数据、面积达43470平方千米的综合案例研究中得到了实证应用。通过比较分析评估了该方法的有效性,将我们的结果与从1000米分辨率卫星数据和现有的时间序列数据超像素算法得出的结果进行了比较。对时间序列偏差的评估表明,使用分辨率较粗的像素引入的误差比所提出算法超出25%,且所提出的方法比其他算法性能高出9%以上。值得注意的是,这种方法创新同时促进了具有相似土地覆盖分类的像素的聚合,从而减轻了数据集中的亚像素异质性。此外,作为预处理步骤的所提出方法,根据像素的时间序列改进了像素聚类,并可增强广泛应用中的大规模环境分析。