Rogers Mitchell, Azhar Mihailo, Schenone Stefano, Thrush Simon, Xue Bing, Zhang Mengjie, Delmas Patrice
School of Computer Science, The University of Auckland, Auckland, New Zealand.
Institute of Marine Science, The University of Auckland, Auckland, New Zealand.
J R Soc N Z. 2024 Aug 21;55(6):1701-1731. doi: 10.1080/03036758.2024.2393297. eCollection 2025.
Assessing ecosystem health on a large scale is crucial for a wide range of management and regulatory decisions. Technologies such as hyperspectral imaging allow noninvasive and rapid estimation of key attributes based on observed reflectance. However, these images are high-dimensional and real-world applications require models based on fewer wavelengths. This paper proposes a new wavelength selection and feature extraction method for hyperspectral image analysis based on genetic programming to automatically select key wavelength regions and informative image features. A dataset of hyperspectral images of sediment in the field was collected and paired with ground-truth measurements of the sediment porosity and organic matter content. Two new program structures were proposed to construct feature extraction trees from either the mean reflectance spectra (spectra-based) or full hyperspectral images (image-based). SVR models were constructed to predict attributes based on the extracted features. Various regression models were used to predict the porosity and organic matter content. Full-wavelength models were constructed to reliably predict the organic matter content. The proposed spectra-based genetic programming solutions show competitive results compared to common wavelength selection methods, such as SPA, CARS, and RC. Finally, the best-evolved solution was applied to predict sediment organic matter content across all collected images.
在大规模评估生态系统健康状况对于广泛的管理和监管决策至关重要。高光谱成像等技术能够基于观测到的反射率对关键属性进行非侵入性快速估计。然而,这些图像具有高维度,而实际应用需要基于较少波长的模型。本文提出了一种基于遗传编程的用于高光谱图像分析的新波长选择和特征提取方法,以自动选择关键波长区域和信息丰富的图像特征。收集了一组野外沉积物高光谱图像数据集,并将其与沉积物孔隙率和有机质含量的地面实测数据配对。提出了两种新的程序结构,用于从平均反射光谱(基于光谱)或全高光谱图像(基于图像)构建特征提取树。构建了支持向量回归(SVR)模型,以基于提取的特征预测属性。使用了各种回归模型来预测孔隙率和有机质含量。构建了全波长模型以可靠地预测有机质含量。与诸如连续投影算法(SPA)、竞争性自适应重加权采样(CARS)和随机蛙跳算法(RC)等常见波长选择方法相比,所提出的基于光谱的遗传编程解决方案显示出具有竞争力的结果。最后,将最佳进化解决方案应用于预测所有收集图像中的沉积物有机质含量。