He Ping, Cheng Xianfeng, Wen Xingping, Yi Yan, Chen Zailin, Chen Yu
Faculty of Land Resources Engineering, Kunming University of Science and Technology, Kunming 650093, China.
School of Fine Art and Design, Kunming University, Kunming 650214, China.
Sensors (Basel). 2025 Jul 5;25(13):4209. doi: 10.3390/s25134209.
Accurate prediction of soil lead (Pb) content in small sample scenarios is often limited by data scarcity and variability in soil properties, with traditional spectral modeling methods yielding suboptimal precision. To address this, we propose a transfer learning-based framework integrated with SHAP analysis for predicting soil Pb content in the Gejiu mining area, Yunnan. Using pH data from the European LUCAS soil database as the source domain, spectral features were extracted via a 1D-ResNet model and transferred to the target domain (130 soil samples from Gejiu) for Pb prediction. SHAP analysis was applied to clarify the role of spectral characteristics in cross-component transfer learning, uncovering shared and adaptive features between pH and Pb predictions. The transfer learning model (ResNet-pH-Pb) significantly outperformed direct modeling methods (PLS-Pb, SVM-Pb, and ResNet-Pb), with an R of 0.77, demonstrating superior accuracy. SHAP analysis showed that the model retained key pH-related wavelengths (550-750 nm and 1600-1700 nm) while optimizing Pb-related wavelengths (e.g., 919 nm and 959 nm). This study offers a novel approach for soil heavy metal prediction under small sample constraints and provides a theoretical basis for understanding spectral prediction mechanisms through interpretability analysis.
在小样本情况下准确预测土壤铅(Pb)含量通常受到数据稀缺和土壤性质变异性的限制,传统光谱建模方法的精度欠佳。为解决这一问题,我们提出了一种基于迁移学习并集成SHAP分析的框架,用于预测云南个旧矿区的土壤Pb含量。以欧洲LUCAS土壤数据库中的pH数据作为源域,通过一维残差网络(1D-ResNet)模型提取光谱特征,并将其转移到目标域(来自个旧的130个土壤样本)进行Pb预测。应用SHAP分析来阐明光谱特征在跨分量迁移学习中的作用,揭示pH和Pb预测之间的共享特征和自适应特征。迁移学习模型(ResNet-pH-Pb)显著优于直接建模方法(PLS-Pb、SVM-Pb和ResNet-Pb),决定系数R为0.77,显示出卓越的准确性。SHAP分析表明,该模型在优化与Pb相关的波长(如919 nm和959 nm)的同时,保留了与pH相关的关键波长(550 - 750 nm和1600 - 1700 nm)。本研究为小样本约束下的土壤重金属预测提供了一种新方法,并通过可解释性分析为理解光谱预测机制提供了理论基础。