Feng Zi-Heng, Wang Lu-Yuan, Yang Zhe-Qing, Zhang Yan-Yan, Li Xiao, Song Li, He Li, Duan Jian-Zhao, Feng Wei
State Key Laboratory of Wheat and Maize Crop Science, CIMMYT-China Wheat and Maize Joint Research Center, Henan Agricultural University, Zhengzhou, China.
Information and Management Science College, Henan Agricultural University, Zhengzhou, China.
Front Plant Sci. 2022 Mar 21;13:828454. doi: 10.3389/fpls.2022.828454. eCollection 2022.
Powdery mildew has a negative impact on wheat growth and restricts yield formation. Therefore, accurate monitoring of the disease is of great significance for the prevention and control of powdery mildew to protect world food security. The canopy spectral reflectance was obtained using a ground feature hyperspectrometer during the flowering and filling periods of wheat, and then the Savitzky-Golay method was used to smooth the measured spectral data, and as original reflectivity (OR). Firstly, the OR was spectrally transformed using the mean centralization (MC), multivariate scattering correction (MSC), and standard normal variate transform (SNV) methods. Secondly, the feature bands of above four transformed spectral data were extracted through a combination of the Competitive Adaptive Reweighted Sampling (CARS) and Successive Projections Algorithm (SPA) algorithms. Finally, partial least square regression (PLSR), support vector regression (SVR), and random forest regression (RFR) were used to construct an optimal monitoring model for wheat powdery mildew disease index (mean disease index, mDI). The results showed that after Pearson correlation, two-band optimization combinations and machine learning method modeling comparisons, the comprehensive performance of the MC spectrum data was the best, and it was a better method for pretreating disease spectrum data. The transformed spectral data combined with the CARS-SPA algorithm was able to extract the characteristic bands more effectively. The number of bands screened was more than the number of bands extracted by the OR data, and the band positions were more evenly distributed. In comparison of different machine learning modeling methods, the RFR model performed the best (coefficient of determination, = 0.741-0.852), while the SVR and PLSR models performed similarly ( = 0.733-0.836). Taken together, the estimation accuracy of spectral data transformation using the MC method combined with the RFR model (MC-RFR) was the highest, the model was 0.849-0.852, and the root mean square error (RMSE) and the mean absolute error (MAE) ranged from 2.084 to 2.177 and 1.684 to 1.777, respectively. Compared with the OR combined with the RFR model (OR-RFR), the increased by 14.39%, and the of RMSE and MAE decreased by 23.9 and 27.87%. Also, the monitoring accuracy of flowering stage is better than that of grain filling stage, which is due to the relative stability of canopy structure in flowering stage. It can be seen that without changing the shape of the spectral curve, and that the use of MC to preprocess spectral data, the use of CARS and SPA algorithms to extract characteristic bands, and the use of RFR modeling methods to enhance the synergy between multiple variables, and the established model (MC-CARS-SPA-RFR) can better extract the covariant relationship between the canopy spectrum and the disease, thereby improving the monitoring accuracy of wheat powdery mildew. The research results of this study provide ideas and methods for realizing high-precision remote sensing monitoring of crop disease status.
白粉病对小麦生长有负面影响,并限制产量形成。因此,准确监测该病害对于白粉病的防治以保障世界粮食安全具有重要意义。在小麦开花期和灌浆期使用地面特征高光谱仪获取冠层光谱反射率,然后采用Savitzky-Golay方法对测量的光谱数据进行平滑处理,作为原始反射率(OR)。首先,使用均值中心化(MC)、多元散射校正(MSC)和标准正态变量变换(SNV)方法对OR进行光谱变换。其次,通过竞争性自适应重加权采样(CARS)和连续投影算法(SPA)算法相结合的方式提取上述四种变换后光谱数据的特征波段。最后,使用偏最小二乘回归(PLSR)、支持向量回归(SVR)和随机森林回归(RFR)构建小麦白粉病病情指数(平均病情指数,mDI)的最优监测模型。结果表明,经过Pearson相关性分析、双波段优化组合和机器学习方法建模比较后,MC光谱数据的综合性能最佳,是一种较好的病害光谱数据预处理方法。变换后的光谱数据结合CARS-SPA算法能够更有效地提取特征波段。筛选出的波段数量多于OR数据提取的波段数量,且波段位置分布更均匀。在不同机器学习建模方法的比较中,RFR模型表现最佳(决定系数, = 0.741 - 0.852),而SVR和PLSR模型表现相近( = 0.733 - 0.836)。综合来看,使用MC方法结合RFR模型(MC-RFR)对光谱数据进行变换的估计精度最高,模型 = 0.