Agronomy College of Henan Agriculture University/State Key Laboratory of Wheat and Maize Crop Science, Zhengzhou 450046, Henan, China.
Heilongjiang Academy of Agricultural Sciences, Haerbin 150000, Heilongjiang, China.
Food Chem. 2024 Aug 1;448:139103. doi: 10.1016/j.foodchem.2024.139103. Epub 2024 Mar 22.
The protein content (PC) and wet gluten content (WGC) are crucial indicators determining the quality of wheat, playing a pivotal role in evaluating processing and baking performance. Original reflectance (OR), wavelet feature (WF), and color index (CI) were extracted from hyperspectral and RGB sensors. Combining Pearson-competitive adaptive reweighted sampling (CARs)-variance inflation factor (VIF) with four machine learning (ML) algorithms were used to model accuracy of PC and WGC. As a result, three CIs, six ORs, and twelve WFs were selected for PC and WGC datasets. For single-modal data, the back-propagation neural network exhibited superior accuracy, with estimation accuracies (WF > OR > CI). For multi-modal data, the random forest regression paired with OR + WF + CI showed the highest validation accuracy. Utilizing the Gini impurity, WF outweighed OR and CI in the PC and WGC models. The amalgamation of MLs with multimodal data harnessed the synergies among various remote sensing sources, substantially augmenting model precision and stability.
蛋白质含量(PC)和湿面筋含量(WGC)是决定小麦品质的关键指标,在评价加工和烘焙性能方面起着关键作用。从高光谱和 RGB 传感器中提取原始反射率(OR)、小波特征(WF)和颜色指数(CI)。将 Pearson 竞争自适应重加权抽样(CARs)-方差膨胀因子(VIF)与四种机器学习(ML)算法相结合,用于建立 PC 和 WGC 的模型精度。结果表明,三个 CI、六个 OR 和十二个 WF 被选为 PC 和 WGC 数据集。对于单模态数据,反向传播神经网络表现出较高的准确性,其估计准确性(WF>OR>CI)。对于多模态数据,随机森林回归与 OR+WF+CI 相结合显示出最高的验证准确性。利用基尼杂质,WF 在 PC 和 WGC 模型中优于 OR 和 CI。将 ML 与多模态数据相结合,可以利用各种遥感源之间的协同作用,显著提高模型的精度和稳定性。