Huber Florian, Inderka Alvin, Steinhage Volker
Department of Computer Science IV, University of Bonn, 53121 Bonn, Germany.
Sensors (Basel). 2024 Jan 24;24(3):770. doi: 10.3390/s24030770.
Remote sensing data represent one of the most important sources for automized yield prediction. High temporal and spatial resolution, historical record availability, reliability, and low cost are key factors in predicting yields around the world. Yield prediction as a machine learning task is challenging, as reliable ground truth data are difficult to obtain, especially since new data points can only be acquired once a year during harvest. Factors that influence annual yields are plentiful, and data acquisition can be expensive, as crop-related data often need to be captured by experts or specialized sensors. A solution to both problems can be provided by deep transfer learning based on remote sensing data. Satellite images are free of charge, and transfer learning allows recognition of yield-related patterns within countries where data are plentiful and transfers the knowledge to other domains, thus limiting the number of ground truth observations needed. Within this study, we examine the use of transfer learning for yield prediction, where the data preprocessing towards histograms is unique. We present a deep transfer learning framework for yield prediction and demonstrate its successful application to transfer knowledge gained from US soybean yield prediction to soybean yield prediction within Argentina. We perform a temporal alignment of the two domains and improve transfer learning by applying several transfer learning techniques, such as L2-SP, BSS, and layer freezing, to overcome catastrophic forgetting and negative transfer problems. Lastly, we exploit spatio-temporal patterns within the data by applying a Gaussian process. We are able to improve the performance of soybean yield prediction in Argentina by a total of 19% in terms of RMSE and 39% in terms of R2 compared to predictions without transfer learning and Gaussian processes. This proof of concept for advanced transfer learning techniques for yield prediction and remote sensing data in the form of histograms can enable successful yield prediction, especially in emerging and developing countries, where reliable data are usually limited.
遥感数据是自动产量预测最重要的数据源之一。高时间和空间分辨率、历史记录可用性、可靠性以及低成本是全球产量预测的关键因素。产量预测作为一项机器学习任务具有挑战性,因为难以获得可靠的地面真值数据,特别是由于新的数据点每年只能在收获期间获取一次。影响年产量的因素众多,且数据采集成本高昂,因为作物相关数据通常需要由专家或专用传感器来获取。基于遥感数据的深度迁移学习可以为这两个问题提供解决方案。卫星图像是免费的,迁移学习能够在数据丰富的国家识别与产量相关的模式,并将知识转移到其他地区,从而减少所需的地面真值观测数量。在本研究中,我们研究了迁移学习在产量预测中的应用,其中针对直方图的数据预处理是独特的。我们提出了一个用于产量预测的深度迁移学习框架,并展示了其成功应用,即将从美国大豆产量预测中获得的知识转移到阿根廷的大豆产量预测中。我们对两个地区进行了时间对齐,并通过应用几种迁移学习技术(如L2-SP、BSS和层冻结)来改进迁移学习,以克服灾难性遗忘和负迁移问题。最后,我们通过应用高斯过程来挖掘数据中的时空模式。与未使用迁移学习和高斯过程的预测相比,我们能够将阿根廷大豆产量预测的RMSE性能提高19%,R2性能提高39%。这种以直方图形式呈现的用于产量预测和遥感数据的先进迁移学习技术的概念验证,能够实现成功的产量预测,特别是在可靠数据通常有限的新兴和发展中国家。