Department of Computer Science, Utah State University, Logan, UT 84322, USA.
Department of Mathematics and Statistics, Utah State University, Logan, UT 84322, USA.
Sensors (Basel). 2024 Oct 4;24(19):6433. doi: 10.3390/s24196433.
From June to October, 2022, we recorded the weight, the internal temperature, and the hive entrance video traffic of ten managed honey bee () colonies at a research apiary of the Carl Hayden Bee Research Center in Tucson, AZ, USA. The weight and temperature were recorded every five minutes around the clock. The 30 s videos were recorded every five minutes daily from 7:00 to 20:55. We curated the collected data into a dataset of 758,703 records (280,760-weight; 322,570-temperature; 155,373-video). A principal objective of Part I of our investigation was to use the curated dataset to investigate the discrete univariate time series forecasting of hive weight, in-hive temperature, and hive entrance traffic with shallow artificial, convolutional, and long short-term memory networks and to compare their predictive performance with traditional autoregressive integrated moving average models. We trained and tested all models with a 70/30 train/test split. We varied the intake and the predicted horizon of each model from 6 to 24 hourly means. Each artificial, convolutional, and long short-term memory network was trained for 500 epochs. We evaluated 24,840 trained models on the test data with the mean squared error. The autoregressive integrated moving average models performed on par with their machine learning counterparts, and all model types were able to predict falling, rising, and unchanging trends over all predicted horizons. We made the curated dataset public for replication.
从 2022 年 6 月到 10 月,我们在亚利桑那州图森市卡尔海登蜜蜂研究中心的一个研究养蜂场记录了 10 个管理蜜蜂()蜂群的体重、内部温度和蜂箱入口视频流量。体重和温度每五小时记录一次,全天 24 小时不间断。每天从 7:00 到 20:55 每五分钟记录 30 秒的视频。我们将收集到的数据整理成一个包含 758703 条记录的数据集(280760 条体重;322570 条温度;155373 条视频)。我们调查的第一部分的一个主要目标是使用整理后的数据集来调查蜂箱重量、蜂箱内温度和蜂箱入口流量的离散单变量时间序列预测,使用浅层人工、卷积和长短时记忆网络,并将其预测性能与传统自回归综合移动平均模型进行比较。我们使用 70/30 的训练/测试分割来训练和测试所有模型。我们将每个模型的摄入量和预测范围从 6 到 24 个小时平均值进行了变化。每个人工、卷积和长短时记忆网络都训练了 500 个时期。我们使用均方误差在测试数据上评估了 24840 个经过训练的模型。自回归综合移动平均模型与它们的机器学习对应模型表现相当,所有模型类型都能够预测所有预测范围内的下降、上升和不变趋势。我们公开了整理后的数据集供复制。