Pattern Recognition and Machine Learning Laboratory, Gachon University, Seongnam 13120, Korea.
Department of Fire Service Administration, Chodang University, Muan 58530, Korea.
Sensors (Basel). 2022 Nov 2;22(21):8424. doi: 10.3390/s22218424.
The potency of object detection techniques using Unmanned Aerial Vehicles (UAVs) is unprecedented due to their mobility. This potency has stimulated the use of UAVs with object detection functionality in numerous crucial real-life applications. Additionally, more efficient and accurate object detection techniques are being researched and developed for usage in UAV applications. However, object detection in UAVs presents challenges that are not common to general object detection. First, as UAVs fly at varying altitudes, the objects imaged via UAVs vary vastly in size, making the task at hand more challenging. Second due to the motion of the UAVs, there could be a presence of blur in the captured images. To deal with these challenges, we present a You Only Look Once v5 (YOLOv5)-like architecture with ConvMixers in its prediction heads and an additional prediction head to deal with minutely-small objects. The proposed architecture has been trained and tested on the VisDrone 2021 dataset, and the acquired results are comparable with the existing state-of-the-art methods.
由于其移动性,使用无人机 (UAV) 的目标检测技术的功效是前所未有的。这种功效刺激了具有目标检测功能的无人机在众多关键的现实生活应用中的使用。此外,人们正在研究和开发更高效、更准确的目标检测技术,以用于无人机应用。然而,无人机中的目标检测带来了一些在一般目标检测中不常见的挑战。首先,由于无人机在不同的高度飞行,通过无人机拍摄的物体的大小差异很大,这使得手头的任务更加具有挑战性。其次,由于无人机的运动,捕获的图像可能会出现模糊。为了应对这些挑战,我们提出了一种类似于 You Only Look Once v5 (YOLOv5) 的架构,在其预测头中使用 ConvMixers,并增加了一个预测头来处理微小物体。所提出的架构已经在 VisDrone 2021 数据集上进行了训练和测试,所获得的结果与现有的最先进方法相当。