Muhammad Saqib Sheikh, Mazhar Tehseen, Iqbal Muhammad, Almogren Ahmad, Shahazad Tariq, Rehman Ateeq Ur, Hamam Habib
Department of Computing and Information Technology, Gomal University, Dera Ismail Khan, 29220, Pakistan.
Department of Computer Science, School Education Department, Government of Punjab, Layyah 31200, Pakistan.
Heliyon. 2024 Oct 21;10(21):e39537. doi: 10.1016/j.heliyon.2024.e39537. eCollection 2024 Nov 15.
Detection of aerial activities, including drones and birds, has practical implications for automating bird surveys and developing radar systems for aerial object collision detection. Convolutional neural networks (CNNs) have been extensively utilized for image recognition and classification tasks, albeit prior research predominantly focuses on single-class 'drone' classification. However, a gap persists in achieving high accuracy for multi-class classification. To address the limitations of traditional CNNs, such as vanishing gradients and the necessity for numerous layers, this study introduces a novel model termed "MobVGG." This model combines the architectures of MobileNetV2 and VGG16 to accurately classify images as either 'bird' or 'drone'. The dataset comprises 4212 images for each category of 'bird' and 'drone'. The stringent methodology was applied for dataset preparation and model training to ensure the reliability of the findings. Comparative analysis with previous research demonstrates that the proposed MobVGG model, trained on both 'bird' and 'drone' images, achieves superior accuracy (96 %) compared to benchmark studies. Our paper targets researchers and graduate students as its primary audience.
检测包括无人机和鸟类在内的空中活动,对于实现鸟类调查自动化以及开发用于空中物体碰撞检测的雷达系统具有实际意义。卷积神经网络(CNN)已被广泛用于图像识别和分类任务,尽管先前的研究主要集中在单类“无人机”分类上。然而,在实现多类分类的高精度方面仍然存在差距。为了解决传统CNN的局限性,如梯度消失和需要大量层,本研究引入了一种名为“MobVGG”的新型模型。该模型结合了MobileNetV2和VGG16的架构,以准确地将图像分类为“鸟类”或“无人机”。数据集包括“鸟类”和“无人机”每个类别的4212张图像。采用严格的方法进行数据集准备和模型训练,以确保研究结果的可靠性。与先前研究的对比分析表明,在“鸟类”和“无人机”图像上训练的所提出的MobVGG模型,与基准研究相比,实现了更高的准确率(96%)。我们的论文主要面向研究人员和研究生。