Feng Qihan, Xu Xinzheng, Wang Zhixiao
College of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China.
Mine Digitization Engineering Research Center of the Ministry of Education, Xuzhou 221116, China.
Math Biosci Eng. 2023 Feb 2;20(4):6551-6590. doi: 10.3934/mbe.2023282.
Small object detection (SOD) is significant for many real-world applications, including criminal investigation, autonomous driving and remote sensing images. SOD has been one of the most challenging tasks in computer vision due to its low resolution and noise representation. With the development of deep learning, it has been introduced to boost the performance of SOD. In this paper, focusing on the difficulties of SOD, we analyze the deep learning-based SOD research papers from four perspectives, including boosting the resolution of input features, scale-aware training, incorporating contextual information and data augmentation. We also review the literature on crucial SOD tasks, including small face detection, small pedestrian detection and aerial image object detection. In addition, we conduct a thorough performance evaluation of generic SOD algorithms and methods for crucial SOD tasks on four well-known small object datasets. Our experimental results show that network configuring to boost the resolution of input features can enable significant performance gains on WIDER FACE and Tiny Person. Finally, several potential directions for future research in the area of SOD are provided.
小目标检测(SOD)对于许多实际应用都具有重要意义,包括刑事调查、自动驾驶和遥感图像。由于其低分辨率和噪声表示,SOD一直是计算机视觉中最具挑战性的任务之一。随着深度学习的发展,它被引入以提高SOD的性能。在本文中,针对SOD的难点,我们从四个角度分析了基于深度学习的SOD研究论文,包括提高输入特征的分辨率、尺度感知训练、融入上下文信息和数据增强。我们还回顾了关于关键SOD任务的文献,包括小面部检测、小行人检测和航空图像目标检测。此外,我们对通用SOD算法和关键SOD任务的方法在四个著名的小目标数据集上进行了全面的性能评估。我们的实验结果表明,配置网络以提高输入特征的分辨率可以在WIDER FACE和Tiny Person数据集上显著提高性能。最后,给出了SOD领域未来研究的几个潜在方向。