Alavianmehr M A, Helfroush M S, Danyali H, Tashk A
Department of Electrical Engineering, Shiraz University of Technology, Shiraz, Iran.
Maersk Mc-Kinney Møller Institute (MMMI), University of Southern Denmark (SDU), Odense, Denmark.
J Real Time Image Process. 2023;20(1):9. doi: 10.1007/s11554-023-01273-z. Epub 2023 Feb 2.
The detection of multi-scale pedestrians is one of the challenging tasks in pedestrian detection applications. Moreover, the task of small-scale pedestrian detection, i.e., accurate localization of pedestrians as low-scale target objects, can help solve the issue of occluded pedestrian detection as well. In this paper, we present a fully convolutional neural network with a new architecture and an innovative, fully detailed supervision for semantic segmentation of pedestrians. The proposed network has been named butterfly network (BF-Net) because of its architecture analogous to a butterfly. The proposed BF-Net preserves the ability of simplicity so that it can process static images with a real-time image processing rate. The sub-path blocks embedded in the architecture of the proposed BF-Net provides a higher accuracy for detecting multi-scale objective targets including the small ones. The other advantage of the proposed architecture is replacing common batch normalization with conditional one. In conclusion, the experimental results of the proposed method demonstrate that the proposed network outperform the other state-of-the-art networks such as U-Net + + , U-Net3 + , Mask-RCNN, and Deeplabv3 + for the semantic segmentation of the pedestrians.
多尺度行人检测是行人检测应用中的挑战性任务之一。此外,小尺度行人检测任务,即将行人作为低尺度目标物体进行精确定位,也有助于解决遮挡行人检测的问题。在本文中,我们提出了一种具有新架构和创新的、针对行人语义分割的全详细监督的全卷积神经网络。由于其架构类似于蝴蝶,所提出的网络被命名为蝴蝶网络(BF-Net)。所提出的BF-Net保留了简单性,使其能够以实时图像处理速率处理静态图像。嵌入在所提出的BF-Net架构中的子路径块为检测包括小目标在内的多尺度目标提供了更高的准确性。所提出架构的另一个优点是用条件归一化取代了普通的批归一化。总之,所提出方法的实验结果表明,在所进行的行人语义分割中,所提出的网络优于其他先进网络,如U-Net++、U-Net3+、Mask-RCNN和Deeplabv3+ 。