Tian Yan, Gelernter Judith, Wang Xun, Chen Weigang, Gao Junxiang, Zhang Yujie, Li Xiaolan
School of Computer and Information Engineering, Zhejiang Gongshang University, Hangzhou, P.R.China.
Information Technology Laboratory, National Institute of Standards and Technology, Pittsburgh, US.
Neurocomputing (Amst). 2018 Mar 6;280:46-55. doi: 10.1016/j.neucom.2017.09.098. Epub 2017 Nov 21.
Research on Faster R-CNN has recently witnessed the progress in both accuracy and execution efficiency in detecting objects such as faces, hands or pedestrians in photograph or video. However, constrained by the size of its convolution feature map output, it is unable to clearly detect small or tiny objects. Therefore, we presented a fast, deep convolutional neural network based on a modified Faster R-CNN. Multiple strategies, such as fast multi-level combination, context cues, and a new anchor generating method were employed for small object detection in this paper. We demonstrated performance of our algorithm both on the KITTI-ROAD dataset and our own traffic scene lane markings dataset. Experiments demonstrated that our algorithm obtained better accuracy than Faster R-CNN in small object detection.
最近,对快速R-CNN的研究在检测照片或视频中的面部、手部或行人等物体时,在准确性和执行效率方面都取得了进展。然而,由于其卷积特征图输出的大小限制,它无法清晰地检测小物体或微小物体。因此,我们提出了一种基于改进的快速R-CNN的快速深度卷积神经网络。本文采用了多种策略,如快速多级组合、上下文线索和一种新的锚点生成方法来进行小物体检测。我们在KITTI-ROAD数据集和我们自己的交通场景车道标记数据集上展示了我们算法的性能。实验表明,我们的算法在小物体检测中比快速R-CNN获得了更好的准确性。