VSSA-NET：用于交通标志检测的垂直空间序列注意力网络

VSSA-NET: Vertical Spatial Sequence Attention Network for Traffic Sign Detection.

作者信息

Yuan Yuan, Xiong Zhitong, Wang Qi

出版信息

IEEE Trans Image Process. 2019 Jul;28(7):3423-3434. doi: 10.1109/TIP.2019.2896952. Epub 2019 Feb 1.

DOI:10.1109/TIP.2019.2896952

Abstract

Although traffic sign detection has been studied for years and great progress has been made with the rise of deep learning technique, there are still many problems remaining to be addressed. For complicated real-world traffic scenes, there are two main challenges. First, traffic signs are usually small-sized objects, which makes them more difficult to detect than large ones; second, it is hard to distinguish false targets which resemble real traffic signs in complex street scenes without context information. To handle these problems, we propose a novel end-to-end deep learning method for traffic sign detection in complex environments. Our contributions are as follows: 1) we propose a multi-resolution feature fusion network architecture which exploits densely connected deconvolution layers with skip connections, and can learn more effective features for a small-size object and 2) we frame the traffic sign detection as a spatial sequence classification and regression task, and propose a vertical spatial sequence attention module to gain more context information for better detection performance. To comprehensively evaluate the proposed method, we experiment on several traffic sign datasets as well as the general object detection dataset, and the results have shown the effectiveness of our proposed method.

摘要

尽管交通标志检测已经研究了多年，并且随着深度学习技术的兴起取得了很大进展，但仍有许多问题有待解决。对于复杂的现实世界交通场景，存在两个主要挑战。第一，交通标志通常是小尺寸物体，这使得它们比大尺寸物体更难检测；第二，在没有上下文信息的复杂街道场景中，很难区分类似于真实交通标志的虚假目标。为了解决这些问题，我们提出了一种新颖的端到端深度学习方法，用于复杂环境中的交通标志检测。我们的贡献如下：1）我们提出了一种多分辨率特征融合网络架构，该架构利用带有跳跃连接的密集连接反卷积层，并且可以为小尺寸物体学习更有效的特征；2）我们将交通标志检测框架为空间序列分类和回归任务，并提出了一个垂直空间序列注意力模块，以获取更多上下文信息以获得更好的检测性能。为了全面评估所提出的方法，我们在几个交通标志数据集以及通用目标检测数据集上进行了实验，结果表明了我们所提出方法的有效性。