Chen Yunfan, Shin Hyunchul
J Opt Soc Am A Opt Image Sci Vis. 2020 May 1;37(5):768-779. doi: 10.1364/JOSAA.386410.
Recent research has demonstrated that effective fusion of multispectral images (visible and thermal images) enables robust pedestrian detection under various illumination conditions (e.g., daytime and nighttime). However, there are some open problems such as poor performance in small-sized pedestrian detection and high computational cost of multispectral information fusion. This paper proposes a multilayer fused deconvolutional single-shot detector that contains a two-stream convolutional module (TCM) and a multilayer fused deconvolutional module (MFDM). The TCM is used to extract convolutional features from multispectral input images. Then fusion blocks are incorporated into the MFDM to combine high-level features with rich semantic information and low-level features with detailed information to generate features with strong a representational power for small pedestrian instances. In addition, we fuse multispectral information at multiple deconvolutional layers in the MFDM via fusion blocks. This multilayer fusion strategy adaptively makes the most use of visible and thermal information. In addition, using fusion blocks for multilayer fusion can reduce the extra computational cost and redundant parameters. Empirical experiments show that the proposed approach achieves an 81.82% average precision (AP) on a new small-sized multispectral pedestrian dataset. The proposed method achieves the best performance on two well-known public multispectral datasets. On the KAIST multispectral pedestrian benchmark, for example, our method achieves a 97.36% AP and a 20 fps detection speed, which outperforms the state-of-the-art published method by 6.82% in AP and is three times faster in its detection speed.
最近的研究表明,多光谱图像(可见光和热图像)的有效融合能够在各种光照条件下(如白天和夜间)实现强大的行人检测。然而,仍存在一些未解决的问题,比如在小尺寸行人检测中性能不佳以及多光谱信息融合的计算成本较高。本文提出了一种多层融合反卷积单阶段检测器,它包含一个双流卷积模块(TCM)和一个多层融合反卷积模块(MFDM)。TCM用于从多光谱输入图像中提取卷积特征。然后将融合块集成到MFDM中,将具有丰富语义信息的高级特征与具有详细信息的低级特征相结合,以生成对小行人实例具有强大表示能力的特征。此外,我们通过融合块在MFDM的多个反卷积层融合多光谱信息。这种多层融合策略能够自适应地充分利用可见光和热信息。此外,使用融合块进行多层融合可以降低额外的计算成本和冗余参数。实证实验表明,所提出的方法在一个新的小尺寸多光谱行人数据集上实现了81.82%的平均精度(AP)。所提出的方法在两个著名的公共多光谱数据集上取得了最佳性能。例如,在KAIST多光谱行人基准测试中,我们的方法实现了97.36%的AP和20帧/秒的检测速度,在AP方面比已发表的最先进方法高出6.82%,并且检测速度快三倍。