Lin Qifeng, Zhao Jianhui, Fu Gang, Yuan Zhiyong
IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):416-429. doi: 10.1109/TNNLS.2020.3027924. Epub 2022 Jan 5.
Limited by the GPU memory, the current mainstream detectors fail to directly apply to large-scale remote sensing images for object detection. Moreover, the scale range of objects in remote sensing images is much wider than that of general images, which also greatly hinders the existing methods to effectively detect geospatial objects of various scales. For achieving high-performance object detection on large-scale remote sensing images, this article proposes a much faster and more accurate detecting framework, called cropping region proposal network-based scale folding network (CRPN-SFNet). In our framework, the CRPN includes a weak semantic RPN for quickly locating interesting regions and a strategy of generating cropping regions to effectively filter out meaningless regions, which can greatly reduce the computation and storage burden. Meanwhile, the proposed SFNet leverages the scale folding-based training and testing methods to extend the valid detection range of existing detectors, which is beneficial for detecting remote sensing objects of various scales, including very small and very large geospatial objects. Extensive experiments on the public Dataset for Object deTection in Aerial images data set indicate that our CRPN can help our detector deal the larger image faster with the limited GPU memory; meanwhile, the SFNet is beneficial to achieve more accurate detection of geospatial objects with wide-scale range. For large-scale remote sensing images, the proposed detection framework outperforms the state-of-the-art object detection methods in terms of accuracy and speed.
受GPU内存限制,当前主流检测器无法直接应用于大规模遥感图像的目标检测。此外,遥感图像中目标的尺度范围比一般图像宽得多,这也极大地阻碍了现有方法有效检测各种尺度的地理空间目标。为了在大规模遥感图像上实现高性能目标检测,本文提出了一种更快、更准确的检测框架,称为基于裁剪区域提议网络的尺度折叠网络(CRPN-SFNet)。在我们的框架中,CRPN包括一个用于快速定位感兴趣区域的弱语义RPN和一种生成裁剪区域的策略,以有效滤除无意义区域,这可以大大减轻计算和存储负担。同时,所提出的SFNet利用基于尺度折叠的训练和测试方法来扩展现有检测器的有效检测范围,这有利于检测各种尺度的遥感目标,包括非常小和非常大的地理空间目标。在公开的航空图像目标检测数据集上进行的大量实验表明,我们的CRPN可以帮助我们的检测器在有限的GPU内存下更快地处理更大的图像;同时,SFNet有利于在宽尺度范围内更准确地检测地理空间目标。对于大规模遥感图像,所提出的检测框架在准确性和速度方面优于当前最先进的目标检测方法。