Center for Artificial Intelligence, Korea Institute of Science and Technology, Seoul 02792, Korea.
Division of Nano & Information Technology, KIST School, University of Science and Technology, Seoul 02792, Korea.
Sensors (Basel). 2021 Nov 2;21(21):7294. doi: 10.3390/s21217294.
Custom inspection using X-ray imaging is a very promising application of modern pattern recognition technology. However, the lack of data or renewal of tariff items makes the application of such technology difficult. In this paper, we present a data augmentation technique based on a new image-to-image translation method to deal with these difficulties. Unlike the conventional methods that convert a semantic label image into a realistic image, the proposed method takes a texture map with a special modification as an additional input of a generative adversarial network to reproduce domain-specific characteristics, such as background clutter or sensor-specific noise patterns. The proposed method was validated by applying it to backscatter X-ray (BSX) vehicle data augmentation. The Fréchet inception distance (FID) of the result indicates the visual quality of the translated image was significantly improved from the baseline when the texture parameters were used. Additionally, in terms of data augmentation, the experimental results of classification, segmentation, and detection show that the use of the translated image data, along with the real data consistently, improved the performance of the trained models. Our findings show that detailed depiction of the texture in translated images is crucial for data augmentation. Considering the comparatively few studies that have examined custom inspections of container scale goods, such as cars, we believe that this study will facilitate research on the automation of container screening, and the security of aviation and ports.
使用 X 射线成像进行定制检查是现代模式识别技术非常有前途的应用。然而,由于缺乏数据或关税项目的更新,使得此类技术的应用变得困难。在本文中,我们提出了一种基于新的图像到图像转换方法的数据增强技术,以解决这些困难。与传统方法将语义标签图像转换为逼真图像不同,所提出的方法将具有特殊修改的纹理图作为生成对抗网络的附加输入,以再现特定于域的特征,例如背景杂波或传感器特定的噪声模式。通过将其应用于反向散射 X 射线 (BSX) 车辆数据增强来验证所提出的方法。结果的 Fréchet inception 距离 (FID) 表明,当使用纹理参数时,翻译图像的视觉质量从基线显著提高。此外,在数据增强方面,分类、分割和检测的实验结果表明,使用翻译图像数据以及真实数据一致地提高了训练模型的性能。我们的研究结果表明,翻译图像中纹理的详细描述对于数据增强至关重要。考虑到对汽车等集装箱规模货物的定制检查的研究相对较少,我们相信这项研究将有助于促进集装箱筛选的自动化以及航空和港口的安全研究。