Department of Computer Science, University College London, London, UK.
J Xray Sci Technol. 2019;27(6):1007-1020. doi: 10.3233/XST-190545.
X-ray imaging is a crucial and ubiquitous tool for detecting threats to transport security, but interpretation of the images presents a logistical bottleneck. Recent advances in Deep Learning image classification offer hope of improving throughput through automation. However, Deep Learning methods require large quantities of labelled training data. While photographic data is cheap and plentiful, comparable training sets are seldom available for the X-ray domain.
To determine whether and to what extent it is feasible to exploit the availability of photo data to supplement the training of X-ray threat detectors.
A new dataset was collected, consisting of 1901 matched pairs of photo & X-ray images of 501 common objects. Of these, 258 pairs were of 69 objects considered threats in the context of aviation. This data was used to test a variety of transfer learning approaches. A simple model of threat cue availability was developed to understand the limits of this transferability.
Appearance features learned from photos provide a useful basis for training classifiers. Some transfer from the photo to the X-ray domain is possible as ∼40% of danger cues are shared between the modalities, but the effectiveness of this transfer is limited since ∼60% of cues are not.
Transfer learning is beneficial when X-ray data is very scarce-of the order of tens of training images in our experiments-but provides no significant benefit when hundreds or thousands of X-ray images are available.
X 射线成像是检测运输安全威胁的重要且无处不在的工具,但图像解释存在物流瓶颈。深度学习图像分类的最新进展为通过自动化提高吞吐量带来了希望。然而,深度学习方法需要大量标记的训练数据。虽然摄影数据便宜且丰富,但 X 射线领域很少有可比的训练集。
确定是否以及在何种程度上可以利用照片数据的可用性来补充 X 射线威胁探测器的训练。
收集了一个新的数据集,由 501 个常见物体的 1901 对匹配的照片和 X 射线图像组成。其中,69 个物体中有 258 对被认为是航空领域的威胁。该数据用于测试各种迁移学习方法。开发了一种简单的威胁线索可用性模型,以了解这种可转移性的限制。
从照片中学习到的外观特征为训练分类器提供了有用的基础。从照片到 X 射线域的一些转移是可能的,因为两种模态之间共享约 40%的危险线索,但由于约 60%的线索不共享,这种转移的有效性有限。
当 X 射线数据非常稀缺(在我们的实验中只有几十张训练图像)时,迁移学习是有益的,但当有数百或数千张 X 射线图像可用时,它并没有带来显著的好处。