Dong Xuanyi, Zheng Liang, Ma Fan, Yang Yi, Meng Deyu
IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1641-1654. doi: 10.1109/TPAMI.2018.2844853. Epub 2018 Jun 7.
In this paper, we study object detection using a large pool of unlabeled images and only a few labeled images per category, named "few-example object detection". The key challenge consists in generating trustworthy training samples as many as possible from the pool. Using few training examples as seeds, our method iterates between model training and high-confidence sample selection. In training, easy samples are generated first and, then the poorly initialized model undergoes improvement. As the model becomes more discriminative, challenging but reliable samples are selected. After that, another round of model improvement takes place. To further improve the precision and recall of the generated training samples, we embed multiple detection models in our framework, which has proven to outperform the single model baseline and the model ensemble method. Experiments on PASCAL VOC'07, MS COCO'14, and ILSVRC'13 indicate that by using as few as three or four samples selected for each category, our method produces very competitive results when compared to the state-of-the-art weakly-supervised approaches using a large number of image-level labels.
在本文中,我们研究了利用大量未标记图像且每个类别仅有少量标记图像的目标检测方法,即“少样本目标检测”。关键挑战在于从图像池中尽可能多地生成可靠的训练样本。我们的方法以少量训练样本为种子,在模型训练和高置信度样本选择之间进行迭代。在训练过程中,首先生成简单样本,然后对初始化不佳的模型进行改进。随着模型的判别能力增强,选择具有挑战性但可靠的样本。之后,进行新一轮的模型改进。为了进一步提高生成训练样本的精度和召回率,我们在框架中嵌入了多个检测模型,事实证明该方法优于单模型基线和模型集成方法。在PASCAL VOC'07、MS COCO'14和ILSVRC'13上的实验表明,对于每个类别仅选择三到四个样本的情况,与使用大量图像级标签的当前最先进的弱监督方法相比,我们的方法能产生极具竞争力的结果。