School of Computer Science and Technology, Zhejiang University of Technology, Hangzhou, Zhejiang, 310014, China; Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, Anhui, 230031, China.
School of Computer Science and Technology, Zhejiang University of Technology, Hangzhou, Zhejiang, 310014, China.
Comput Biol Med. 2023 Nov;166:107487. doi: 10.1016/j.compbiomed.2023.107487. Epub 2023 Sep 20.
Deep learning object detection networks require a large amount of box annotation data for training, which is difficult to obtain in the medical image field. The few-shot object detection algorithm is significant for an unseen category, which can be identified and localized with a few labeled data. For medical image datasets, the image style and target features are incredibly different from the knowledge obtained from training on the original dataset. We propose a background suppression attention(BSA) and feature space fine-tuning module (FSF) for this cross-domain situation where there is a large gap between the source and target domains. The background suppression attention reduces the influence of background information in the training process. The feature space fine-tuning module adjusts the feature distribution of the interest features, which helps to make better predictions. Our approach improves detection performance by using only the information extracted from the model without maintaining additional information, which is convenient and can be easily plugged into other networks. We evaluate the detection performance in the in-domain situation and cross-domain situation. In-domain experiments on the VOC and COCO datasets and the cross-domain experiments on the VOC to medical image dataset UriSed2K show that our proposed method effectively improves the few-shot detection performance.
深度学习目标检测网络需要大量的框标注数据进行训练,而在医学图像领域,这是很难获取的。在未见类别中,少量样本目标检测算法非常重要,它可以使用少量标记数据进行识别和定位。对于医学图像数据集,图像样式和目标特征与从原始数据集训练中获得的知识有很大的不同。针对这种源域和目标域之间存在较大差距的跨域情况,我们提出了一种背景抑制注意力(BSA)和特征空间微调模块(FSF)。背景抑制注意力在训练过程中减少背景信息的影响。特征空间微调模块调整感兴趣特征的特征分布,有助于做出更好的预测。我们的方法通过仅使用从模型中提取的信息来提高检测性能,而无需维护其他信息,这既方便又可以轻松地插入到其他网络中。我们在域内情况和跨域情况评估了检测性能。在 VOC 和 COCO 数据集的域内实验以及在 VOC 到 UriSed2K 医学图像数据集的跨域实验中,我们提出的方法有效地提高了少量样本的检测性能。