Suppr超能文献

用于半监督少样本学习的以样本为中心的特征生成

Sample-Centric Feature Generation for Semi-Supervised Few-Shot Learning.

作者信息

Zhang Bo, Ye Hancheng, Yu Gang, Wang Bin, Wu Yike, Fan Jiayuan, Chen Tao

出版信息

IEEE Trans Image Process. 2022;31:2309-2320. doi: 10.1109/TIP.2022.3154938. Epub 2022 Mar 11.

Abstract

Semi-supervised few-shot learning aims to improve the model generalization ability by means of both limited labeled data and widely-available unlabeled data. Previous works attempt to model the relations between the few-shot labeled data and extra unlabeled data, by performing a label propagation or pseudo-labeling process using an episodic training strategy. However, the feature distribution represented by the pseudo-labeled data itself is coarse-grained, meaning that there might be a large distribution gap between the pseudo-labeled data and the real query data. To this end, we propose a sample-centric feature generation (SFG) approach for semi-supervised few-shot image classification. Specifically, the few-shot labeled samples from different classes are initially trained to predict pseudo-labels for the potential unlabeled samples. Next, a semi-supervised meta-generator is utilized to produce derivative features centering around each pseudo-labeled sample, enriching the intra-class feature diversity. Meanwhile, the sample-centric generation constrains the generated features to be compact and close to the pseudo-labeled sample, ensuring the inter-class feature discriminability. Further, a reliability assessment (RA) metric is developed to weaken the influence of generated outliers on model learning. Extensive experiments validate the effectiveness of the proposed feature generation approach on challenging one- and few-shot image classification benchmarks.

摘要

半监督少样本学习旨在通过有限的标记数据和广泛可用的未标记数据来提高模型的泛化能力。先前的工作试图通过使用情节训练策略执行标签传播或伪标签过程来对少样本标记数据与额外的未标记数据之间的关系进行建模。然而,伪标记数据本身所代表的特征分布是粗粒度的,这意味着伪标记数据与真实查询数据之间可能存在较大的分布差距。为此,我们提出了一种以样本为中心的特征生成(SFG)方法用于半监督少样本图像分类。具体而言,首先对来自不同类别的少样本标记样本进行训练,以预测潜在未标记样本的伪标签。接下来,利用半监督元生成器围绕每个伪标记样本生成派生特征,丰富类内特征多样性。同时,以样本为中心的生成将生成的特征约束为紧凑且接近伪标记样本,确保类间特征可区分性。此外,还开发了一种可靠性评估(RA)指标来减弱生成的离群值对模型学习的影响。大量实验验证了所提出的特征生成方法在具有挑战性的单样本和少样本图像分类基准上的有效性。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验