IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):8694-8700. doi: 10.1109/TPAMI.2021.3082567. Epub 2022 Oct 4.
In this paper, we propose the K-Shot Contrastive Learning (KSCL) of visual features by applying multiple augmentations to investigate the sample variations within individual instances. It aims to combine the advantages of inter-instance discrimination by learning discriminative features to distinguish between different instances, as well as intra-instance variations by matching queries against the variants of augmented samples over instances. Particularly, for each instance, it constructs an instance subspace to model the configuration of how the significant factors of variations in K-shot augmentations can be combined to form the variants of augmentations. Given a query, the most relevant variant of instances is then retrieved by projecting the query onto their subspaces to predict the positive instance class. This generalizes the existing contrastive learning that can be viewed as a special one-shot case. An eigenvalue decomposition is performed to configure instance subspaces, and the embedding network can be trained end-to-end through the differentiable subspace configuration. Experiment results demonstrate the proposed K-shot contrastive learning achieves superior performances to the state-of-the-art unsupervised methods.
在本文中,我们通过应用多种增强方法来提出 K -shot 对比学习(KSCL)的视觉特征,以研究单个实例内的样本变化。它旨在通过学习区分特征来结合内部实例区分的优势,从而区分不同的实例,以及通过匹配查询与实例的增强样本的变体来匹配内部实例变化。特别是,对于每个实例,它构建一个实例子空间来建模如何组合 K-shot 增强中的显著因素变化来形成增强的变体的配置。给定一个查询,然后通过将查询投影到它们的子空间上来预测阳性实例类,从而检索最相关的实例变体。这推广了现有的对比学习,它可以被视为一种特殊的单样本情况。通过特征值分解来配置实例子空间,并且可以通过可微分子空间配置来端到端地训练嵌入网络。实验结果表明,所提出的 K-shot 对比学习方法优于最新的无监督方法。