Quan Quan, Yao Qingsong, Zhu Heqin, Kevin Zhou S
IEEE Trans Med Imaging. 2025 Jan;44(1):154-164. doi: 10.1109/TMI.2024.3436713. Epub 2025 Jan 2.
Contrastive learning (CL) is a form of self-supervised learning and has been widely used for various tasks. Different from widely studied instance-level contrastive learning, pixel-wise contrastive learning mainly helps with pixel-wise dense prediction tasks. The counterpart to an instance in instance-level CL is a pixel, along with its neighboring context, in pixel-wise CL. Aiming to build better feature representation, there is a vast literature about designing instance augmentation strategies for instance-level CL; but there is little similar work on pixel augmentation for pixel-wise CL with a pixel granularity. In this paper, we attempt to bridge this gap. We first classify a pixel into three categories, namely low-, medium-, and high-informative, based on the information quantity the pixel contains. We then adaptively design separate augmentation strategies for each category in terms of augmentation intensity and sampling ratio. Extensive experiments validate that our information-guided pixel augmentation strategy succeeds in encoding more discriminative representations and surpassing other competitive approaches in unsupervised local feature matching. Furthermore, our pretrained model improves the performance of both one-shot and fully supervised models. To the best of our knowledge, we are the first to propose a pixel augmentation method with a pixel granularity for enhancing unsupervised pixel-wise contrastive learning. Code is available at https://github.com/Curli-quan/IGU-Aug.
对比学习(CL)是一种自监督学习形式,已被广泛应用于各种任务。与广泛研究的实例级对比学习不同,逐像素对比学习主要有助于逐像素密集预测任务。在实例级对比学习中,实例的对应物在逐像素对比学习中是一个像素及其相邻上下文。为了构建更好的特征表示,有大量关于为实例级对比学习设计实例增强策略的文献;但在逐像素对比学习中,针对像素粒度的像素增强方面,类似的工作很少。在本文中,我们试图弥合这一差距。我们首先根据像素包含的信息量将像素分为三类,即低信息、中等信息和高信息。然后,我们根据增强强度和采样率为每个类别自适应地设计单独的增强策略。大量实验验证了我们的信息引导像素增强策略成功地编码了更具判别力的表示,并在无监督局部特征匹配方面超越了其他竞争方法。此外,我们的预训练模型提高了一次性和全监督模型的性能。据我们所知,我们是第一个提出具有像素粒度的像素增强方法以增强无监督逐像素对比学习的。代码可在https://github.com/Curli - quan/IGU - Aug获取。