Albelwi Saleh
Faculty of Computing and Information Technology, University of Tabuk, Tabuk 47731, Saudi Arabia.
Industrial Innovation and Robotic Center (IIRC), University of Tabuk, Tabuk 47731, Saudi Arabia.
Entropy (Basel). 2022 Apr 14;24(4):551. doi: 10.3390/e24040551.
Although deep learning algorithms have achieved significant progress in a variety of domains, they require costly annotations on huge datasets. Self-supervised learning (SSL) using unlabeled data has emerged as an alternative, as it eliminates manual annotation. To do this, SSL constructs feature representations using pretext tasks that operate without manual annotation, which allows models trained in these tasks to extract useful latent representations that later improve downstream tasks such as object classification and detection. The early methods of SSL are based on auxiliary pretext tasks as a way to learn representations using pseudo-labels, or labels that were created automatically based on the dataset's attributes. Furthermore, contrastive learning has also performed well in learning representations via SSL. To succeed, it pushes positive samples closer together, and negative ones further apart, in the latent space. This paper provides a comprehensive literature review of the top-performing SSL methods using auxiliary pretext and contrastive learning techniques. It details the motivation for this research, a general pipeline of SSL, the terminologies of the field, and provides an examination of pretext tasks and self-supervised methods. It also examines how self-supervised methods compare to supervised ones, and then discusses both further considerations and ongoing challenges faced by SSL.
尽管深度学习算法在各个领域都取得了显著进展,但它们需要对海量数据集进行昂贵的标注。使用未标注数据的自监督学习(SSL)作为一种替代方法应运而生,因为它无需人工标注。为此,SSL 使用无需人工标注即可运行的前置任务来构建特征表示,这使得在这些任务中训练的模型能够提取有用的潜在表示,从而改善诸如目标分类和检测等下游任务。早期的 SSL 方法基于辅助前置任务,以此作为使用伪标签(即根据数据集属性自动创建的标签)来学习表示的一种方式。此外,对比学习在通过 SSL 学习表示方面也表现出色。为了取得成功,它在潜在空间中将正样本拉近,将负样本推远。本文对使用辅助前置和对比学习技术的顶级 SSL 方法进行了全面的文献综述。它详细阐述了这项研究的动机、SSL 的一般流程、该领域的术语,并对前置任务和自监督方法进行了审视。它还研究了自监督方法与监督方法的比较情况,然后讨论了 SSL 进一步需要考虑的因素和面临的持续挑战。