使用卷积自动编码器和对比学习的半监督主动学习

Semi-supervised active learning using convolutional auto- encoder and contrastive learning.

作者信息

Roda Hezi, Geva Amir B

机构信息

Electrical and Computer Engineering, Ben-Gurion University, Be'er Sheva, Israel.

InnerEye Ltd CTO, Herzliya, Israel.

出版信息

Front Artif Intell. 2024 May 30;7:1398844. doi: 10.3389/frai.2024.1398844. eCollection 2024.

DOI:10.3389/frai.2024.1398844

PMID:38873178

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11170704/

Abstract

Active learning is a field of machine learning that seeks to find the most efficient labels to annotate with a given budget, particularly in cases where obtaining labeled data is expensive or infeasible. This is becoming increasingly important with the growing success of learning-based methods, which often require large amounts of labeled data. Computer vision is one area where active learning has shown promise in tasks such as image classification, semantic segmentation, and object detection. In this research, we propose a pool-based semi-supervised active learning method for image classification that takes advantage of both labeled and unlabeled data. Many active learning approaches do not utilize unlabeled data, but we believe that incorporating these data can improve performance. To address this issue, our method involves several steps. First, we cluster the latent space of a pre-trained convolutional autoencoder. Then, we use a proposed clustering contrastive loss to strengthen the latent space's clustering while using a small amount of labeled data. Finally, we query the samples with the highest uncertainty to annotate with an oracle. We repeat this process until the end of the given budget. Our method is effective when the number of annotated samples is small, and we have validated its effectiveness through experiments on benchmark datasets. Our empirical results demonstrate the power of our method for image classification tasks in accuracy terms.

摘要

主动学习是机器学习的一个领域，旨在在给定预算下找到最有效的标签进行标注，特别是在获取标注数据成本高昂或不可行的情况下。随着基于学习的方法越来越成功，而这些方法通常需要大量标注数据，这一点变得越来越重要。计算机视觉是主动学习在图像分类、语义分割和目标检测等任务中显示出前景的一个领域。在本研究中，我们提出了一种基于池的半监督主动学习方法用于图像分类，该方法利用了标注数据和未标注数据。许多主动学习方法没有利用未标注数据，但我们认为纳入这些数据可以提高性能。为了解决这个问题，我们的方法包括几个步骤。首先，我们对预训练的卷积自动编码器的潜在空间进行聚类。然后，我们使用提出的聚类对比损失来加强潜在空间的聚类，同时使用少量标注数据。最后，我们查询具有最高不确定性的样本，由神谕进行标注。我们重复这个过程，直到给定预算结束。当标注样本数量较少时，我们的方法是有效的，并且我们通过在基准数据集上的实验验证了其有效性。我们的实证结果在准确性方面证明了我们的方法在图像分类任务中的强大能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a93e/11170704/721de92882d0/frai-07-1398844-g0001.jpg

相似文献

Semi-supervised active learning using convolutional auto- encoder and contrastive learning.使用卷积自动编码器和对比学习的半监督主动学习

Front Artif Intell. 2024 May 30;7:1398844. doi: 10.3389/frai.2024.1398844. eCollection 2024.

Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.基于伪标签自训练的局部对比损失的半监督医学图像分割。

Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.

Semi Supervised Learning with Deep Embedded Clustering for Image Classification and Segmentation.用于图像分类和分割的深度嵌入聚类半监督学习

IEEE Access. 2019;7:11093-11104. doi: 10.1109/ACCESS.2019.2891970. Epub 2019 Jan 9.

Semantic contrast with uncertainty-aware pseudo label for lumbar semi-supervised classification.基于具有不确定性感知的伪标签的语义对比进行腰椎半监督分类。

Comput Biol Med. 2024 Aug;178:108754. doi: 10.1016/j.compbiomed.2024.108754. Epub 2024 Jun 15.

Uncertainty-Guided Voxel-Level Supervised Contrastive Learning for Semi-Supervised Medical Image Segmentation.不确定性引导的体素级监督对比学习在半监督医学图像分割中的应用。

Int J Neural Syst. 2022 Apr;32(4):2250016. doi: 10.1142/S0129065722500162. Epub 2022 Feb 25.

Semi-Supervised Deep Learning Using Pseudo Labels for Hyperspectral Image Classification.基于伪标签的半监督深度学习在高光谱图像分类中的应用。

IEEE Trans Image Process. 2018 Mar;27(3):1259-1270. doi: 10.1109/TIP.2017.2772836. Epub 2017 Nov 13.

Semi-supervised medical image segmentation via a tripled-uncertainty guided mean teacher model with contrastive learning.基于三重不确定性引导的均值教师模型与对比学习的半监督医学图像分割。

Med Image Anal. 2022 Jul;79:102447. doi: 10.1016/j.media.2022.102447. Epub 2022 Apr 8.

Semi-TMS: an efficient regularization-oriented triple-teacher semi-supervised medical image segmentation model.Semi-TMS：一种面向正则化的高效三教师半监督医学图像分割模型。

Phys Med Biol. 2023 Oct 4;68(20). doi: 10.1088/1361-6560/acf90f.

RCPS: Rectified Contrastive Pseudo Supervision for Semi-Supervised Medical Image Segmentation.RCPS：用于半监督医学图像分割的校正对比伪监督

IEEE J Biomed Health Inform. 2023 Oct 6;PP. doi: 10.1109/JBHI.2023.3322590.

A veracity dissemination consistency-based few-shot fake news detection framework by synergizing adversarial and contrastive self-supervised learning.一种基于真实性传播一致性的少样本假新闻检测框架，通过协同对抗性和对比性自监督学习实现。

Sci Rep. 2024 Aug 22;14(1):19470. doi: 10.1038/s41598-024-70039-9.

引用本文的文献

Active learning with human heuristics: an algorithm robust to labeling bias.结合人类启发式方法的主动学习：一种对标签偏差具有鲁棒性的算法

Front Artif Intell. 2024 Nov 19;7:1491932. doi: 10.3389/frai.2024.1491932. eCollection 2024.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用卷积自动编码器和对比学习的半监督主动学习

Semi-supervised active learning using convolutional auto- encoder and contrastive learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献