一种使用伪标签进行分类任务的半监督堆叠自动编码器

A Semi-Supervised Stacked Autoencoder Using the Pseudo Label for Classification Tasks.

作者信息

Lai Jie, Wang Xiaodan, Xiang Qian, Quan Wen, Song Yafei

机构信息

College of Air and Missile Defense, Air Force Engineering University, Xi'an 710051, China.

College of Air Traffic Control and Navigation, Air Force Engineering University, Xi'an 710051, China.

出版信息

Entropy (Basel). 2023 Aug 30;25(9):1274. doi: 10.3390/e25091274.

DOI:10.3390/e25091274

PMID:37761573

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10528325/

Abstract

The efficiency and cognitive limitations of manual sample labeling result in a large number of unlabeled training samples in practical applications. Making full use of both labeled and unlabeled samples is the key to solving the semi-supervised problem. However, as a supervised algorithm, the stacked autoencoder (SAE) only considers labeled samples and is difficult to apply to semi-supervised problems. Thus, by introducing the pseudo-labeling method into the SAE, a novel pseudo label-based semi-supervised stacked autoencoder (PL-SSAE) is proposed to address the semi-supervised classification tasks. The PL-SSAE first utilizes the unsupervised pre-training on all samples by the autoencoder (AE) to initialize the network parameters. Then, by the iterative fine-tuning of the network parameters based on the labeled samples, the unlabeled samples are identified, and their pseudo labels are generated. Finally, the pseudo-labeled samples are used to construct the regularization term and fine-tune the network parameters to complete the training of the PL-SSAE. Different from the traditional SAE, the PL-SSAE requires all samples in pre-training and the unlabeled samples with pseudo labels in fine-tuning to fully exploit the feature and category information of the unlabeled samples. Empirical evaluations on various benchmark datasets show that the semi-supervised performance of the PL-SSAE is more competitive than that of the SAE, sparse stacked autoencoder (SSAE), semi-supervised stacked autoencoder (Semi-SAE) and semi-supervised stacked autoencoder (Semi-SSAE).

摘要

在实际应用中，人工样本标注的效率和认知局限性导致大量未标注的训练样本。充分利用已标注和未标注样本是解决半监督问题的关键。然而，作为一种监督算法，堆叠自编码器（SAE）仅考虑已标注样本，难以应用于半监督问题。因此，通过将伪标签方法引入SAE，提出了一种基于伪标签的新型半监督堆叠自编码器（PL-SSAE）来解决半监督分类任务。PL-SSAE首先利用自编码器（AE）对所有样本进行无监督预训练，以初始化网络参数。然后，基于已标注样本对网络参数进行迭代微调，识别未标注样本并生成其伪标签。最后，利用伪标注样本构建正则化项并微调网络参数，以完成PL-SSAE的训练。与传统SAE不同，PL-SSAE在预训练中需要所有样本，在微调中需要带有伪标签的未标注样本，以充分利用未标注样本的特征和类别信息。在各种基准数据集上的实证评估表明，PL-SSAE的半监督性能比SAE、稀疏堆叠自编码器（SSAE）、半监督堆叠自编码器（Semi-SAE）和半监督堆叠自编码器（Semi-SSAE）更具竞争力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/661d/10528325/5f86826f3f46/entropy-25-01274-g001.jpg

相似文献

A Semi-Supervised Stacked Autoencoder Using the Pseudo Label for Classification Tasks.

Entropy (Basel). 2023 Aug 30;25(9):1274. doi: 10.3390/e25091274.

CPSS: Fusing consistency regularization and pseudo-labeling techniques for semi-supervised deep cardiovascular disease detection using all unlabeled electrocardiograms.

Comput Methods Programs Biomed. 2024 Sep;254:108315. doi: 10.1016/j.cmpb.2024.108315. Epub 2024 Jul 4.

FaxMatch: Multi-Curriculum Pseudo-Labeling for semi-supervised medical image classification.

Med Phys. 2023 May;50(5):3210-3222. doi: 10.1002/mp.16312. Epub 2023 Feb 21.

Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.

Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.

Semi-Supervised Deep Learning Using Pseudo Labels for Hyperspectral Image Classification.

IEEE Trans Image Process. 2018 Mar;27(3):1259-1270. doi: 10.1109/TIP.2017.2772836. Epub 2017 Nov 13.

Semi Supervised Learning with Deep Embedded Clustering for Image Classification and Segmentation.

IEEE Access. 2019;7:11093-11104. doi: 10.1109/ACCESS.2019.2891970. Epub 2019 Jan 9.

Robust Semi-Supervised Traffic Sign Recognition via Self-Training and Weakly-Supervised Learning.

Sensors (Basel). 2020 May 8;20(9):2684. doi: 10.3390/s20092684.

Semantic contrast with uncertainty-aware pseudo label for lumbar semi-supervised classification.

Comput Biol Med. 2024 Aug;178:108754. doi: 10.1016/j.compbiomed.2024.108754. Epub 2024 Jun 15.

Graph-Based Self-Training for Semi-Supervised Deep Similarity Learning.

Sensors (Basel). 2023 Apr 13;23(8):3944. doi: 10.3390/s23083944.

Deep virtual adversarial self-training with consistency regularization for semi-supervised medical image classification.

Med Image Anal. 2021 May;70:102010. doi: 10.1016/j.media.2021.102010. Epub 2021 Feb 22.

本文引用的文献

SGAEMDA: Predicting miRNA-Disease Associations Based on Stacked Graph Autoencoder.

Cells. 2022 Dec 9;11(24):3984. doi: 10.3390/cells11243984.

Multimodal Medical Image Fusion Using Stacked Auto-encoder in NSCT Domain.

J Digit Imaging. 2022 Oct;35(5):1308-1325. doi: 10.1007/s10278-021-00554-y. Epub 2022 Jun 29.

IMS-CDA: Prediction of CircRNA-Disease Associations From the Integration of Multisource Similarity Information With Deep Stacked Autoencoder Model.

IEEE Trans Cybern. 2021 Nov;51(11):5522-5531. doi: 10.1109/TCYB.2020.3022852. Epub 2021 Nov 9.

A semi-supervised deep learning method based on stacked sparse auto-encoder for cancer prediction using RNA-seq data.

Comput Methods Programs Biomed. 2018 Nov;166:99-105. doi: 10.1016/j.cmpb.2018.10.004. Epub 2018 Oct 5.

Deep learning in neural networks: an overview.

Neural Netw. 2015 Jan;61:85-117. doi: 10.1016/j.neunet.2014.09.003. Epub 2014 Oct 13.

Representation learning: a review and new perspectives.

IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1798-828. doi: 10.1109/TPAMI.2013.50.

Reducing the dimensionality of data with neural networks.

Science. 2006 Jul 28;313(5786):504-7. doi: 10.1126/science.1127647.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种使用伪标签进行分类任务的半监督堆叠自动编码器

A Semi-Supervised Stacked Autoencoder Using the Pseudo Label for Classification Tasks.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献