• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大数据时代的小数据挑战:无监督和半监督方法的最新进展综述。

Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):2168-2187. doi: 10.1109/TPAMI.2020.3031898. Epub 2022 Mar 4.

DOI:10.1109/TPAMI.2020.3031898
PMID:33074801
Abstract

Representation learning with small labeled data have emerged in many problems, since the success of deep neural networks often relies on the availability of a huge amount of labeled data that is expensive to collect. To address it, many efforts have been made on training sophisticated models with few labeled data in an unsupervised and semi-supervised fashion. In this paper, we will review the recent progresses on these two major categories of methods. A wide spectrum of models will be categorized in a big picture, where we will show how they interplay with each other to motivate explorations of new ideas. We will review the principles of learning the transformation equivariant, disentangled, self-supervised and semi-supervised representations, all of which underpin the foundation of recent progresses. Many implementations of unsupervised and semi-supervised generative models have been developed on the basis of these criteria, greatly expanding the territory of existing autoencoders, generative adversarial nets (GANs) and other deep networks by exploring the distribution of unlabeled data for more powerful representations. We will discuss emerging topics by revealing the intrinsic connections between unsupervised and semi-supervised learning, and propose in future directions to bridge the algorithmic and theoretical gap between transformation equivariance for unsupervised learning and supervised invariance for supervised learning, and unify unsupervised pretraining and supervised finetuning. We will also provide a broader outlook of future directions to unify transformation and instance equivariances for representation learning, connect unsupervised and semi-supervised augmentations, and explore the role of the self-supervised regularization for many learning problems.

摘要

在许多问题中,使用少量标注数据的表示学习已经出现,因为深度学习网络的成功往往依赖于大量标注数据的可用性,而这些数据的收集成本很高。为了解决这个问题,人们已经在使用少量标注数据进行无监督和半监督训练复杂模型方面做出了很多努力。在本文中,我们将回顾这两类方法的最新进展。我们将在一个大的框架中对广泛的模型进行分类,展示它们是如何相互作用的,以激发新思想的探索。我们将回顾学习变换等变、解耦、自监督和半监督表示的原理,这些原理都是最近进展的基础。基于这些标准,已经开发了许多无监督和半监督生成模型的实现,通过探索未标记数据的分布,为更强大的表示形式,极大地扩展了现有自动编码器、生成对抗网络(GAN)和其他深度网络的领域。我们将通过揭示无监督学习和半监督学习之间的内在联系,讨论新兴的主题,并提出未来的方向,以弥合无监督学习的变换等变和监督学习的监督不变之间的算法和理论差距,并统一无监督预训练和监督微调。我们还将提供更广泛的未来方向展望,以统一表示学习的变换和实例等变,连接无监督和半监督增强,并探索自监督正则化在许多学习问题中的作用。

相似文献

1
Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods.大数据时代的小数据挑战:无监督和半监督方法的最新进展综述。
IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):2168-2187. doi: 10.1109/TPAMI.2020.3031898. Epub 2022 Mar 4.
2
The Utility of Unsupervised Machine Learning in Anatomic Pathology.无监督机器学习在解剖病理学中的应用。
Am J Clin Pathol. 2022 Jan 6;157(1):5-14. doi: 10.1093/ajcp/aqab085.
3
A unified deep semi-supervised graph learning scheme based on nodes re-weighting and manifold regularization.一种基于节点重新加权和流形正则化的统一深度半监督图学习方案。
Neural Netw. 2023 Jan;158:188-196. doi: 10.1016/j.neunet.2022.11.017. Epub 2022 Nov 19.
4
Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology.无监督和半监督学习:植物系统生物学机器学习的下一个前沿。
Plant J. 2022 Sep;111(6):1527-1538. doi: 10.1111/tpj.15905. Epub 2022 Jul 27.
5
Semi-supervised generative adversarial networks for closed-angle detection on anterior segment optical coherence tomography images: an empirical study with a small training dataset.用于眼前节光学相干断层扫描图像闭角检测的半监督生成对抗网络:基于小训练数据集的实证研究
Ann Transl Med. 2021 Jul;9(13):1073. doi: 10.21037/atm-20-7436.
6
Deep virtual adversarial self-training with consistency regularization for semi-supervised medical image classification.深度对偶对抗自训练与一致性正则化在半监督医学图像分类中的应用。
Med Image Anal. 2021 May;70:102010. doi: 10.1016/j.media.2021.102010. Epub 2021 Feb 22.
7
Multi-class motor imagery EEG classification using collaborative representation-based semi-supervised extreme learning machine.基于协同表示的半监督极限学习机的多类运动想象 EEG 分类。
Med Biol Eng Comput. 2020 Sep;58(9):2119-2130. doi: 10.1007/s11517-020-02227-4. Epub 2020 Jul 16.
8
Unsupervised and self-supervised deep learning approaches for biomedical text mining.无监督和自监督深度学习方法在生物医学文本挖掘中的应用。
Brief Bioinform. 2021 Mar 22;22(2):1592-1603. doi: 10.1093/bib/bbab016.
9
Graph Convolution Networks with manifold regularization for semi-supervised learning.图卷积网络与流形正则化的半监督学习。
Neural Netw. 2020 Jul;127:160-167. doi: 10.1016/j.neunet.2020.04.016. Epub 2020 Apr 23.
10
Semi-Supervised and Unsupervised Deep Visual Learning: A Survey.半监督与无监督深度视觉学习:一项综述。
IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1327-1347. doi: 10.1109/TPAMI.2022.3201576. Epub 2024 Feb 6.

引用本文的文献

1
XGB-BIF: An XGBoost-Driven Biomarker Identification Framework for Detecting Cancer Using Human Genomic Data.XGB-BIF:一种用于利用人类基因组数据检测癌症的基于XGBoost的生物标志物识别框架。
Int J Mol Sci. 2025 Jun 11;26(12):5590. doi: 10.3390/ijms26125590.
2
Applications of machine learning-assisted extracellular vesicles analysis technology in tumor diagnosis.机器学习辅助的细胞外囊泡分析技术在肿瘤诊断中的应用
Comput Struct Biotechnol J. 2025 Jun 6;27:2460-2472. doi: 10.1016/j.csbj.2025.06.014. eCollection 2025.
3
Electrolyte-Gated Transistor Array (20 × 20) with Low-Programming Interference Based on Coplanar Gate Structure for Unsupervised Learning.
基于共面栅极结构的具有低编程干扰的电解质门控晶体管阵列(20×20)用于无监督学习。
Small Sci. 2024 Mar 3;4(4):2300306. doi: 10.1002/smsc.202300306. eCollection 2024 Apr.
4
Enhancing Brain Age Prediction and Neurodegeneration Detection with Contrastive Learning on Regional Biomechanical Properties.利用区域生物力学特性的对比学习增强脑年龄预测和神经退行性变检测
bioRxiv. 2025 Mar 26:2025.03.25.645330. doi: 10.1101/2025.03.25.645330.
5
Clinical validation and optimization of machine learning models for early prediction of sepsis.用于脓毒症早期预测的机器学习模型的临床验证与优化
Front Med (Lausanne). 2025 Feb 5;12:1521660. doi: 10.3389/fmed.2025.1521660. eCollection 2025.
6
Semisupervised Contrastive Learning for Bioactivity Prediction Using Cell Painting Image Data.使用细胞绘画图像数据进行生物活性预测的半监督对比学习
J Chem Inf Model. 2025 Jan 27;65(2):528-543. doi: 10.1021/acs.jcim.4c00835. Epub 2025 Jan 6.
7
Semi-supervised recognition for artificial intelligence assisted pathology image diagnosis.人工智能辅助病理图像诊断的半监督识别。
Sci Rep. 2024 Sep 20;14(1):21984. doi: 10.1038/s41598-024-70750-7.
8
Small data methods in omics: the power of one.组学中的小数据方法:以一当十。
Nat Methods. 2024 Sep;21(9):1597-1602. doi: 10.1038/s41592-024-02390-8. Epub 2024 Aug 22.
9
Dual-branch Transformer for semi-supervised medical image segmentation.双分支Transformer 用于半监督医学图像分割。
J Appl Clin Med Phys. 2024 Oct;25(10):e14483. doi: 10.1002/acm2.14483. Epub 2024 Aug 12.
10
Clustering analysis for classifying fake real estate listings.用于对虚假房地产列表进行分类的聚类分析。
PeerJ Comput Sci. 2024 Jun 20;10:e2019. doi: 10.7717/peerj-cs.2019. eCollection 2024.