• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于矩阵草图的深度无监督主动学习。

Deep Unsupervised Active Learning via Matrix Sketching.

出版信息

IEEE Trans Image Process. 2021;30:9280-9293. doi: 10.1109/TIP.2021.3124317. Epub 2021 Nov 12.

DOI:10.1109/TIP.2021.3124317
PMID:34739378
Abstract

Most existing unsupervised active learning methods aim at minimizing the data reconstruction loss by using the linear models to choose representative samples for manually labeling in an unsupervised setting. Thus these methods often fail in modelling data with complex non-linear structure. To address this issue, we propose a new deep unsupervised Active Learning method for classification tasks, inspired by the idea of Matrix Sketching, called ALMS. Specifically, ALMS leverages a deep auto-encoder to embed data into a latent space, and then describes all the embedded data with a small size sketch to summarize the major characteristics of the data. In contrast to previous approaches that reconstruct the whole data matrix for selecting the representative samples, ALMS aims to select a representative subset of samples to well approximate the sketch, which can preserve the major information of data meanwhile significantly reducing the number of network parameters. This makes our algorithm alleviate the issue of model overfitting and readily cope with large datasets. Actually, the sketch provides a type of self-supervised signal to guide the learning of the model. Moreover, we propose to construct an auxiliary self-supervised task by classifying real/fake samples, in order to further improve the representation ability of the encoder. We thoroughly evaluate the performance of ALMS on both single-label and multi-label classification tasks, and the results demonstrate its superior performance against the state-of-the-art methods. The code can be found at https://github.com/lrq99/ALMS.

摘要

大多数现有的无监督主动学习方法旨在通过使用线性模型在无监督环境中选择具有代表性的样本进行手动标记,从而最小化数据重建损失。因此,这些方法在建模具有复杂非线性结构的数据时往往会失败。为了解决这个问题,我们提出了一种新的基于矩阵素描的深度无监督主动学习分类方法,称为 ALMS。具体来说,ALMS 利用深度自动编码器将数据嵌入到潜在空间中,然后使用小尺寸草图来描述所有嵌入的数据,以总结数据的主要特征。与之前为选择代表性样本而重建整个数据矩阵的方法不同,ALMS 旨在选择代表样本的子集,以很好地近似草图,这可以保留数据的主要信息,同时大大减少网络参数的数量。这使得我们的算法缓解了模型过拟合的问题,并能够轻松处理大型数据集。实际上,草图提供了一种自我监督信号来指导模型的学习。此外,我们提出通过分类真实/假样本来构建辅助自我监督任务,以进一步提高编码器的表示能力。我们在单标签和多标签分类任务上对 ALMS 的性能进行了彻底评估,结果表明它优于最先进的方法。代码可以在 https://github.com/lrq99/ALMS 上找到。

相似文献

1
Deep Unsupervised Active Learning via Matrix Sketching.基于矩阵草图的深度无监督主动学习。
IEEE Trans Image Process. 2021;30:9280-9293. doi: 10.1109/TIP.2021.3124317. Epub 2021 Nov 12.
2
Structure Guided Deep Neural Network for Unsupervised Active Learning.结构引导的深度神经网络无监督主动学习。
IEEE Trans Image Process. 2022;31:2767-2781. doi: 10.1109/TIP.2022.3161076. Epub 2022 Apr 4.
3
Relation-Guided Representation Learning.关系引导的表示学习。
Neural Netw. 2020 Nov;131:93-102. doi: 10.1016/j.neunet.2020.07.014. Epub 2020 Jul 31.
4
S-CUDA: Self-cleansing unsupervised domain adaptation for medical image segmentation.S-CUDA:用于医学图像分割的自清洁无监督域适应
Med Image Anal. 2021 Dec;74:102214. doi: 10.1016/j.media.2021.102214. Epub 2021 Aug 12.
5
Self-Supervised Learning for Label Sparsity in Computational Drug Repositioning.计算药物重新定位中标签稀疏性的自监督学习
IEEE/ACM Trans Comput Biol Bioinform. 2023 Sep-Oct;20(5):3245-3256. doi: 10.1109/TCBB.2023.3254163. Epub 2023 Oct 9.
6
Unsupervised Abstract Reasoning for Raven's Problem Matrices.无监督抽象推理在瑞文标准推理测验中的应用。
IEEE Trans Image Process. 2021;30:8332-8341. doi: 10.1109/TIP.2021.3114987. Epub 2021 Oct 5.
7
Self-supervised driven consistency training for annotation efficient histopathology image analysis.用于高效标注组织病理学图像分析的自监督驱动一致性训练
Med Image Anal. 2022 Jan;75:102256. doi: 10.1016/j.media.2021.102256. Epub 2021 Oct 13.
8
Semi Supervised Learning with Deep Embedded Clustering for Image Classification and Segmentation.用于图像分类和分割的深度嵌入聚类半监督学习
IEEE Access. 2019;7:11093-11104. doi: 10.1109/ACCESS.2019.2891970. Epub 2019 Jan 9.
9
HyperGen: Compact and Efficient Genome Sketching using Hyperdimensional Vectors.HyperGen:使用超维向量进行紧凑且高效的基因组草图绘制
Bioinformatics. 2024 Jul 16;40(7). doi: 10.1093/bioinformatics/btae452.
10
Lung and Pancreatic Tumor Characterization in the Deep Learning Era: Novel Supervised and Unsupervised Learning Approaches.深度学习时代的肺部和胰腺肿瘤特征分析:新型监督式和非监督式学习方法。
IEEE Trans Med Imaging. 2019 Aug;38(8):1777-1787. doi: 10.1109/TMI.2019.2894349. Epub 2019 Jan 23.

引用本文的文献

1
Small Object Detection Pixel Level Balancing With Applications to Blood Cell Detection.小目标检测:用于血细胞检测的像素级平衡及应用
Front Physiol. 2022 Jun 17;13:911297. doi: 10.3389/fphys.2022.911297. eCollection 2022.