• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用类别信息进行增强型不确定性采样以改进主动学习。

Enhanced uncertainty sampling with category information for improved active learning.

作者信息

Wang Xiaochuan, Zhang Bo, Wang Fei, Bao Tao, Lu Zhiqing, Bao Jiawei

机构信息

China Ship Scientific Research Center, Wuxi, China.

Taihu Laboratory of Deepsea Technology Science, Wuxi, China.

出版信息

PLoS One. 2025 Jul 7;20(7):e0327694. doi: 10.1371/journal.pone.0327694. eCollection 2025.

DOI:10.1371/journal.pone.0327694
PMID:40622992
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12233261/
Abstract

Traditional uncertainty sampling methods in active learning often neglect category information, leading to imbalanced sample selection in multi-class computer vision tasks. Our approach integrates category information with uncertainty sampling through a novel active learning framework to address this limitation. Our method employs a pre-trained VGG16 architecture and cosine similarity metrics to efficiently extract category features without requiring additional model training. The framework combines these features with traditional uncertainty measures to ensure balanced sampling across classes while maintaining computational efficiency. Extensive experiments across both object detection and image classification tasks validate our method's effectiveness. For object detection, our approach achieves competitive mAP scores while ensuring balanced category representation. For image classification, our method achieves accuracy comparable to state-of-the-art approaches while reducing computational overhead by up to 80%. The results validate our approach's ability to balance sampling efficiency with dataset representativeness across different computer vision tasks. This work offers a practical, efficient solution for large-scale data annotation in domains with limited labeled data and diverse class distributions.

摘要

主动学习中的传统不确定性采样方法在多类计算机视觉任务中常常忽略类别信息,导致样本选择不均衡。我们的方法通过一个新颖的主动学习框架将类别信息与不确定性采样相结合,以解决这一局限性。我们的方法采用预训练的VGG1,6架构和余弦相似性度量,无需额外的模型训练就能有效地提取类别特征。该框架将这些特征与传统的不确定性度量相结合,以确保在保持计算效率的同时跨类别进行均衡采样。在目标检测和图像分类任务上进行的大量实验验证了我们方法的有效性。对于目标检测而言,我们的方法在确保类别表示均衡的同时取得了具有竞争力的平均精度均值(mAP)分数。对于图像分类,我们的方法在将计算开销降低多达80%的情况下,实现了与最先进方法相当的准确率。这些结果验证了我们的方法在不同计算机视觉任务中平衡采样效率与数据集代表性的能力。这项工作为在标记数据有限且类别分布多样的领域中进行大规模数据标注提供了一个实用、高效的解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/a6f870a6843b/pone.0327694.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/69f0b8fb7328/pone.0327694.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/412cd7adbcd1/pone.0327694.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/d5ae5e2a4e46/pone.0327694.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/be166d70ece5/pone.0327694.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/82305b47deff/pone.0327694.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/d0c2efb67cf1/pone.0327694.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/04a3eaf9a430/pone.0327694.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/c5d13b2d36d2/pone.0327694.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/a6f870a6843b/pone.0327694.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/69f0b8fb7328/pone.0327694.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/412cd7adbcd1/pone.0327694.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/d5ae5e2a4e46/pone.0327694.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/be166d70ece5/pone.0327694.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/82305b47deff/pone.0327694.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/d0c2efb67cf1/pone.0327694.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/04a3eaf9a430/pone.0327694.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/c5d13b2d36d2/pone.0327694.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7c8/12233261/a6f870a6843b/pone.0327694.g009.jpg

相似文献

1
Enhanced uncertainty sampling with category information for improved active learning.利用类别信息进行增强型不确定性采样以改进主动学习。
PLoS One. 2025 Jul 7;20(7):e0327694. doi: 10.1371/journal.pone.0327694. eCollection 2025.
2
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
3
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
4
Privacy-Preserving Glycemic Management in Type 1 Diabetes: Development and Validation of a Multiobjective Federated Reinforcement Learning Framework.1型糖尿病中保护隐私的血糖管理:多目标联邦强化学习框架的开发与验证
JMIR Diabetes. 2025 Jul 4;10:e72874. doi: 10.2196/72874.
5
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
6
WeedSwin hierarchical vision transformer with SAM-2 for multi-stage weed detection and classification.用于多阶段杂草检测和分类的带有SAM-2的WeedSwin分层视觉变换器。
Sci Rep. 2025 Jul 2;15(1):23274. doi: 10.1038/s41598-025-05092-z.
7
A novel deep learning framework for retinal disease detection leveraging contextual and local features cues from retinal images.一种用于视网膜疾病检测的新型深度学习框架,利用来自视网膜图像的上下文和局部特征线索。
Med Biol Eng Comput. 2025 Feb 7. doi: 10.1007/s11517-025-03314-0.
8
Exploring the Potential of Electroencephalography Signal-Based Image Generation Using Diffusion Models: Integrative Framework Combining Mixed Methods and Multimodal Analysis.利用扩散模型探索基于脑电图信号的图像生成潜力:结合混合方法和多模态分析的综合框架
JMIR Med Inform. 2025 Jun 25;13:e72027. doi: 10.2196/72027.
9
Video swin-CLSTM transformer: Enhancing human action recognition with optical flow and long-term dependencies.视频双流卷积长短期记忆变压器:利用光流和长期依赖性增强人体动作识别
PLoS One. 2025 Jul 7;20(7):e0327717. doi: 10.1371/journal.pone.0327717. eCollection 2025.
10
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes: Methodology and Validation Study.用于评估、选择和解释2型糖尿病患者心血管疾病结局机器学习模型的责任框架:方法与验证研究
JMIR Med Inform. 2025 Jun 27;13:e66200. doi: 10.2196/66200.

本文引用的文献

1
Multi-Label Active Learning Algorithms for Image Classification: Overview and Future Promise.用于图像分类的多标签主动学习算法:概述与未来展望
ACM Comput Surv. 2020 Jun;53(2). doi: 10.1145/3379504. Epub 2020 Mar 13.
2
Batch Mode Active Learning for Regression With Expected Model Change.批量模式下基于预期模型变化的回归主动学习。
IEEE Trans Neural Netw Learn Syst. 2017 Jul;28(7):1668-1681. doi: 10.1109/TNNLS.2016.2542184. Epub 2016 Apr 18.
3
Scalable active learning for multiclass image classification.多类图像分类的可扩展主动学习。
IEEE Trans Pattern Anal Mach Intell. 2012 Nov;34(11):2259-73. doi: 10.1109/TPAMI.2012.21.