WS-SAM: Generalizing SAM to Weakly Supervised Object Detection With Category Label.

Authors

Wang Hao, Jia Tong, Wang Qilong, Zuo Wangmeng

Publication information

IEEE Trans Image Process. 2025;34:4052-4066. doi: 10.1109/TIP.2025.3581729.

DOI:10.1109/TIP.2025.3581729
PMID:40569798
Abstract

Building an effective object detector usually depends on a large set of well-annotated training samples. Annotating such a dataset is extremely laborious and costly, because box-level supervision requires both an accurate classification category and localization coordinates. Compared with box-level annotation, weakly supervised forms of annotation (e.g., category, point, and scribble) are relatively less laborious to collect and offer a feasible way to reduce the reliance on fully annotated datasets. Because they lack sufficient supervisory information, however, current weakly supervised methods cannot achieve satisfactory detection performance. Recently, the Segment Anything Model (SAM) has emerged as a task-agnostic foundation model and has shown promising performance improvements in many related works, owing to its powerful generalization and data processing abilities. These properties inspire us to adopt SAM as a foundation for weakly supervised object detection, to compensate for the deficiencies in supervisory information. However, directly deploying SAM on the weakly supervised object detection task raises two issues. First, SAM needs meticulously designed prompts, and the need for such expert-level prompts restricts its applicability and practicality. Second, SAM is category-unaware: it cannot assign category labels to the predictions it generates. To solve these issues, we propose WS-SAM, which generalizes SAM to weakly supervised object detection with category labels. Specifically, we design an adaptive prompt generator to take full advantage of the spatial and semantic information from the prompt. It operates in a self-prompting manner, taking the output of SAM from the previous iteration as the prompt input to guide the next iteration, with the prompts adaptively generated from the class activation map.
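
The abstract does not say how prompts are derived from the activation map, so the following is only a minimal sketch of one common approach: take local maxima of a per-class activation map as point prompts. The function name `cam_to_point_prompts`, the relative threshold, and the toy map are all assumptions made for illustration, not details from the paper.

```python
from typing import List, Tuple

Point = Tuple[int, int]  # (row, col)


def cam_to_point_prompts(
    cam: List[List[float]],
    rel_threshold: float = 0.5,  # assumed cut-off, relative to the map's maximum
    max_points: int = 3,         # assumed cap on the number of prompts
) -> List[Point]:
    """Return up to max_points local maxima of a class activation map.

    A cell qualifies if its score is at least rel_threshold * max(cam)
    and no 8-connected neighbour has a higher score.
    """
    rows, cols = len(cam), len(cam[0])
    peak = max(max(row) for row in cam)
    if peak <= 0:
        return []

    candidates = []
    for r in range(rows):
        for c in range(cols):
            value = cam[r][c]
            if value < rel_threshold * peak:
                continue
            neighbours = [
                cam[rr][cc]
                for rr in range(max(0, r - 1), min(rows, r + 2))
                for cc in range(max(0, c - 1), min(cols, c + 2))
                if (rr, cc) != (r, c)
            ]
            if all(value >= n for n in neighbours):
                candidates.append((value, (r, c)))

    # Strongest activations first; ties are broken by position.
    candidates.sort(key=lambda item: (-item[0], item[1]))
    return [point for _, point in candidates[:max_points]]


# Toy 4x4 activation map for a single category (made-up values).
example_cam = [
    [0.1, 0.2, 0.1, 0.0],
    [0.2, 0.9, 0.3, 0.1],
    [0.1, 0.3, 0.2, 0.6],
    [0.0, 0.1, 0.4, 0.5],
]
example_prompts = cam_to_point_prompts(example_cam)
print(example_prompts)  # [(1, 1), (2, 3)]
```

In a self-prompting loop of the kind described above, the mask returned for these points could be used to re-weight the activation map before the next round of peak picking; that step is omitted here because the abstract gives no detail on it.
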
We also develop a segmentation mask refinement module and formulate the label assignment process as a shortest-path optimization problem by considering the similarity between each location and the prompts. Furthermore, a bidirectional adapter is implemented to resolve the domain discrepancy by incorporating domain-specific information. We evaluate the effectiveness of our method on several detection datasets (e.g., PASCAL VOC and MS COCO), and the experimental results show that the proposed method achieves clear improvements over state-of-the-art methods.
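
To make the shortest-path view of label assignment concrete, here is a hedged sketch rather than the authors' formulation: each grid location receives the label of the prompt it can reach at the lowest cumulative cost, where the cost of stepping between 4-connected neighbours is their feature dissimilarity. The scalar feature map, the absolute-difference cost, and the function name `assign_labels_by_shortest_path` are assumptions chosen to keep the example small.

```python
import heapq
from typing import Dict, List, Tuple

Point = Tuple[int, int]  # (row, col)


def assign_labels_by_shortest_path(
    feature: List[List[float]],
    prompts: Dict[Point, str],
) -> List[List[str]]:
    """Label every cell with the category of its cheapest-to-reach prompt.

    Runs multi-source Dijkstra over the 4-connected grid. The edge cost
    between neighbouring cells is |feature[a] - feature[b]| (an assumed
    dissimilarity), so paths through similar regions are cheap.
    """
    rows, cols = len(feature), len(feature[0])
    dist = [[float("inf")] * cols for _ in range(rows)]
    label = [[""] * cols for _ in range(rows)]
    heap: List[Tuple[float, int, int]] = []

    # Every prompt is a source with zero cost and its own category.
    for (r, c), category in prompts.items():
        dist[r][c] = 0.0
        label[r][c] = category
        heapq.heappush(heap, (0.0, r, c))

    while heap:
        d, r, c = heapq.heappop(heap)
        if d > dist[r][c]:
            continue  # stale queue entry
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols:
                nd = d + abs(feature[r][c] - feature[nr][nc])
                if nd < dist[nr][nc]:
                    dist[nr][nc] = nd
                    label[nr][nc] = label[r][c]  # inherit the source's category
                    heapq.heappush(heap, (nd, nr, nc))
    return label


# Toy 3x3 feature map with two prompts (made-up values and categories).
example_feature = [
    [1.0, 1.0, 9.0],
    [1.0, 2.0, 9.0],
    [1.0, 8.0, 9.0],
]
example_labels = assign_labels_by_shortest_path(
    example_feature, {(0, 0): "cat", (2, 2): "dog"}
)
for row in example_labels:
    print(row)
```

In this toy case the cell at (2, 1) is labelled "dog" even though it is adjacent to a "cat" cell, because its feature value is far closer to the "dog" region. That is the behaviour a similarity-weighted shortest path is meant to capture, as opposed to plain spatial proximity.
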

