• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于工业制造的小训练数据集自动边界框标注

Automatic Bounding Box Annotation with Small Training Datasets for Industrial Manufacturing.

作者信息

Geiß Manuela, Wagner Raphael, Baresch Martin, Steiner Josef, Zwick Michael

机构信息

Software Competence Center Hagenberg GmbH, Softwarepark 32a, 4232 Hagenberg, Austria.

KEBA Group AG, Reindlstraße 51, 4040 Linz, Austria.

出版信息

Micromachines (Basel). 2023 Feb 13;14(2):442. doi: 10.3390/mi14020442.

DOI:10.3390/mi14020442
PMID:36838142
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9962188/
Abstract

In the past few years, object detection has attracted a lot of attention in the context of human-robot collaboration and Industry 5.0 due to enormous quality improvements in deep learning technologies. In many applications, object detection models have to be able to quickly adapt to a changing environment, i.e., to learn new objects. A crucial but challenging prerequisite for this is the automatic generation of new training data which currently still limits the broad application of object detection methods in industrial manufacturing. In this work, we discuss how to adapt state-of-the-art object detection methods for the task of automatic bounding box annotation in a use case where the background is homogeneous and the object's label is provided by a human. We compare an adapted version of Faster R-CNN and the Scaled-YOLOv4-p5 architecture and show that both can be trained to distinguish unknown objects from a complex but homogeneous background using only a small amount of training data. In contrast to most other state-of-the-art methods for bounding box labeling, our proposed method neither requires human verification, a predefined set of classes, nor a very large manually annotated dataset. Our method outperforms the state-of-the-art, transformer-based object discovery method on our simple fruits dataset by large margins.

摘要

在过去几年中,由于深度学习技术在质量上有了巨大提升,目标检测在人机协作和工业5.0的背景下受到了广泛关注。在许多应用中,目标检测模型必须能够快速适应不断变化的环境,即学习新的目标。实现这一点的一个关键但具有挑战性的前提条件是自动生成新的训练数据,而这目前仍然限制了目标检测方法在工业制造中的广泛应用。在这项工作中,我们讨论了在背景均匀且目标标签由人工提供的用例中,如何使最先进的目标检测方法适应自动边界框标注任务。我们比较了Faster R-CNN的改进版本和Scaled-YOLOv4-p5架构,结果表明,仅使用少量训练数据,两者都可以被训练以从复杂但均匀的背景中区分出未知目标。与大多数其他用于边界框标注的最先进方法不同,我们提出的方法既不需要人工验证,也不需要预定义的类别集,也不需要非常大的人工标注数据集。在我们简单的水果数据集上,我们的方法大幅优于基于变压器的最先进目标发现方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/98ba66821b5b/micromachines-14-00442-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/2c49e38324ba/micromachines-14-00442-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/283c1ec89fa8/micromachines-14-00442-g0A2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/dbd92394cdc3/micromachines-14-00442-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/defda637ece5/micromachines-14-00442-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/f9435efeccbc/micromachines-14-00442-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/2c090ba27d2e/micromachines-14-00442-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/7f0357122a0c/micromachines-14-00442-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/6f9ec7d0b762/micromachines-14-00442-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/b4e1226b3e40/micromachines-14-00442-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/e9f3535b43f7/micromachines-14-00442-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/e42cbafed0c1/micromachines-14-00442-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/98ba66821b5b/micromachines-14-00442-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/2c49e38324ba/micromachines-14-00442-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/283c1ec89fa8/micromachines-14-00442-g0A2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/dbd92394cdc3/micromachines-14-00442-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/defda637ece5/micromachines-14-00442-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/f9435efeccbc/micromachines-14-00442-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/2c090ba27d2e/micromachines-14-00442-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/7f0357122a0c/micromachines-14-00442-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/6f9ec7d0b762/micromachines-14-00442-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/b4e1226b3e40/micromachines-14-00442-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/e9f3535b43f7/micromachines-14-00442-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/e42cbafed0c1/micromachines-14-00442-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a65/9962188/98ba66821b5b/micromachines-14-00442-g010.jpg

相似文献

1
Automatic Bounding Box Annotation with Small Training Datasets for Industrial Manufacturing.用于工业制造的小训练数据集自动边界框标注
Micromachines (Basel). 2023 Feb 13;14(2):442. doi: 10.3390/mi14020442.
2
Automatic creation of annotations for chest radiographs based on the positional information extracted from radiographic image reports.基于从放射影像报告中提取的位置信息,为胸部 X 光片自动创建注释。
Comput Methods Programs Biomed. 2021 Sep;209:106331. doi: 10.1016/j.cmpb.2021.106331. Epub 2021 Aug 4.
3
Oriented Vehicle Detection in Aerial Images Based on YOLOv4.基于 YOLOv4 的航空图像中定向车辆检测。
Sensors (Basel). 2022 Nov 1;22(21):8394. doi: 10.3390/s22218394.
4
Few-Shot Object Detection: Application to Medieval Musicological Studies.少样本目标检测:在中世纪音乐学研究中的应用。
J Imaging. 2022 Jan 19;8(2):18. doi: 10.3390/jimaging8020018.
5
Prediction of glioma-subtypes: comparison of performance on a DL classifier using bounding box areas versus annotated tumors.胶质瘤亚型的预测:使用边界框面积的深度学习分类器与标注肿瘤在性能上的比较。
BMC Biomed Eng. 2022 May 19;4(1):4. doi: 10.1186/s42490-022-00061-3.
6
SHEL5K: An Extended Dataset and Benchmarking for Safety Helmet Detection.SHEL5K:用于安全头盔检测的扩展数据集和基准测试。
Sensors (Basel). 2022 Mar 17;22(6):2315. doi: 10.3390/s22062315.
7
Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data.基于全自动化 DCNN 的热图像注释,使用基于 RGB 数据预训练的神经网络。
Sensors (Basel). 2021 Feb 23;21(4):1552. doi: 10.3390/s21041552.
8
3D-BoxSup: Positive-Unlabeled Learning of Brain Tumor Segmentation Networks From 3D Bounding Boxes.3D-BoxSup:基于3D边界框的脑肿瘤分割网络的正样本-无标签学习
Front Neurosci. 2020 Apr 28;14:350. doi: 10.3389/fnins.2020.00350. eCollection 2020.
9
Weakly supervised salient object detection via image category annotation.通过图像类别标注实现弱监督显著目标检测。
Math Biosci Eng. 2023 Dec 1;20(12):21359-21381. doi: 10.3934/mbe.2023945.
10
Improved region proposal network for enhanced few-shot object detection.改进的区域提议网络,用于增强少样本目标检测。
Neural Netw. 2024 Dec;180:106699. doi: 10.1016/j.neunet.2024.106699. Epub 2024 Sep 3.

本文引用的文献

1
Few-Shot Object Detection: A Comprehensive Survey.少样本目标检测:全面综述。
IEEE Trans Neural Netw Learn Syst. 2024 Sep;35(9):11958-11978. doi: 10.1109/TNNLS.2023.3265051. Epub 2024 Sep 3.
2
Automated Data Annotation for 6-DoF AI-Based Navigation Algorithm Development.用于基于人工智能的六自由度导航算法开发的自动数据标注
J Imaging. 2021 Nov 10;7(11):236. doi: 10.3390/jimaging7110236.
3
Mask-Refined R-CNN: A Network for Refining Object Details in Instance Segmentation.Mask-Refined R-CNN:用于实例分割中细化对象细节的网络。
Sensors (Basel). 2020 Feb 13;20(4):1010. doi: 10.3390/s20041010.
4
Object Detection With Deep Learning: A Review.基于深度学习的目标检测研究综述。
IEEE Trans Neural Netw Learn Syst. 2019 Nov;30(11):3212-3232. doi: 10.1109/TNNLS.2018.2876865. Epub 2019 Jan 28.
5
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
6
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
7
Toward open set recognition.面向开集识别。
IEEE Trans Pattern Anal Mach Intell. 2013 Jul;35(7):1757-72. doi: 10.1109/TPAMI.2012.256.