• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

少样本目标检测:在中世纪音乐学研究中的应用。

Few-Shot Object Detection: Application to Medieval Musicological Studies.

作者信息

Ibrahim Bekkouch Imad Eddine, Eyharabide Victoria, Le Page Valérie, Billiet Frédéric

机构信息

Sorbonne Center for Artificial Intelligence, Sorbonne University, 75005 Paris, France.

STIH Laboratory, Sorbonne University, 75005 Paris, France.

出版信息

J Imaging. 2022 Jan 19;8(2):18. doi: 10.3390/jimaging8020018.

DOI:10.3390/jimaging8020018
PMID:35200721
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8880595/
Abstract

Detecting objects with a small representation in images is a challenging task, especially when the style of the images is very different from recent photos, which is the case for cultural heritage datasets. This problem is commonly known as few-shot object detection and is still a new field of research. This article presents a simple and effective method for black box few-shot object detection that works with all the current state-of-the-art object detection models. We also present a new dataset called MMSD for medieval musicological studies that contains five classes and 693 samples, manually annotated by a group of musicology experts. Due to the significant diversity of styles and considerable disparities between the artistic representations of the objects, our dataset is more challenging than the current standards. We evaluate our method on YOLOv4 (m/s), (Mask/Faster) RCNN, and ViT/Swin-t. We present two methods of benchmarking these models based on the overall data size and the worst-case scenario for object detection. The experimental results show that our method always improves object detector results compared to traditional transfer learning, regardless of the underlying architecture.

摘要

在图像中检测具有小表示的物体是一项具有挑战性的任务,特别是当图像的风格与近期照片非常不同时,文化遗产数据集就是这种情况。这个问题通常被称为少样本目标检测,并且仍然是一个新的研究领域。本文提出了一种简单有效的黑盒少样本目标检测方法,该方法适用于所有当前最先进的目标检测模型。我们还提出了一个名为MMSD的用于中世纪音乐学研究的新数据集,它包含五个类别和693个样本,由一组音乐学专家进行手动标注。由于风格的显著多样性以及物体艺术表现之间的巨大差异,我们的数据集比当前标准更具挑战性。我们在YOLOv4(m/s)、(Mask/Faster)RCNN和ViT/Swin-t上评估我们的方法。我们基于整体数据大小和目标检测的最坏情况场景提出了两种对这些模型进行基准测试的方法。实验结果表明,与传统迁移学习相比,我们的方法总能提高目标检测器的结果,无论底层架构如何。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7c/8880595/66b8a104b9bc/jimaging-08-00018-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7c/8880595/66b8a104b9bc/jimaging-08-00018-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7c/8880595/66b8a104b9bc/jimaging-08-00018-g002.jpg

相似文献

1
Few-Shot Object Detection: Application to Medieval Musicological Studies.少样本目标检测:在中世纪音乐学研究中的应用。
J Imaging. 2022 Jan 19;8(2):18. doi: 10.3390/jimaging8020018.
2
Automatic Bounding Box Annotation with Small Training Datasets for Industrial Manufacturing.用于工业制造的小训练数据集自动边界框标注
Micromachines (Basel). 2023 Feb 13;14(2):442. doi: 10.3390/mi14020442.
3
Expandable-RCNN: toward high-efficiency incremental few-shot object detection.可扩展区域卷积神经网络:迈向高效增量少样本目标检测
Front Artif Intell. 2024 Apr 23;7:1377337. doi: 10.3389/frai.2024.1377337. eCollection 2024.
4
Decoupled Metric Network for Single-Stage Few-Shot Object Detection.用于单阶段少样本目标检测的解耦度量网络
IEEE Trans Cybern. 2023 Jan;53(1):514-525. doi: 10.1109/TCYB.2022.3149825. Epub 2022 Dec 23.
5
Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild.基于小样本的野外目标检测与视角估计
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3090-3106. doi: 10.1109/TPAMI.2022.3174072. Epub 2023 Feb 3.
6
Composite Object Relation Modeling for Few-Shot Scene Recognition.用于少样本场景识别的复合对象关系建模
IEEE Trans Image Process. 2023;32:5678-5691. doi: 10.1109/TIP.2023.3321475. Epub 2023 Oct 17.
7
SHEL5K: An Extended Dataset and Benchmarking for Safety Helmet Detection.SHEL5K:用于安全头盔检测的扩展数据集和基准测试。
Sensors (Basel). 2022 Mar 17;22(6):2315. doi: 10.3390/s22062315.
8
Improved region proposal network for enhanced few-shot object detection.改进的区域提议网络,用于增强少样本目标检测。
Neural Netw. 2024 Dec;180:106699. doi: 10.1016/j.neunet.2024.106699. Epub 2024 Sep 3.
9
Cross-modality interaction for few-shot multispectral object detection with semantic knowledge.基于语义知识的少样本多光谱目标检测的跨模态交互。
Neural Netw. 2024 May;173:106156. doi: 10.1016/j.neunet.2024.106156. Epub 2024 Feb 5.
10
Towards Generalized Few-Shot Open-Set Object Detection.迈向广义少样本开放集目标检测
IEEE Trans Image Process. 2024 Feb 15;PP. doi: 10.1109/TIP.2024.3364495.

引用本文的文献

1
Graph Neural Network and LSTM Integration for Enhanced Multi-Label Style Classification of Piano Sonatas.用于增强钢琴奏鸣曲多标签风格分类的图神经网络与长短期记忆网络集成
Sensors (Basel). 2025 Jan 23;25(3):666. doi: 10.3390/s25030666.
2
Improved YOLOv5 Network for Real-Time Object Detection in Vehicle-Mounted Camera Capture Scenarios.基于车载相机采集场景的实时目标检测的改进型 YOLOv5 网络。
Sensors (Basel). 2023 May 9;23(10):4589. doi: 10.3390/s23104589.
3
Computer Vision and Robotics for Cultural Heritage: Theory and Applications.用于文化遗产的计算机视觉与机器人技术:理论与应用

本文引用的文献

1
SSD-EMB: An Improved SSD Using Enhanced Feature Map Block for Object Detection.SSD-EMB:一种利用增强特征图块的 SSD 目标检测改进方法。
Sensors (Basel). 2021 Apr 17;21(8):2842. doi: 10.3390/s21082842.
2
Anomaly Detection Based on Zero-Shot Outlier Synthesis and Hierarchical Feature Distillation.
IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):281-291. doi: 10.1109/TNNLS.2020.3027667. Epub 2022 Jan 5.
3
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
J Imaging. 2022 Dec 30;9(1):9. doi: 10.3390/jimaging9010009.
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
4
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.