• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

置信度得分:目标检测性能评估中被遗忘的维度。

Confidence Score: The Forgotten Dimension of Object Detection Performance Evaluation.

机构信息

Marduk Technologies OÜ, 12618 Tallinn, Estonia.

National Center for Robotics and Internet of Things Technology, Communication and Information Technologies Research Institute, King Abdulaziz City for Science and Technology-KACST, Riyadh 11442, Saudi Arabia.

出版信息

Sensors (Basel). 2021 Jun 25;21(13):4350. doi: 10.3390/s21134350.

DOI:10.3390/s21134350
PMID:34202089
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8271464/
Abstract

When deploying a model for object detection, a confidence score threshold is chosen to filter out false positives and ensure that a predicted bounding box has a certain minimum score. To achieve state-of-the-art performance on benchmark datasets, most neural networks use a rather low threshold as a high number of false positives is not penalized by standard evaluation metrics. However, in scenarios of Artificial Intelligence (AI) applications that require high confidence scores (e.g., due to legal requirements or consequences of incorrect detections are severe) or a certain level of model robustness is required, it is unclear which base model to use since they were mainly optimized for benchmark scores. In this paper, we propose a method to find the optimum performance point of a model as a basis for fairer comparison and deeper insights into the trade-offs caused by selecting a confidence score threshold.

摘要

在部署用于对象检测的模型时,会选择置信度得分阈值来过滤掉误报,并确保预测的边界框具有一定的最低得分。为了在基准数据集上实现最先进的性能,大多数神经网络使用相当低的阈值,因为标准评估指标不会因大量误报而受到惩罚。然而,在需要高置信度得分的人工智能 (AI) 应用场景中(例如,由于法律要求或错误检测的后果严重),或者需要一定水平的模型鲁棒性的情况下,由于它们主要针对基准得分进行优化,因此不清楚应该使用哪个基础模型。在本文中,我们提出了一种方法来找到模型的最佳性能点,作为更公平的比较和更深入了解选择置信度得分阈值所带来的权衡的基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/53fcb20eba1a/sensors-21-04350-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/7f5a029dcf23/sensors-21-04350-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/84eef44c7b1c/sensors-21-04350-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/af53724eccb5/sensors-21-04350-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/01bbd82e914c/sensors-21-04350-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/32576dc7a311/sensors-21-04350-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/1d561cb90bfd/sensors-21-04350-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/9ed49e0dae51/sensors-21-04350-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/d7761c0503ad/sensors-21-04350-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/ec58d46babce/sensors-21-04350-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/0ebb5d6c3d30/sensors-21-04350-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/69845b3faff2/sensors-21-04350-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/b38cbc306b36/sensors-21-04350-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/996cf21728db/sensors-21-04350-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/f404f75b1f9f/sensors-21-04350-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/8221bd031365/sensors-21-04350-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/53fcb20eba1a/sensors-21-04350-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/7f5a029dcf23/sensors-21-04350-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/84eef44c7b1c/sensors-21-04350-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/af53724eccb5/sensors-21-04350-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/01bbd82e914c/sensors-21-04350-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/32576dc7a311/sensors-21-04350-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/1d561cb90bfd/sensors-21-04350-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/9ed49e0dae51/sensors-21-04350-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/d7761c0503ad/sensors-21-04350-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/ec58d46babce/sensors-21-04350-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/0ebb5d6c3d30/sensors-21-04350-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/69845b3faff2/sensors-21-04350-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/b38cbc306b36/sensors-21-04350-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/996cf21728db/sensors-21-04350-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/f404f75b1f9f/sensors-21-04350-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/8221bd031365/sensors-21-04350-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9992/8271464/53fcb20eba1a/sensors-21-04350-g016.jpg

相似文献

1
Confidence Score: The Forgotten Dimension of Object Detection Performance Evaluation.置信度得分:目标检测性能评估中被遗忘的维度。
Sensors (Basel). 2021 Jun 25;21(13):4350. doi: 10.3390/s21134350.
2
The Use of Artificial Intelligence Approaches for Performance Improvement of Low-Cost Integrated Navigation Systems.利用人工智能方法提高低成本集成导航系统的性能。
Sensors (Basel). 2023 Jul 3;23(13):6127. doi: 10.3390/s23136127.
3
Rethinking the performance comparison between SNNS and ANNS.重新思考 SNNS 和 ANNS 的性能比较。
Neural Netw. 2020 Jan;121:294-307. doi: 10.1016/j.neunet.2019.09.005. Epub 2019 Sep 19.
4
Data Association for Multi-Object Tracking via Deep Neural Networks.基于深度神经网络的多目标跟踪数据关联。
Sensors (Basel). 2019 Jan 29;19(3):559. doi: 10.3390/s19030559.
5
CrimeNet: Neural Structured Learning using Vision Transformer for violence detection.犯罪网络:使用视觉Transformer进行暴力检测的神经结构化学习
Neural Netw. 2023 Apr;161:318-329. doi: 10.1016/j.neunet.2023.01.048. Epub 2023 Feb 2.
6
Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks.大规模、高分辨率的人类、猴子和最先进的深度人工神经网络核心视觉对象识别行为比较。
J Neurosci. 2018 Aug 15;38(33):7255-7269. doi: 10.1523/JNEUROSCI.0388-18.2018. Epub 2018 Jul 13.
7
Smart chest X-ray worklist prioritization using artificial intelligence: a clinical workflow simulation.使用人工智能进行智能胸部X光检查工作列表优先级排序:临床工作流程模拟
Eur Radiol. 2021 Jun;31(6):3837-3845. doi: 10.1007/s00330-020-07480-7. Epub 2020 Nov 21.
8
Weakly Supervised Object Localization and Detection: A Survey.弱监督目标定位与检测:综述
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5866-5885. doi: 10.1109/TPAMI.2021.3074313. Epub 2022 Aug 4.
9
Two-stage multiple object detection using CNN and correlative filter for accuracy improvement.使用卷积神经网络(CNN)和相关滤波器进行两阶段多目标检测以提高准确率。
Heliyon. 2023 Jan 10;9(1):e12716. doi: 10.1016/j.heliyon.2022.e12716. eCollection 2023 Jan.
10
Embedding topological features into convolutional neural network salient object detection.将拓扑特征嵌入卷积神经网络显著目标检测中。
Neural Netw. 2020 Jan;121:308-318. doi: 10.1016/j.neunet.2019.09.009. Epub 2019 Sep 25.

引用本文的文献

1
A novel dataset and deep learning object detection benchmark for grapevine pest surveillance.一个用于葡萄害虫监测的新型数据集和深度学习目标检测基准。
Front Plant Sci. 2024 Dec 12;15:1485216. doi: 10.3389/fpls.2024.1485216. eCollection 2024.
2
Enhancing Grapevine Node Detection to Support Pruning Automation: Leveraging State-of-the-Art YOLO Detection Models for 2D Image Analysis.增强葡萄蔓节点检测以支持修剪自动化:利用最先进的 YOLO 检测模型进行 2D 图像分析。
Sensors (Basel). 2024 Oct 22;24(21):6774. doi: 10.3390/s24216774.
3
Optimizing Camera Exposure Time for Automotive Applications.
优化汽车应用中的相机曝光时间。
Sensors (Basel). 2024 Aug 8;24(16):5135. doi: 10.3390/s24165135.
4
Application of Machine Learning for Automating Behavioral Tracking of Captive Bornean Orangutans ().机器学习在圈养婆罗洲猩猩行为跟踪自动化中的应用()。 (注:原文括号部分内容缺失,翻译时保留原样)
Animals (Basel). 2024 Jun 8;14(12):1729. doi: 10.3390/ani14121729.
5
Enhancing Caries Detection in Bitewing Radiographs Using YOLOv7.基于 YOLOv7 提升口内根尖片龋病检测能力。
J Digit Imaging. 2023 Dec;36(6):2635-2647. doi: 10.1007/s10278-023-00871-4. Epub 2023 Aug 28.
6
Experimental validation of computer-vision methods for the successful detection of endodontic treatment obturation and progression from noisy radiographs.利用计算机视觉方法成功检测根管治疗封闭和从嘈杂的射线照片中进展的实验验证。
Oral Radiol. 2023 Oct;39(4):683-698. doi: 10.1007/s11282-023-00685-8. Epub 2023 Apr 25.
7
A Study of an Online Tracking System for Spark Images of Abrasive Belt-Polishing Workpieces.基于在线跟踪系统的砂带抛光工件火花图像研究
Sensors (Basel). 2023 Feb 10;23(4):2025. doi: 10.3390/s23042025.
8
Influence of Insufficient Dataset Augmentation on IoU and Detection Threshold in CNN Training for Object Detection on Aerial Images.在用于航空图像目标检测的 CNN 训练中,数据集不足对 IoU 和检测阈值的影响。
Sensors (Basel). 2022 Nov 23;22(23):9080. doi: 10.3390/s22239080.
9
Could simplified stimuli change how the brain performs visual search tasks? A deep neural network study.简化刺激能否改变大脑执行视觉搜索任务的方式?一项深度神经网络研究。
J Vis. 2022 Jun 1;22(7):3. doi: 10.1167/jov.22.7.3.
10
LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection.LEOD-Net:用于实时目标检测的学习线编码边界框。
Sensors (Basel). 2022 May 12;22(10):3699. doi: 10.3390/s22103699.