• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

让我们疯狂起来:超越边界框表示法,用于自动驾驶中基于鱼眼相机的目标检测

Let's Go Bananas: Beyond Bounding Box Representations for Fisheye Camera-Based Object Detection in Autonomous Driving.

作者信息

Yogamani Senthil, Sistu Ganesh, Denny Patrick, Courtney Jane

机构信息

School of Electrical & Electronic Engineering, Technological University Dublin, D07 ADY7 Dublin, Ireland.

D2ICE Research Centre, University of Limerick, V94 T9PX Limerick, Ireland.

出版信息

Sensors (Basel). 2025 Jun 14;25(12):3735. doi: 10.3390/s25123735.

DOI:10.3390/s25123735
PMID:40573620
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12196831/
Abstract

Object detection is a mature problem in autonomous driving, with pedestrian detection being one of the first commercially deployed algorithms. It has been extensively studied in the literature. However, object detection is relatively less explored for fisheye cameras used for surround-view near-field sensing. The standard bounding-box representation fails in fisheye cameras due to heavy radial distortion, particularly in the periphery. In this paper, a generic object detection framework is implemented using the base YOLO (You Only Look Once) detector to systematically explore various object representations using the public WoodScape dataset. First, we implement basic representations, namely the standard bounding box, the oriented bounding box, and the ellipse. Secondly, we implement a generic polygon and propose a novel curvature-adaptive polygon, which obtains an improvement of 3 mAP (mean average precision) points. A polygon is expensive to annotate and complex to use in downstream tasks; thus, it is not practical to use it in real-world applications. However, we utilize it to demonstrate that the accuracy gap between the polygon and the bounding box representation is very high due to strong distortion in fisheye cameras. This motivates the design of a distortion-aware optimal representation of the bounding box for fisheye images, which tend to be banana-shaped near the periphery. We derive a novel representation called a curved box and improve it further by leveraging vanishing-point constraints. The proposed curved box representations outperform the bounding box by 3 mAP points and the oriented bounding box by 1.6 mAP points. In addition, the camera geometry tensor is formulated to provide adaptation to non-linear fisheye camera distortion characteristics and improves the performance further by 1.4 mAP points.

摘要

目标检测在自动驾驶领域是一个成熟的问题,行人检测是最早商业化部署的算法之一。它在文献中已得到广泛研究。然而,对于用于环视近场传感的鱼眼相机,目标检测的研究相对较少。由于严重的径向畸变,特别是在图像边缘,标准的边界框表示在鱼眼相机中失效。在本文中,使用基础的YOLO(You Only Look Once)检测器实现了一个通用的目标检测框架,以利用公开的WoodScape数据集系统地探索各种目标表示。首先,我们实现了基本表示,即标准边界框、定向边界框和椭圆。其次,我们实现了一个通用多边形,并提出了一种新颖的曲率自适应多边形,其平均精度均值(mAP)提高了3个点。多边形标注成本高且在下游任务中使用复杂;因此,在实际应用中使用它并不实际。然而,我们利用它来证明由于鱼眼相机中的强烈畸变,多边形和边界框表示之间的精度差距非常大。这促使我们设计一种针对鱼眼图像的边界框的畸变感知最优表示,鱼眼图像在边缘附近往往呈香蕉形。我们推导出一种称为弯曲框的新颖表示,并通过利用消失点约束进一步改进它。所提出的弯曲框表示比边界框的mAP高3个点,比定向边界框高1.6个点。此外,还制定了相机几何张量,以适应非线性鱼眼相机的畸变特性,并使性能进一步提高1.4个mAP点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/81477948d418/sensors-25-03735-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/ef54226f9c9a/sensors-25-03735-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/173ccfb0b334/sensors-25-03735-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/0237377e0dac/sensors-25-03735-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/05a4c904b242/sensors-25-03735-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/f1e7d8ab01ac/sensors-25-03735-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/04a445c0a9ae/sensors-25-03735-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/c9a772bac09e/sensors-25-03735-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/6074925fb6fd/sensors-25-03735-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/88dfe55bc0a5/sensors-25-03735-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/0ec241d1bc2e/sensors-25-03735-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/f69ea81c0be0/sensors-25-03735-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/46b476933875/sensors-25-03735-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/e07c000220a0/sensors-25-03735-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/ed43a9702f1c/sensors-25-03735-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/81477948d418/sensors-25-03735-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/ef54226f9c9a/sensors-25-03735-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/173ccfb0b334/sensors-25-03735-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/0237377e0dac/sensors-25-03735-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/05a4c904b242/sensors-25-03735-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/f1e7d8ab01ac/sensors-25-03735-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/04a445c0a9ae/sensors-25-03735-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/c9a772bac09e/sensors-25-03735-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/6074925fb6fd/sensors-25-03735-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/88dfe55bc0a5/sensors-25-03735-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/0ec241d1bc2e/sensors-25-03735-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/f69ea81c0be0/sensors-25-03735-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/46b476933875/sensors-25-03735-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/e07c000220a0/sensors-25-03735-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/ed43a9702f1c/sensors-25-03735-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fb6/12196831/81477948d418/sensors-25-03735-g015.jpg

相似文献

1
Let's Go Bananas: Beyond Bounding Box Representations for Fisheye Camera-Based Object Detection in Autonomous Driving.让我们疯狂起来:超越边界框表示法,用于自动驾驶中基于鱼眼相机的目标检测
Sensors (Basel). 2025 Jun 14;25(12):3735. doi: 10.3390/s25123735.
2
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
3
Integrating computer vision algorithms and RFID system for identification and tracking of group-housed animals: an example with pigs.整合计算机视觉算法和射频识别系统用于群居动物的识别与跟踪:以猪为例。
J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae174.
4
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
5
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
6
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
7
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
8
Interventions to reduce harm from continued tobacco use.减少持续吸烟危害的干预措施。
Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3.
9
Rapid, point-of-care antigen tests for diagnosis of SARS-CoV-2 infection.用于 SARS-CoV-2 感染诊断的快速、即时抗原检测。
Cochrane Database Syst Rev. 2022 Jul 22;7(7):CD013705. doi: 10.1002/14651858.CD013705.pub3.
10
Incentives for preventing smoking in children and adolescents.预防儿童和青少年吸烟的激励措施。
Cochrane Database Syst Rev. 2017 Jun 6;6(6):CD008645. doi: 10.1002/14651858.CD008645.pub3.

本文引用的文献

1
Overview and Empirical Analysis of ISP Parameter Tuning for Visual Perception in Autonomous Driving.自动驾驶视觉感知中ISP参数调整的概述与实证分析
J Imaging. 2019 Sep 24;5(10):78. doi: 10.3390/jimaging5100078.
2
Proposal-Free Network for Instance-Level Object Segmentation.用于实例级目标分割的无提案网络
IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):2978-2991. doi: 10.1109/TPAMI.2017.2775623. Epub 2017 Nov 22.
3
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.