• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于KITTI城市场景中3D感知特征的视觉目标识别

Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes.

作者信息

Yebes J Javier, Bergasa Luis M, García-Garrido Miguel Ángel

机构信息

Department of Electronics, University of Alcalá, Alcalá de Henares 28871, Spain.

出版信息

Sensors (Basel). 2015 Apr 20;15(4):9228-50. doi: 10.3390/s150409228.

DOI:10.3390/s150409228
PMID:25903553
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4431302/
Abstract

Driver assistance systems and autonomous robotics rely on the deployment of several sensors for environment perception. Compared to LiDAR systems, the inexpensive vision sensors can capture the 3D scene as perceived by a driver in terms of appearance and depth cues. Indeed, providing 3D image understanding capabilities to vehicles is an essential target in order to infer scene semantics in urban environments. One of the challenges that arises from the navigation task in naturalistic urban scenarios is the detection of road participants (e.g., cyclists, pedestrians and vehicles). In this regard, this paper tackles the detection and orientation estimation of cars, pedestrians and cyclists, employing the challenging and naturalistic KITTI images. This work proposes 3D-aware features computed from stereo color images in order to capture the appearance and depth peculiarities of the objects in road scenes. The successful part-based object detector, known as DPM, is extended to learn richer models from the 2.5D data (color and disparity), while also carrying out a detailed analysis of the training pipeline. A large set of experiments evaluate the proposals, and the best performing approach is ranked on the KITTI website. Indeed, this is the first work that reports results with stereo data for the KITTI object challenge, achieving increased detection ratios for the classes car and cyclist compared to a baseline DPM.

摘要

驾驶辅助系统和自主机器人依靠部署多个传感器来进行环境感知。与激光雷达系统相比,价格低廉的视觉传感器能够从外观和深度线索方面捕捉驾驶员所感知的三维场景。事实上,为车辆提供三维图像理解能力是推断城市环境中场景语义的一个重要目标。在自然主义城市场景中的导航任务所带来的挑战之一是检测道路参与者(如骑自行车的人、行人和车辆)。在这方面,本文利用具有挑战性的自然主义基蒂(KITTI)图像,解决汽车、行人和骑自行车者的检测与方向估计问题。这项工作提出了从立体彩色图像计算出的三维感知特征,以便捕捉道路场景中物体的外观和深度特性。成功的基于部件的目标检测器(称为DPM)被扩展,以从2.5D数据(颜色和视差)中学习更丰富的模型,同时还对训练流程进行了详细分析。大量实验对这些提议进行了评估,表现最佳的方法在基蒂网站上进行了排名。事实上,这是第一项针对基蒂目标挑战赛报告立体数据结果的工作,与基线DPM相比,汽车和骑自行车者类别的检测率有所提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/d3897e29ef09/sensors-15-09228f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/67d820c447ec/sensors-15-09228f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/59d1419401ca/sensors-15-09228f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/266533ab2aba/sensors-15-09228f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/b98cd4616720/sensors-15-09228f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/7b9ab93900dc/sensors-15-09228f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/a99ea425aad6/sensors-15-09228f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/e344db1f4386/sensors-15-09228f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/a2183e6f96fa/sensors-15-09228f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/e8b23a653ff2/sensors-15-09228f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/6c4fcf088a7d/sensors-15-09228f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/ef23a9b44164/sensors-15-09228f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/d3897e29ef09/sensors-15-09228f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/67d820c447ec/sensors-15-09228f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/59d1419401ca/sensors-15-09228f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/266533ab2aba/sensors-15-09228f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/b98cd4616720/sensors-15-09228f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/7b9ab93900dc/sensors-15-09228f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/a99ea425aad6/sensors-15-09228f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/e344db1f4386/sensors-15-09228f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/a2183e6f96fa/sensors-15-09228f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/e8b23a653ff2/sensors-15-09228f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/6c4fcf088a7d/sensors-15-09228f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/ef23a9b44164/sensors-15-09228f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe9/4431302/d3897e29ef09/sensors-15-09228f12.jpg

相似文献

1
Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes.基于KITTI城市场景中3D感知特征的视觉目标识别
Sensors (Basel). 2015 Apr 20;15(4):9228-50. doi: 10.3390/s150409228.
2
3D Object Proposals Using Stereo Imagery for Accurate Object Class Detection.基于立体图像的 3D 对象建议,用于准确的目标类别检测。
IEEE Trans Pattern Anal Mach Intell. 2018 May;40(5):1259-1272. doi: 10.1109/TPAMI.2017.2706685. Epub 2017 May 19.
3
3D Object Detection with SLS-Fusion Network in Foggy Weather Conditions.雾天条件下基于SLS融合网络的3D目标检测
Sensors (Basel). 2021 Oct 9;21(20):6711. doi: 10.3390/s21206711.
4
3D Traffic Scene Understanding From Movable Platforms.基于移动平台的三维交通场景理解
IEEE Trans Pattern Anal Mach Intell. 2014 May;36(5):1012-25. doi: 10.1109/TPAMI.2013.185.
5
Road and Railway Smart Mobility: A High-Definition Ground Truth Hybrid Dataset.道路与铁路智能交通:高清地面实况混合数据集。
Sensors (Basel). 2022 May 22;22(10):3922. doi: 10.3390/s22103922.
6
Evaluation of 3D Vulnerable Objects' Detection Using a Multi-Sensors System for Autonomous Vehicles.基于多传感器系统的自动驾驶车辆三维易损物检测评估。
Sensors (Basel). 2022 Feb 21;22(4):1663. doi: 10.3390/s22041663.
7
BAFusion: Bidirectional Attention Fusion for 3D Object Detection Based on LiDAR and Camera.BAFusion:基于激光雷达和摄像头的用于3D目标检测的双向注意力融合
Sensors (Basel). 2024 Jul 20;24(14):4718. doi: 10.3390/s24144718.
8
Multi-view and 3D deformable part models.多视图和 3D 可变形部件模型。
IEEE Trans Pattern Anal Mach Intell. 2015 Nov;37(11):2232-45. doi: 10.1109/TPAMI.2015.2408347.
9
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D.KITTI-360:用于二维和三维城市场景理解的新型数据集和基准
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3292-3310. doi: 10.1109/TPAMI.2022.3179507. Epub 2023 Feb 3.
10
A New 3D Object Pose Detection Method Using LIDAR Shape Set.一种使用激光雷达形状集的新型三维物体姿态检测方法。
Sensors (Basel). 2018 Mar 16;18(3):882. doi: 10.3390/s18030882.

引用本文的文献

1
Positioning, Navigation, and Book Accessing/Returning in an Autonomous Library Robot using Integrated Binocular Vision and QR Code Identification Systems.使用集成双目视觉和 QR 码识别系统的自主图书馆机器人的定位、导航和图书存取/归还。
Sensors (Basel). 2019 Feb 14;19(4):783. doi: 10.3390/s19040783.
2
Self-Adaption Matched Filter and Bi-Directional Difference Method for Moving Target Detection.自适匹配滤波器和双向差分法用于动目标检测。
Sensors (Basel). 2018 Sep 20;18(10):3177. doi: 10.3390/s18103177.
3
Object Detection and Classification by Decision-Level Fusion for Intelligent Vehicle Systems.

本文引用的文献

1
3D Traffic Scene Understanding From Movable Platforms.基于移动平台的三维交通场景理解
IEEE Trans Pattern Anal Mach Intell. 2014 May;36(5):1012-25. doi: 10.1109/TPAMI.2013.185.
2
Monocular visual scene understanding: understanding multi-object traffic scenes.单目视觉场景理解:理解多目标交通场景。
IEEE Trans Pattern Anal Mach Intell. 2013 Apr;35(4):882-97. doi: 10.1109/TPAMI.2012.174.
3
Pedestrian detection: an evaluation of the state of the art.行人检测:现状评估。
用于智能车辆系统的基于决策级融合的目标检测与分类
Sensors (Basel). 2017 Jan 22;17(1):207. doi: 10.3390/s17010207.
4
Vision-Based Steering Control, Speed Assistance and Localization for Inner-City Vehicles.基于视觉的城市车辆转向控制、速度辅助与定位
Sensors (Basel). 2016 Mar 11;16(3):362. doi: 10.3390/s16030362.
5
Reliable Fusion of Stereo Matching and Depth Sensor for High Quality Dense Depth Maps.用于高质量密集深度图的立体匹配与深度传感器的可靠融合
Sensors (Basel). 2015 Aug 21;15(8):20894-924. doi: 10.3390/s150820894.
IEEE Trans Pattern Anal Mach Intell. 2012 Apr;34(4):743-61. doi: 10.1109/TPAMI.2011.155.
4
Object detection with discriminatively trained part-based models.基于判别式训练的部件模型的目标检测。
IEEE Trans Pattern Anal Mach Intell. 2010 Sep;32(9):1627-45. doi: 10.1109/TPAMI.2009.167.
5
Stereo processing by semiglobal matching and mutual information.通过半全局匹配和互信息进行立体处理。
IEEE Trans Pattern Anal Mach Intell. 2008 Feb;30(2):328-41. doi: 10.1109/TPAMI.2007.1166.