• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

以视觉为中心的鸟瞰图感知:综述

Vision-Centric BEV Perception: A Survey.

作者信息

Ma Yuexin, Wang Tai, Bai Xuyang, Yang Huitong, Hou Yuenan, Wang Yaming, Qiao Yu, Yang Ruigang, Zhu Xinge

出版信息

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):10978-10997. doi: 10.1109/TPAMI.2024.3449912. Epub 2024 Nov 6.

DOI:10.1109/TPAMI.2024.3449912
PMID:39250358
Abstract

In recent years, vision-centric Bird's Eye View (BEV) perception has garnered significant interest from both industry and academia due to its inherent advantages, such as providing an intuitive representation of the world and being conducive to data fusion. The rapid advancements in deep learning have led to the proposal of numerous methods for addressing vision-centric BEV perception challenges. However, there has been no recent survey encompassing this novel and burgeoning research field. To catalyze future research, this paper presents a comprehensive survey of the latest developments in vision-centric BEV perception and its extensions. It compiles and organizes up-to-date knowledge, offering a systematic review and summary of prevalent algorithms. Additionally, the paper provides in-depth analyses and comparative results on various BEV perception tasks, facilitating the evaluation of future works and sparking new research directions. Furthermore, the paper discusses and shares valuable empirical implementation details to aid in the advancement of related algorithms.

摘要

近年来,以视觉为中心的鸟瞰视图(BEV)感知因其固有的优势,如提供直观的世界表示并有利于数据融合,而受到了工业界和学术界的广泛关注。深度学习的快速发展催生了众多应对以视觉为中心的BEV感知挑战的方法。然而,最近还没有关于这个新颖且蓬勃发展的研究领域的综述。为了推动未来的研究,本文对以视觉为中心的BEV感知及其扩展的最新进展进行了全面综述。它汇编并整理了最新知识,对流行算法进行了系统回顾和总结。此外,本文还对各种BEV感知任务进行了深入分析和比较结果,便于对未来工作进行评估并激发新的研究方向。此外,本文还讨论并分享了有价值的经验性实现细节,以帮助推进相关算法。

相似文献

1
Vision-Centric BEV Perception: A Survey.以视觉为中心的鸟瞰图感知:综述
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):10978-10997. doi: 10.1109/TPAMI.2024.3449912. Epub 2024 Nov 6.
2
Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe.深入探究鸟瞰视角感知的难题:一篇综述、评估与方法
IEEE Trans Pattern Anal Mach Intell. 2024 Apr;46(4):2151-2170. doi: 10.1109/TPAMI.2023.3333838. Epub 2024 Mar 6.
3
Robust BEV 3D Object Detection for Vehicles with Tire Blow-Out.用于爆胎车辆的鲁棒性鸟瞰图3D目标检测
Sensors (Basel). 2024 Jul 9;24(14):4446. doi: 10.3390/s24144446.
4
Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline.Fast-BEV:一种快速且强大的鸟瞰视角感知基线。
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):8665-8679. doi: 10.1109/TPAMI.2024.3414835. Epub 2024 Nov 6.
5
HeightFormer: Explicit Height Modeling Without Extra Data for Camera-Only 3D Object Detection in Bird's Eye View.HeightFormer:无需额外数据的显式高度建模,用于鸟瞰视角下仅基于相机的3D目标检测
IEEE Trans Image Process. 2025;34:689-700. doi: 10.1109/TIP.2024.3427701. Epub 2025 Jan 28.
6
Crop-saving with AI: latest trends in deep learning techniques for plant pathology.利用人工智能拯救作物:植物病理学深度学习技术的最新趋势
Front Plant Sci. 2023 Aug 1;14:1224709. doi: 10.3389/fpls.2023.1224709. eCollection 2023.
7
3D Object Detection From Images for Autonomous Driving: A Survey.用于自动驾驶的基于图像的3D目标检测:综述
IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3537-3556. doi: 10.1109/TPAMI.2023.3346386. Epub 2024 Apr 3.
8
Accurate 3D to 2D Object Distance Estimation from the Mapped Point Cloud Data.从映射点云数据中准确估计 3D 到 2D 的物体距离。
Sensors (Basel). 2023 Feb 13;23(4):2103. doi: 10.3390/s23042103.
9
Fast vehicle detection based on colored point cloud with bird's eye view representation.基于鸟瞰彩色点云的快速车辆检测。
Sci Rep. 2023 May 8;13(1):7447. doi: 10.1038/s41598-023-34479-z.
10
Knowledge-Augmented Deep Learning and its Applications: A Survey.知识增强深度学习及其应用:一项综述。
IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):2133-2153. doi: 10.1109/TNNLS.2023.3338619. Epub 2025 Feb 6.

引用本文的文献

1
LGMMFusion: A LiDAR-guided multi-modal fusion framework for enhanced 3D object detection.LGMMFusion:一种用于增强三维目标检测的激光雷达引导多模态融合框架。
PLoS One. 2025 Sep 4;20(9):e0331195. doi: 10.1371/journal.pone.0331195. eCollection 2025.
2
KPMapNet: Keypoint Representation Learning for Online Vectorized High-Definition Map Construction.KPMapNet:用于在线矢量化高清地图构建的关键点表示学习
Sensors (Basel). 2025 Mar 18;25(6):1897. doi: 10.3390/s25061897.
3
A Comparative Study and Optimization of Camera-Based BEV Segmentation for Real-Time Autonomous Driving.
用于实时自动驾驶的基于摄像头的鸟瞰图分割的比较研究与优化
Sensors (Basel). 2025 Apr 4;25(7):2300. doi: 10.3390/s25072300.
4
Combining OpenStreetMap with Satellite Imagery to Enhance Cross-View Geo-Localization.结合开放街道地图与卫星图像以增强跨视图地理定位
Sensors (Basel). 2024 Dec 25;25(1):44. doi: 10.3390/s25010044.