• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

阿波罗景观开放数据集在自动驾驶中的应用

The ApolloScape Open Dataset for Autonomous Driving and Its Application.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2702-2719. doi: 10.1109/TPAMI.2019.2926463. Epub 2019 Jul 2.

DOI:10.1109/TPAMI.2019.2926463
PMID:31283496
Abstract

Autonomous driving has attracted tremendous attention especially in the past few years. The key techniques for a self-driving car include solving tasks like 3D map construction, self-localization, parsing the driving road and understanding objects, which enable vehicles to reason and act. However, large scale data set for training and system evaluation is still a bottleneck for developing robust perception models. In this paper, we present the ApolloScape dataset [1] and its applications for autonomous driving. Compared with existing public datasets from real scenes, e.g., KITTI [2] or Cityscapes [3] , ApolloScape contains much large and richer labelling including holistic semantic dense point cloud for each site, stereo, per-pixel semantic labelling, lanemark labelling, instance segmentation, 3D car instance, high accurate location for every frame in various driving videos from multiple sites, cities and daytimes. For each task, it contains at lease 15x larger amount of images than SOTA datasets. To label such a complete dataset, we develop various tools and algorithms specified for each task to accelerate the labelling process, such as joint 3D-2D segment labeling, active labelling in videos etc. Depend on ApolloScape, we are able to develop algorithms jointly consider the learning and inference of multiple tasks. In this paper, we provide a sensor fusion scheme integrating camera videos, consumer-grade motion sensors (GPS/IMU), and a 3D semantic map in order to achieve robust self-localization and semantic segmentation for autonomous driving. We show that practically, sensor fusion and joint learning of multiple tasks are beneficial to achieve a more robust and accurate system. We expect our dataset and proposed relevant algorithms can support and motivate researchers for further development of multi-sensor fusion and multi-task learning in the field of computer vision.

摘要

自动驾驶技术在过去几年中引起了广泛关注。自动驾驶汽车的关键技术包括解决 3D 地图构建、自定位、解析驾驶道路和理解目标等任务,这些任务使车辆能够进行推理和行动。然而,用于训练和系统评估的大规模数据集仍然是开发鲁棒感知模型的瓶颈。在本文中,我们提出了 ApolloScape 数据集[1]及其在自动驾驶中的应用。与现有的真实场景公共数据集相比,例如 KITTI[2]或 Cityscapes[3],ApolloScape 包含更多、更丰富的标注,包括每个地点的整体语义密集点云、立体图像、逐像素语义标注、车道标记、实例分割、3D 车辆实例、来自多个地点、城市和不同时段的各种驾驶视频中的每一帧的高精度位置。对于每个任务,它包含的图像数量至少是 SOTA 数据集的 15 倍。为了标注这样一个完整的数据集,我们开发了针对每个任务的各种工具和算法来加速标注过程,例如联合 3D-2D 分割标注、视频中的主动标注等。基于 ApolloScape,我们能够开发联合学习多个任务的算法。在本文中,我们提供了一种传感器融合方案,该方案集成了摄像机视频、消费级运动传感器(GPS/IMU)和 3D 语义图,以实现自动驾驶的鲁棒自定位和语义分割。我们表明,实际上,传感器融合和多任务联合学习有助于实现更鲁棒和准确的系统。我们希望我们的数据集和提出的相关算法能够支持和激励研究人员进一步开发计算机视觉领域的多传感器融合和多任务学习。

相似文献

1
The ApolloScape Open Dataset for Autonomous Driving and Its Application.阿波罗景观开放数据集在自动驾驶中的应用
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2702-2719. doi: 10.1109/TPAMI.2019.2926463. Epub 2019 Jul 2.
2
Improving Semantic Segmentation of Urban Scenes for Self-Driving Cars with Synthetic Images.利用合成图像提高自动驾驶汽车的城市场景语义分割。
Sensors (Basel). 2022 Mar 14;22(6):2252. doi: 10.3390/s22062252.
3
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D.KITTI-360:用于二维和三维城市场景理解的新型数据集和基准
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3292-3310. doi: 10.1109/TPAMI.2022.3179507. Epub 2023 Feb 3.
4
Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud.基于迁移学习的点云三维目标检测语义分割。
Sensors (Basel). 2021 Jun 8;21(12):3964. doi: 10.3390/s21123964.
5
Lane and Road Marker Semantic Video Segmentation Using Mask Cropping and Optical Flow Estimation.基于掩模裁剪和光流估计的车道和路牌语义视频分割。
Sensors (Basel). 2021 Oct 28;21(21):7156. doi: 10.3390/s21217156.
6
3D Object Detection with SLS-Fusion Network in Foggy Weather Conditions.雾天条件下基于SLS融合网络的3D目标检测
Sensors (Basel). 2021 Oct 9;21(20):6711. doi: 10.3390/s21206711.
7
Semantic segmentation of autonomous driving scenes based on multi-scale adaptive attention mechanism.基于多尺度自适应注意力机制的自动驾驶场景语义分割
Front Neurosci. 2023 Oct 19;17:1291674. doi: 10.3389/fnins.2023.1291674. eCollection 2023.
8
Implementation of a Lightweight Semantic Segmentation Algorithm in Road Obstacle Detection.在道路障碍物检测中实现轻量级语义分割算法。
Sensors (Basel). 2020 Dec 10;20(24):7089. doi: 10.3390/s20247089.
9
Real-time depth completion based on LiDAR-stereo for autonomous driving.基于激光雷达-立体视觉的自动驾驶实时深度补全
Front Neurorobot. 2023 Apr 18;17:1124676. doi: 10.3389/fnbot.2023.1124676. eCollection 2023.
10
Enhanced Perception for Autonomous Driving Using Semantic and Geometric Data Fusion.利用语义和几何数据融合实现自动驾驶的增强感知。
Sensors (Basel). 2022 Jul 5;22(13):5061. doi: 10.3390/s22135061.

引用本文的文献

1
A Survey of Deep Learning-Based 3D Object Detection Methods for Autonomous Driving Across Different Sensor Modalities.基于深度学习的跨不同传感器模态的自动驾驶3D目标检测方法综述。
Sensors (Basel). 2025 Aug 24;25(17):5264. doi: 10.3390/s25175264.
2
A lightweight algorithm for synchronized multimodal data acquisition using temporal sample alignment.一种使用时间样本对齐的同步多模态数据采集的轻量级算法。
Sci Rep. 2025 Jul 2;15(1):22711. doi: 10.1038/s41598-025-07797-7.
3
TFF-Net: A Feature Fusion Graph Neural Network-Based Vehicle Type Recognition Approach for Low-Light Conditions.
TFF-Net:一种基于特征融合图神经网络的低光照条件下车辆类型识别方法
Sensors (Basel). 2025 Jun 9;25(12):3613. doi: 10.3390/s25123613.
4
A Survey of the Multi-Sensor Fusion Object Detection Task in Autonomous Driving.自动驾驶中多传感器融合目标检测任务的综述
Sensors (Basel). 2025 Apr 29;25(9):2794. doi: 10.3390/s25092794.
5
Evaluation of Semantic Segmentation Performance for a Multimodal Roadside Vehicle Detection System on the Edge.边缘多模态路边车辆检测系统的语义分割性能评估
Sensors (Basel). 2025 Jan 10;25(2):370. doi: 10.3390/s25020370.
6
Three-Dimensional Dense Reconstruction: A Review of Algorithms and Datasets.三维密集重建:算法与数据集综述
Sensors (Basel). 2024 Sep 10;24(18):5861. doi: 10.3390/s24185861.
7
Reducing Training Data Using Pre-Trained Foundation Models: A Case Study on Traffic Sign Segmentation Using the Segment Anything Model.使用预训练基础模型减少训练数据:以使用“分割一切模型”进行交通标志分割为例
J Imaging. 2024 Sep 7;10(9):220. doi: 10.3390/jimaging10090220.
8
STRIDE: Street View-based Environmental Feature Detection and Pedestrian Collision Prediction.STRIDE:基于街景的环境特征检测与行人碰撞预测
IEEE Int Conf Comput Vis Workshops. 2023 Oct;2023:3222-3234. doi: 10.1109/iccvw60793.2023.00347. Epub 2023 Dec 25.
9
A road surface reconstruction dataset for autonomous driving.面向自动驾驶的道路表面重建数据集。
Sci Data. 2024 May 6;11(1):459. doi: 10.1038/s41597-024-03261-9.
10
Object Detection in Adverse Weather for Autonomous Driving through Data Merging and YOLOv8.通过数据融合和YOLOv8实现自动驾驶在恶劣天气下的目标检测
Sensors (Basel). 2023 Oct 14;23(20):8471. doi: 10.3390/s23208471.