Suppr超能文献

场景占用与重建:用于非结构化场景理解的综合数据集

Scene as Occupancy and Reconstruction: A Comprehensive Dataset for Unstructured Scene Understanding.

作者信息

Chen Long, Song Ruiqi, Wu Hangbin, Ding Baiyong, Li Lingxi, Wang Fei-Yue

机构信息

Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China.

Waytous, Beijing, 100183, China.

出版信息

Sci Data. 2025 Jul 15;12(1):1232. doi: 10.1038/s41597-025-05532-5.

Abstract

As autonomous driving technology steps into the phase of large-scale commercialization, safety and comfort have become key indicators for measuring its performance. Currently, some studies have begun to focus on improving the safety and comfort of urban driving by paying attention to irregular surface regions. However, datasets and studies for unstructured scenes, which are characterized by numerous irregular obstacles and road surface undulations, remain exceedingly rare. To expand the scope of autonomous driving applications, a perception dataset, which focuses on irregular obstacles and road surface vibrations in unstructured scenes, has been built. It takes into consideration the fact that the detection of various irregular obstacles in unstructured scenes plays a key role in trajectory planning, while the recognition of undulating road surface conditions in these scenes is crucial for speed planning. Therefore, we investigate unstructured scene understanding through 3D semantic occupancy prediction, which is used to detect irregular obstacles in unstructured scenes, and road surface elevation reconstruction, which characterizes the bumpy and uneven conditions of road surfaces. The dataset provides detailed annotations for 3D semantic occupancy prediction and road surface elevation reconstruction, offering a comprehensive representation of unstructured scenes. In addition, trajectory and speed planning information is provided to explore the relationship between perception and planning in unstructured scenes. Natural language descriptions of scenes are also provided to explore the interpretability of autonomous driving decision-making. Experiments have been conducted with various state-of-the-art methods to demonstrate the effectiveness of our dataset and the challenges posed by these tasks. To the best of our knowledge, this is the world's first comprehensive benchmark for perception in unstructured scenes, which serves as a valuable resource for extending autonomous driving technology from urban to unstructured scenes.

摘要

随着自动驾驶技术步入大规模商业化阶段,安全性和舒适性已成为衡量其性能的关键指标。目前,一些研究已开始通过关注不规则路面区域来致力于提高城市驾驶的安全性和舒适性。然而,针对以大量不规则障碍物和路面起伏为特征的非结构化场景的数据集和研究仍然极为稀少。为了扩大自动驾驶应用的范围,已构建了一个专注于非结构化场景中不规则障碍物和路面振动的感知数据集。该数据集考虑到在非结构化场景中检测各种不规则障碍物在轨迹规划中起着关键作用,而识别这些场景中起伏的路面状况对于速度规划至关重要。因此,我们通过用于检测非结构化场景中不规则障碍物的3D语义占用预测以及表征路面颠簸不平状况的路面高程重建来研究非结构化场景理解。该数据集为3D语义占用预测和路面高程重建提供了详细注释,全面呈现了非结构化场景。此外,还提供了轨迹和速度规划信息,以探索非结构化场景中感知与规划之间的关系。同时还提供了场景的自然语言描述,以探索自动驾驶决策的可解释性。已使用各种先进方法进行了实验,以证明我们数据集的有效性以及这些任务所带来的挑战。据我们所知,这是世界上首个针对非结构化场景感知的综合基准,它是将自动驾驶技术从城市扩展到非结构化场景的宝贵资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a98b/12264051/d0c12d7678e8/41597_2025_5532_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验