• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基础模型在推动自动驾驶汽车发展中的前瞻性作用。

Prospective Role of Foundation Models in Advancing Autonomous Vehicles.

作者信息

Wu Jianhua, Gao Bingzhao, Gao Jincheng, Yu Jianhao, Chu Hongqing, Yu Qiankun, Gong Xun, Chang Yi, Tseng H Eric, Chen Hong, Chen Jie

机构信息

School of Automotive Studies, Tongji University, Shanghai 201804, China.

Frontiers Science Center for Intelligent Autonomous Systems, Tongji University, Shanghai 201210, China.

出版信息

Research (Wash D C). 2024 Jul 16;7:0399. doi: 10.34133/research.0399. eCollection 2024.

DOI:10.34133/research.0399
PMID:39015204
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11249913/
Abstract

With the development of artificial intelligence and breakthroughs in deep learning, large-scale foundation models (FMs), such as generative pre-trained transformer (GPT), Sora, etc., have achieved remarkable results in many fields including natural language processing and computer vision. The application of FMs in autonomous driving holds considerable promise. For example, they can contribute to enhancing scene understanding and reasoning. By pre-training on rich linguistic and visual data, FMs can understand and interpret various elements in a driving scene, and provide cognitive reasoning to give linguistic and action instructions for driving decisions and planning. Furthermore, FMs can augment data based on the understanding of driving scenarios to provide feasible scenes of those rare occurrences in the long tail distribution that are unlikely to be encountered during routine driving and data collection. The enhancement can subsequently lead to improvement in the accuracy and reliability of autonomous driving systems. Another testament to the potential of FMs' applications lies in world models, exemplified by the DREAMER series, which showcases the ability to comprehend physical laws and dynamics. Learning from massive data under the paradigm of self-supervised learning, world models can generate unseen yet plausible driving environments, facilitating the enhancement in the prediction of road users' behaviors and the off-line training of driving strategies. In this paper, we synthesize the applications and future trends of FMs in autonomous driving. By utilizing the powerful capabilities of FMs, we strive to tackle the potential issues stemming from the long-tail distribution in autonomous driving, consequently advancing overall safety in this domain.

摘要

随着人工智能的发展以及深度学习的突破,大规模基础模型(FMs),如生成式预训练变换器(GPT)、Sora等,在自然语言处理和计算机视觉等许多领域都取得了显著成果。FMs在自动驾驶中的应用前景广阔。例如,它们有助于增强场景理解和推理能力。通过在丰富的语言和视觉数据上进行预训练,FMs能够理解和解释驾驶场景中的各种元素,并提供认知推理,为驾驶决策和规划给出语言和行动指令。此外,FMs可以基于对驾驶场景的理解来扩充数据,以提供常规驾驶和数据收集过程中不太可能遇到的长尾分布中那些罕见情况的可行场景。这种增强随后可以提高自动驾驶系统的准确性和可靠性。FMs应用潜力的另一个证明在于世界模型,以DREAMER系列为例,它展示了理解物理定律和动力学的能力。在自监督学习范式下从海量数据中学习,世界模型可以生成未见但合理的驾驶环境,有助于增强对道路使用者行为的预测以及驾驶策略的离线训练。在本文中,我们综合了FMs在自动驾驶中的应用和未来趋势。通过利用FMs的强大功能,我们努力解决自动驾驶中长尾分布带来的潜在问题,从而提高该领域的整体安全性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/57f9ab7d3897/research.0399.fig.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/343312e2d99a/research.0399.fig.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/60c0fed2ee75/research.0399.fig.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/85f28ccacfd7/research.0399.fig.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/2ca321c98a4b/research.0399.fig.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/30f798e287ca/research.0399.fig.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/6008dda0d9d0/research.0399.fig.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/ffdb9ce1da83/research.0399.fig.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/57f9ab7d3897/research.0399.fig.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/343312e2d99a/research.0399.fig.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/60c0fed2ee75/research.0399.fig.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/85f28ccacfd7/research.0399.fig.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/2ca321c98a4b/research.0399.fig.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/30f798e287ca/research.0399.fig.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/6008dda0d9d0/research.0399.fig.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/ffdb9ce1da83/research.0399.fig.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7933/11249913/57f9ab7d3897/research.0399.fig.008.jpg

相似文献

1
Prospective Role of Foundation Models in Advancing Autonomous Vehicles.基础模型在推动自动驾驶汽车发展中的前瞻性作用。
Research (Wash D C). 2024 Jul 16;7:0399. doi: 10.34133/research.0399. eCollection 2024.
2
Trajectory planning framework for autonomous vehicles based on collision injury prediction for vulnerable road users.基于对弱势道路使用者碰撞伤害预测的自动驾驶车辆轨迹规划框架。
Accid Anal Prev. 2024 Aug;203:107610. doi: 10.1016/j.aap.2024.107610. Epub 2024 May 14.
3
Explainable AI in Scene Understanding for Autonomous Vehicles in Unstructured Traffic Environments on Indian Roads Using the Inception U-Net Model with Grad-CAM Visualization.使用带有 Grad-CAM 可视化的 Inception U-Net 模型,在印度道路的非结构化交通环境中,解释自动驾驶车辆场景理解中的人工智能。
Sensors (Basel). 2022 Dec 10;22(24):9677. doi: 10.3390/s22249677.
4
Lightweight Object Detection Ensemble Framework for Autonomous Vehicles in Challenging Weather Conditions.轻量级目标检测集成框架,用于在挑战性天气条件下的自动驾驶车辆。
Comput Intell Neurosci. 2021 Oct 7;2021:5278820. doi: 10.1155/2021/5278820. eCollection 2021.
5
Semantic Segmentation with Transfer Learning for Off-Road Autonomous Driving.用于越野自动驾驶的基于迁移学习的语义分割
Sensors (Basel). 2019 Jun 6;19(11):2577. doi: 10.3390/s19112577.
6
End-to-End Automated Lane-Change Maneuvering Considering Driving Style Using a Deep Deterministic Policy Gradient Algorithm.基于深度确定性策略梯度算法的考虑驾驶风格的端到端自动变道行驶。
Sensors (Basel). 2020 Sep 22;20(18):5443. doi: 10.3390/s20185443.
7
Implementation of a Potential Field-Based Decision-Making Algorithm on Autonomous Vehicles for Driving in Complex Environments.一种基于势场的决策算法在复杂环境中自动驾驶车辆上的实现
Sensors (Basel). 2019 Jul 28;19(15):3318. doi: 10.3390/s19153318.
8
Law compliance decision making for autonomous vehicles on highways.高速公路自动驾驶汽车的法律合规决策。
Accid Anal Prev. 2024 Sep;204:107620. doi: 10.1016/j.aap.2024.107620. Epub 2024 May 31.
9
DriveLLaVA: Human-Level Behavior Decisions via Vision Language Model.DriveLLaVA:通过视觉语言模型实现人类水平的行为决策。
Sensors (Basel). 2024 Jun 25;24(13):4113. doi: 10.3390/s24134113.
10
Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving.基于恐惧神经的强化学习在自动驾驶中的安全应用。
IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):267-279. doi: 10.1109/TPAMI.2023.3322426. Epub 2023 Dec 5.

本文引用的文献

1
Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects.用于时间序列分析的自监督学习:分类、进展与展望
IEEE Trans Pattern Anal Mach Intell. 2024 Oct;46(10):6775-6794. doi: 10.1109/TPAMI.2024.3387317. Epub 2024 Sep 6.
2
Diffusion Probabilistic Modeling for Video Generation.用于视频生成的扩散概率建模
Entropy (Basel). 2023 Oct 20;25(10):1469. doi: 10.3390/e25101469.
3
Advanced Reinforcement Learning and Its Connections with Brain Neuroscience.深度强化学习及其与大脑神经科学的联系
Research (Wash D C). 2023;6:0064. doi: 10.34133/research.0064. Epub 2023 Mar 15.
4
TransFuser: Imitation With Transformer-Based Sensor Fusion for Autonomous Driving.TransFuser:基于Transformer的传感器融合在自动驾驶中的模仿应用
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):12878-12895. doi: 10.1109/TPAMI.2022.3200245. Epub 2023 Oct 3.
5
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning.MetaDrive:为可泛化强化学习合成多样化驾驶场景
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3461-3475. doi: 10.1109/TPAMI.2022.3190471. Epub 2023 Feb 3.
6
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D.KITTI-360:用于二维和三维城市场景理解的新型数据集和基准
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3292-3310. doi: 10.1109/TPAMI.2022.3179507. Epub 2023 Feb 3.
7
Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey.基于深度神经网络的自监督视觉特征学习:综述
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):4037-4058. doi: 10.1109/TPAMI.2020.2992393. Epub 2021 Oct 1.
8
Quantizing Euclidean Motions via Double-Coset Decomposition.通过双陪集分解对欧几里得运动进行量化。
Research (Wash D C). 2019 Sep 15;2019:1608396. doi: 10.34133/2019/1608396. eCollection 2019.
9
Congested traffic states in empirical observations and microscopic simulations.实证观察和微观模拟中的拥堵交通状态。
Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics. 2000 Aug;62(2 Pt A):1805-24. doi: 10.1103/physreve.62.1805.