• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多任务学习的自动驾驶车辆道路场景理解研究。

Research on Road Scene Understanding of Autonomous Vehicles Based on Multi-Task Learning.

机构信息

Department of Mechanical and Electrical Engineering, Xiamen University, Xiamen 361005, China.

Department of Automation, Xiamen University, Xiamen 361005, China.

出版信息

Sensors (Basel). 2023 Jul 7;23(13):6238. doi: 10.3390/s23136238.

DOI:10.3390/s23136238
PMID:37448087
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10346996/
Abstract

Road scene understanding is crucial to the safe driving of autonomous vehicles. Comprehensive road scene understanding requires a visual perception system to deal with a large number of tasks at the same time, which needs a perception model with a small size, fast speed, and high accuracy. As multi-task learning has evident advantages in performance and computational resources, in this paper, a multi-task model YOLO-Object, Drivable Area, and Lane Line Detection (YOLO-ODL) based on hard parameter sharing is proposed to realize joint and efficient detection of traffic objects, drivable areas, and lane lines. In order to balance tasks of YOLO-ODL, a weight balancing strategy is introduced so that the weight parameters of the model can be automatically adjusted during training, and a Mosaic migration optimization scheme is adopted to improve the evaluation indicators of the model. Our YOLO-ODL model performs well on the challenging BDD100K dataset, achieving the state of the art in terms of accuracy and computational efficiency.

摘要

道路场景理解对自动驾驶汽车的安全行驶至关重要。全面的道路场景理解需要一个视觉感知系统来同时处理大量任务,这需要一个具有小尺寸、快速和高精度的感知模型。由于多任务学习在性能和计算资源方面具有明显的优势,因此本文提出了一种基于硬参数共享的多任务模型 YOLO-Object、Drivable Area 和 Lane Line Detection(YOLO-ODL),以实现交通对象、可行驶区域和车道线的联合高效检测。为了平衡 YOLO-ODL 的任务,引入了一种权重平衡策略,以便在训练过程中自动调整模型的权重参数,并采用 Mosaic 迁移优化方案来提高模型的评估指标。我们的 YOLO-ODL 模型在具有挑战性的 BDD100K 数据集上表现出色,在准确性和计算效率方面达到了最新水平。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/99147f162f3e/sensors-23-06238-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/d6877c8466d9/sensors-23-06238-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/f9c4ef1fa93e/sensors-23-06238-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/7fc8f48eca38/sensors-23-06238-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/a05e08b1b6a0/sensors-23-06238-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/e11e09663295/sensors-23-06238-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/fa531d5e4ecb/sensors-23-06238-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/71b571a93ade/sensors-23-06238-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/b6c82372cbd0/sensors-23-06238-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/99147f162f3e/sensors-23-06238-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/d6877c8466d9/sensors-23-06238-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/f9c4ef1fa93e/sensors-23-06238-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/7fc8f48eca38/sensors-23-06238-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/a05e08b1b6a0/sensors-23-06238-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/e11e09663295/sensors-23-06238-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/fa531d5e4ecb/sensors-23-06238-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/71b571a93ade/sensors-23-06238-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/b6c82372cbd0/sensors-23-06238-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fae/10346996/99147f162f3e/sensors-23-06238-g009.jpg

相似文献

1
Research on Road Scene Understanding of Autonomous Vehicles Based on Multi-Task Learning.基于多任务学习的自动驾驶车辆道路场景理解研究。
Sensors (Basel). 2023 Jul 7;23(13):6238. doi: 10.3390/s23136238.
2
Multi-Task Environmental Perception Methods for Autonomous Driving.用于自动驾驶的多任务环境感知方法
Sensors (Basel). 2024 Aug 28;24(17):5552. doi: 10.3390/s24175552.
3
A Multi-Task Road Feature Extraction Network with Grouped Convolution and Attention Mechanisms.一种具有分组卷积和注意力机制的多任务道路特征提取网络。
Sensors (Basel). 2023 Sep 30;23(19):8182. doi: 10.3390/s23198182.
4
Mobip: a lightweight model for driving perception using MobileNet.Mobip:一种使用MobileNet进行驾驶感知的轻量级模型。
Front Neurorobot. 2023 Dec 4;17:1291875. doi: 10.3389/fnbot.2023.1291875. eCollection 2023.
5
A Multi-Task Network Based on Dual-Neck Structure for Autonomous Driving Perception.一种基于双颈部结构的多任务网络用于自动驾驶感知。
Sensors (Basel). 2024 Feb 28;24(5):1547. doi: 10.3390/s24051547.
6
A panoramic driving perception fusion algorithm based on multi-task learning.基于多任务学习的全景驾驶感知融合算法。
PLoS One. 2024 Jun 4;19(6):e0304691. doi: 10.1371/journal.pone.0304691. eCollection 2024.
7
Robust Drivable Road Region Detection for Fixed-Route Autonomous Vehicles Using Map-Fusion Images.基于地图融合图像的固定路线自动驾驶车辆鲁棒可行驶区域检测
Sensors (Basel). 2018 Nov 27;18(12):4158. doi: 10.3390/s18124158.
8
Optimal Configuration of Multi-Task Learning for Autonomous Driving.用于自动驾驶的多任务学习的最优配置
Sensors (Basel). 2023 Dec 9;23(24):9729. doi: 10.3390/s23249729.
9
Analysis of Lane-Changing Decision-Making Behavior of Autonomous Vehicles Based on Molecular Dynamics.基于分子动力学的自动驾驶车辆变道决策行为分析。
Sensors (Basel). 2022 Oct 12;22(20):7748. doi: 10.3390/s22207748.
10
ASA-BiSeNet: improved real-time approach for road lane semantic segmentation of low-light autonomous driving road scenes.ASA-双向分割网络:用于低光照自动驾驶道路场景车道语义分割的改进实时方法。
Appl Opt. 2023 Jul 1;62(19):5224-5235. doi: 10.1364/AO.486302.

引用本文的文献

1
Multi-Task Environmental Perception Methods for Autonomous Driving.用于自动驾驶的多任务环境感知方法
Sensors (Basel). 2024 Aug 28;24(17):5552. doi: 10.3390/s24175552.
2
A panoramic driving perception fusion algorithm based on multi-task learning.基于多任务学习的全景驾驶感知融合算法。
PLoS One. 2024 Jun 4;19(6):e0304691. doi: 10.1371/journal.pone.0304691. eCollection 2024.
3
Optimal Configuration of Multi-Task Learning for Autonomous Driving.用于自动驾驶的多任务学习的最优配置

本文引用的文献

1
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
2
On-road vehicle detection: a review.道路车辆检测:综述
IEEE Trans Pattern Anal Mach Intell. 2006 May;28(5):694-711. doi: 10.1109/TPAMI.2006.104.
Sensors (Basel). 2023 Dec 9;23(24):9729. doi: 10.3390/s23249729.
4
Multi-object detection for crowded road scene based on ML-AFP of YOLOv5.基于YOLOv5的ML-AFP的拥挤道路场景多目标检测
Sci Rep. 2023 Oct 12;13(1):17310. doi: 10.1038/s41598-023-43458-3.