• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

增强车联网通信中的最小感知失败距离:一种深度强化学习方法。

Enhancing the Minimum Awareness Failure Distance in V2X Communications: A Deep Reinforcement Learning Approach.

作者信息

Guzmán Leguel Anthony Kyung, Nguyen Hoa-Hung, Gómez Gutiérrez David, Yoo Jinwoo, Jeong Han-You

机构信息

Department of Electrical Engineering, Pusan National University, Busan 46241, Republic of Korea.

Intelligent Systems Research Lab, Intel Labs, Intel Tecnología de México, Jalisco 45017, Mexico.

出版信息

Sensors (Basel). 2024 Sep 20;24(18):6086. doi: 10.3390/s24186086.

DOI:10.3390/s24186086
PMID:39338833
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11435541/
Abstract

Vehicle-to-everything (V2X) communication is pivotal in enhancing cooperative awareness in vehicular networks. Typically, awareness is viewed as a vehicle's ability to perceive and share real-time kinematic information. We present a novel definition of awareness in V2X communications, conceptualizing it as a multi-faceted concept involving vehicle detection, tracking, and maintaining their safety distances. To enhance this awareness, we propose a deep reinforcement learning framework for the joint control of beacon rate and transmit power (DRL-JCBRTP). Our DRL-JCBRTP framework integrates LSTM-based actor networks and MLP-based critic networks within the Soft Actor-Critic (SAC) algorithm to effectively learn optimal policies. Leveraging local state information, the DRL-JCBRTP scheme uses an innovative reward function to increase the minimum awareness failure distance. Our SLMLab-Gym-VEINS simulations show that the DRL-JCBRTP scheme outperforms existing beaconing schemes in minimizing awareness failure probability and maximizing awareness distance, ultimately improving driving safety.

摘要

车与万物(V2X)通信对于增强车辆网络中的协同感知至关重要。通常,感知被视为车辆感知和共享实时运动信息的能力。我们提出了一种V2X通信中感知的新颖定义,将其概念化为一个多方面的概念,涉及车辆检测、跟踪以及保持它们的安全距离。为了增强这种感知,我们提出了一种用于信标速率和发射功率联合控制的深度强化学习框架(DRL-JCBRTP)。我们的DRL-JCBRTP框架在软演员-评论家(SAC)算法中集成了基于长短期记忆网络(LSTM)的智能体网络和基于多层感知器(MLP)的评论家网络,以有效学习最优策略。DRL-JCBRTP方案利用局部状态信息,使用创新的奖励函数来增加最小感知失败距离。我们的SLMLab-Gym-VEINS模拟表明,DRL-JCBRTP方案在最小化感知失败概率和最大化感知距离方面优于现有信标方案,最终提高了驾驶安全性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/c23a55360e38/sensors-24-06086-g024.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/a7cc38213b1e/sensors-24-06086-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/95722355e9c1/sensors-24-06086-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/87510201b5df/sensors-24-06086-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/b75b06d5a4ff/sensors-24-06086-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/af169165e5d7/sensors-24-06086-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/509e15c21bbb/sensors-24-06086-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/1dd1b1689d4c/sensors-24-06086-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/bf3abb34c25a/sensors-24-06086-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/5a98c4003475/sensors-24-06086-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/36446db7394e/sensors-24-06086-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/79a56484827e/sensors-24-06086-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/a806f2d9eacf/sensors-24-06086-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/ba31ce1f7cd7/sensors-24-06086-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/b0c2ab4888d9/sensors-24-06086-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/357585ba46d3/sensors-24-06086-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/c12bf256c582/sensors-24-06086-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/20a6933f3f12/sensors-24-06086-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/2ddd7f8f1bb5/sensors-24-06086-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/f14fce23404a/sensors-24-06086-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/fd726f93a934/sensors-24-06086-g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/f8eb4291ca41/sensors-24-06086-g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/f7c550eb851b/sensors-24-06086-g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/e858b6bd2f4f/sensors-24-06086-g023.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/c23a55360e38/sensors-24-06086-g024.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/a7cc38213b1e/sensors-24-06086-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/95722355e9c1/sensors-24-06086-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/87510201b5df/sensors-24-06086-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/b75b06d5a4ff/sensors-24-06086-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/af169165e5d7/sensors-24-06086-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/509e15c21bbb/sensors-24-06086-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/1dd1b1689d4c/sensors-24-06086-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/bf3abb34c25a/sensors-24-06086-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/5a98c4003475/sensors-24-06086-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/36446db7394e/sensors-24-06086-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/79a56484827e/sensors-24-06086-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/a806f2d9eacf/sensors-24-06086-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/ba31ce1f7cd7/sensors-24-06086-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/b0c2ab4888d9/sensors-24-06086-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/357585ba46d3/sensors-24-06086-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/c12bf256c582/sensors-24-06086-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/20a6933f3f12/sensors-24-06086-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/2ddd7f8f1bb5/sensors-24-06086-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/f14fce23404a/sensors-24-06086-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/fd726f93a934/sensors-24-06086-g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/f8eb4291ca41/sensors-24-06086-g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/f7c550eb851b/sensors-24-06086-g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/e858b6bd2f4f/sensors-24-06086-g023.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8808/11435541/c23a55360e38/sensors-24-06086-g024.jpg

相似文献

1
Enhancing the Minimum Awareness Failure Distance in V2X Communications: A Deep Reinforcement Learning Approach.增强车联网通信中的最小感知失败距离:一种深度强化学习方法。
Sensors (Basel). 2024 Sep 20;24(18):6086. doi: 10.3390/s24186086.
2
Energy-Efficient Resource Allocation Based on Deep Q-Network in V2V Communications.基于深度 Q 网络的车对车通信中的节能资源分配。
Sensors (Basel). 2023 Jan 23;23(3):1295. doi: 10.3390/s23031295.
3
Joint Optimization of Age of Information and Energy Consumption in NR-V2X System Based on Deep Reinforcement Learning.基于深度强化学习的NR-V2X系统中信息年龄与能量消耗的联合优化
Sensors (Basel). 2024 Jul 4;24(13):4338. doi: 10.3390/s24134338.
4
Cooperative Downloading for LEO Satellite Networks: A DRL-Based Approach.低轨卫星网络中的协同下载:基于 DRL 的方法。
Sensors (Basel). 2022 Sep 10;22(18):6853. doi: 10.3390/s22186853.
5
Deep Reinforcement Learning-Empowered Resource Allocation for Mobile Edge Computing in Cellular V2X Networks.深度强化学习助力蜂窝车联网网络中移动边缘计算的资源分配
Sensors (Basel). 2021 Jan 7;21(2):372. doi: 10.3390/s21020372.
6
A Cascaded Multi-Agent Reinforcement Learning-Based Resource Allocation for Cellular-V2X Vehicular Platooning Networks.基于级联多智能体强化学习的蜂窝车联网车辆编队网络资源分配
Sensors (Basel). 2024 Aug 30;24(17):5658. doi: 10.3390/s24175658.
7
Intelligent control of self-driving vehicles based on adaptive sampling supervised actor-critic and human driving experience.基于自适应采样监督式智能体-评论家算法和人类驾驶经验的自动驾驶车辆智能控制
Math Biosci Eng. 2024 May 24;21(5):6077-6096. doi: 10.3934/mbe.2024267.
8
A Multi-Agent Deep Reinforcement Learning-Based Popular Content Distribution Scheme in Vehicular Networks.一种基于多智能体深度强化学习的车载网络流行内容分发方案。
Entropy (Basel). 2023 May 12;25(5):792. doi: 10.3390/e25050792.
9
PC5-Based Cellular-V2X Evolution and Deployment.基于 PC5 的蜂窝车联网演进与部署。
Sensors (Basel). 2021 Jan 27;21(3):843. doi: 10.3390/s21030843.
10
Beam management optimization for V2V communications based on deep reinforcement learning.基于深度强化学习的车对车通信波束管理优化
Sci Rep. 2023 Nov 22;13(1):20440. doi: 10.1038/s41598-023-47769-3.

本文引用的文献

1
Approximate reinforcement learning to control beaconing congestion in distributed networks.用于控制分布式网络中信标拥塞的近似强化学习
Sci Rep. 2022 Jan 7;12(1):142. doi: 10.1038/s41598-021-04123-9.