• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于集成传感、通信和电力传输系统中稳健波束成形的深度强化学习

Deep Reinforcemnet Learning for Robust Beamforming in Integrated Sensing, Communication and Power Transmission Systems.

作者信息

Xie Chenfei, Xiu Yue, Yang Songjie, Miao Qilong, Chen Lu, Gao Yong, Zhang Zhongpei

机构信息

National Key Laboratory of Science and Technology on Communications, University of Electronic Science and Technology of China, Chengdu 611731, China.

School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China.

出版信息

Sensors (Basel). 2025 Jan 10;25(2):388. doi: 10.3390/s25020388.

DOI:10.3390/s25020388
PMID:39860757
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11768676/
Abstract

A communication network integrating multiple modes can effectively support the sustainable development of next-generation wireless communications. Integrated sensing, communication, and power transfer (ISCPT) represents an emerging technological paradigm that not only facilitates information transmission but also enables environmental sensing and wireless power transfer. To achieve optimal beamforming in transmission, it is crucial to satisfy multiple constraints, including quality of service (QoS), radar sensing accuracy, and power transfer efficiency, while ensuring fundamental system performance. The presence of multiple parametric constraints makes the problem a non-convex optimization challenge, underscoring the need for a solution that balances low computational complexity with high precision. Additionally, the accuracy of channel state information (CSI) is pivotal in determining the achievable rate, as imperfect or incomplete CSI can significantly degrade system performance and beamforming efficiency. Deep reinforcement learning (DRL), a machine learning technique where an agent learns by interacting with its environment, offers a promising approach that can dynamically optimize system performance through adaptive decision-making strategies. In this paper, we propose a DRL-based ISCPT framework, which effectively manages complex environmental states and continuously adjusts variables related to sensing, communication, and energy harvesting to enhance overall system efficiency and reliability. The achievable rate upper bound can be inferred through robust, learnable beamforming in the ISCPT system. Our results demonstrate that DRL-based algorithms significantly improve resource allocation, power management, and information transmission, particularly in dynamic and uncertain environments with imperfect CSI.

摘要

一个集成多种模式的通信网络能够有效地支持下一代无线通信的可持续发展。集成传感、通信和功率传输(ISCPT)代表了一种新兴的技术范式,它不仅有助于信息传输,还能实现环境感知和无线功率传输。为了在传输中实现最优波束成形,在确保基本系统性能的同时,满足包括服务质量(QoS)、雷达传感精度和功率传输效率在内的多个约束条件至关重要。多个参数约束的存在使得该问题成为一个非凸优化挑战,这突出了需要一种在低计算复杂度和高精度之间取得平衡的解决方案。此外,信道状态信息(CSI)的准确性对于确定可实现速率至关重要,因为不完美或不完整的CSI会显著降低系统性能和波束成形效率。深度强化学习(DRL)是一种机器学习技术,其中智能体通过与其环境交互来学习,它提供了一种有前景的方法,可以通过自适应决策策略动态优化系统性能。在本文中,我们提出了一个基于DRL的ISCPT框架,该框架有效地管理复杂的环境状态,并不断调整与传感、通信和能量收集相关的变量,以提高整体系统效率和可靠性。在ISCPT系统中,可以通过稳健的、可学习的波束成形来推断可实现速率的上限。我们的结果表明,基于DRL的算法显著改善了资源分配、功率管理和信息传输,特别是在具有不完美CSI的动态和不确定环境中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/29547b8ff4d4/sensors-25-00388-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/4f98e56ca751/sensors-25-00388-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/dadc376285fe/sensors-25-00388-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/33d034907d10/sensors-25-00388-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/8ed66398ba6b/sensors-25-00388-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/8cd01c9cafa1/sensors-25-00388-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/2dc17c2eb371/sensors-25-00388-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/3bac0cf1d6a9/sensors-25-00388-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/d547fe38c987/sensors-25-00388-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/3f6b36b2e841/sensors-25-00388-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/29547b8ff4d4/sensors-25-00388-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/4f98e56ca751/sensors-25-00388-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/dadc376285fe/sensors-25-00388-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/33d034907d10/sensors-25-00388-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/8ed66398ba6b/sensors-25-00388-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/8cd01c9cafa1/sensors-25-00388-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/2dc17c2eb371/sensors-25-00388-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/3bac0cf1d6a9/sensors-25-00388-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/d547fe38c987/sensors-25-00388-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/3f6b36b2e841/sensors-25-00388-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dc8/11768676/29547b8ff4d4/sensors-25-00388-g010.jpg

相似文献

1
Deep Reinforcemnet Learning for Robust Beamforming in Integrated Sensing, Communication and Power Transmission Systems.用于集成传感、通信和电力传输系统中稳健波束成形的深度强化学习
Sensors (Basel). 2025 Jan 10;25(2):388. doi: 10.3390/s25020388.
2
Joint Task Offloading and Resource Allocation for Intelligent Reflecting Surface-Aided Integrated Sensing and Communication Systems Using Deep Reinforcement Learning Algorithm.基于深度强化学习算法的智能反射面辅助集成传感与通信系统的联合任务卸载与资源分配
Sensors (Basel). 2023 Dec 18;23(24):9896. doi: 10.3390/s23249896.
3
Joint Beamforming, Power Allocation, and Splitting Control for SWIPT-Enabled IoT Networks with Deep Reinforcement Learning and Game Theory.基于深度强化学习和博弈论的支持同时无线信息与能量传输的物联网网络的联合波束成形、功率分配和分割控制
Sensors (Basel). 2022 Mar 17;22(6):2328. doi: 10.3390/s22062328.
4
Power Efficient Secure Full-Duplex SWIPT Using NOMA and D2D with Imperfect CSI.使用非正交多址接入(NOMA)和设备到设备(D2D)通信且信道状态信息(CSI)不完善的高能效安全全双工同时无线信息与能量传输(SWIPT)
Sensors (Basel). 2020 Sep 21;20(18):5395. doi: 10.3390/s20185395.
5
Waveform Design for the Integrated Sensing, Communication, and Simultaneous Wireless Information and Power Transfer System.集成传感、通信及同时进行无线信息与功率传输系统的波形设计
Sensors (Basel). 2024 Jun 25;24(13):4129. doi: 10.3390/s24134129.
6
Joint Data Transmission and Energy Harvesting for MISO Downlink Transmission Coordination in Wireless IoT Networks.无线物联网网络中 MISO 下行链路传输协调的联合数据传输和能量收集。
Sensors (Basel). 2023 Apr 11;23(8):3900. doi: 10.3390/s23083900.
7
Deep Reinforcement Learning-Assisted Optimization for Resource Allocation in Downlink OFDMA Cooperative Systems.深度强化学习辅助的下行链路正交频分多址协作系统资源分配优化
Entropy (Basel). 2023 Feb 24;25(3):413. doi: 10.3390/e25030413.
8
Robust Beamforming Design for Secure V2X Downlink System with Wireless Information and Power Transfer under a Nonlinear Energy Harvesting Model.基于非线性能量收集模型的安全 V2X 下行链路系统的无线信息和功率传输的鲁棒波束成形设计。
Sensors (Basel). 2018 Sep 30;18(10):3294. doi: 10.3390/s18103294.
9
Deep Reinforcement Learning-Based Coordinated Beamforming for mmWave Massive MIMO Vehicular Networks.基于深度强化学习的毫米波大规模 MIMO 车联网协同波束成形。
Sensors (Basel). 2023 Mar 3;23(5):2772. doi: 10.3390/s23052772.
10
Intelligent Reflecting Surfaces Beamforming Optimization with Statistical Channel Knowledge.基于统计信道知识的智能反射面波束成形优化
Sensors (Basel). 2022 Mar 20;22(6):2390. doi: 10.3390/s22062390.

本文引用的文献

1
Waveform Design for the Integrated Sensing, Communication, and Simultaneous Wireless Information and Power Transfer System.集成传感、通信及同时进行无线信息与功率传输系统的波形设计
Sensors (Basel). 2024 Jun 25;24(13):4129. doi: 10.3390/s24134129.
2
Analyzing a stochastic process driven by Ornstein-Uhlenbeck noise.分析由 Ornstein-Uhlenbeck 噪声驱动的随机过程。
Phys Rev E. 2018 Jan;97(1-1):012113. doi: 10.1103/PhysRevE.97.012113.