• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的抗干扰通信

Anti-Jamming Communication Using Imitation Learning.

作者信息

Zhou Zhanyang, Niu Yingtao, Wan Boyu, Zhou Wenhao

机构信息

Sixty-Third Research Institute, National University of Defense Technology, Nanjing 210007, China.

Fundamentals Department, Air Force Engineering University of PLA, Xi'an 710051, China.

出版信息

Entropy (Basel). 2023 Nov 16;25(11):1547. doi: 10.3390/e25111547.

DOI:10.3390/e25111547
PMID:37998239
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10670521/
Abstract

The communication reliability of wireless communication systems is threatened by malicious jammers. Aiming at the problem of reliable communication under malicious jamming, a large number of schemes have been proposed to mitigate the effects of malicious jamming by avoiding the blocking interference of jammers. However, the existing anti-jamming schemes, such as fixed strategy, Reinforcement learning (RL), and deep Q network (DQN) have limited use of historical data, and most of them only pay attention to the current state changes and cannot gain experience from historical samples. In view of this, this manuscript proposes anti-jamming communication using imitation learning. Specifically, this manuscript addresses the problem of anti-jamming decisions for wireless communication in scenarios with malicious jamming and proposes an algorithm that consists of three steps: First, the heuristic-based Expert Trajectory Generation Algorithm is proposed as the expert strategy, which enables us to obtain the expert trajectory from historical samples. The trajectory mentioned in this algorithm represents the sequence of actions undertaken by the expert in various situations. Then obtaining a user strategy by imitating the expert strategy using an imitation learning neural network. Finally, adopting a functional user strategy for efficient and sequential anti-jamming decisions. Simulation results indicate that the proposed method outperforms the RL-based anti-jamming method and DQN-based anti-jamming method regarding solving continuous-state spectrum anti-jamming problems without causing "curse of dimensionality" and providing greater robustness against channel fading and noise as well as when the jamming pattern changes.

摘要

无线通信系统的通信可靠性受到恶意干扰器的威胁。针对恶意干扰下的可靠通信问题,人们提出了大量方案,通过避开干扰器的阻塞干扰来减轻恶意干扰的影响。然而,现有的抗干扰方案,如固定策略、强化学习(RL)和深度Q网络(DQN),对历史数据的利用有限,它们大多只关注当前状态变化,无法从历史样本中获取经验。鉴于此,本文提出了基于模仿学习的抗干扰通信方法。具体而言,本文解决了恶意干扰场景下无线通信的抗干扰决策问题,并提出了一种由三个步骤组成的算法:首先,提出基于启发式的专家轨迹生成算法作为专家策略,使我们能够从历史样本中获取专家轨迹。该算法中提到的轨迹代表专家在各种情况下采取的行动序列。然后,使用模仿学习神经网络通过模仿专家策略来获得用户策略。最后,采用有效的用户策略进行高效且连续的抗干扰决策。仿真结果表明,在解决连续状态频谱抗干扰问题时,所提方法在不引起“维度灾难”的情况下,在应对信道衰落和噪声以及干扰模式变化时,比基于RL的抗干扰方法和基于DQN的抗干扰方法具有更强的鲁棒性,性能更优。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/33b38e105268/entropy-25-01547-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/04df6455af32/entropy-25-01547-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/e31ed84d7b8f/entropy-25-01547-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/e6337f2efd89/entropy-25-01547-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/b5b80b790d7e/entropy-25-01547-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/3790422f991e/entropy-25-01547-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/8f9819a38576/entropy-25-01547-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/04976a8c1f2f/entropy-25-01547-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/9eb66cdb1910/entropy-25-01547-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/95c202ee9e61/entropy-25-01547-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/265f7208a4b3/entropy-25-01547-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/54367562809b/entropy-25-01547-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/33b38e105268/entropy-25-01547-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/04df6455af32/entropy-25-01547-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/e31ed84d7b8f/entropy-25-01547-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/e6337f2efd89/entropy-25-01547-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/b5b80b790d7e/entropy-25-01547-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/3790422f991e/entropy-25-01547-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/8f9819a38576/entropy-25-01547-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/04976a8c1f2f/entropy-25-01547-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/9eb66cdb1910/entropy-25-01547-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/95c202ee9e61/entropy-25-01547-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/265f7208a4b3/entropy-25-01547-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/54367562809b/entropy-25-01547-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/990a/10670521/33b38e105268/entropy-25-01547-g012.jpg

相似文献

1
Anti-Jamming Communication Using Imitation Learning.基于强化学习的抗干扰通信
Entropy (Basel). 2023 Nov 16;25(11):1547. doi: 10.3390/e25111547.
2
A Fast Anti-Jamming Algorithm Based on Imitation Learning for WSN.一种基于强化学习的无线传感器网络快速抗干扰算法
Sensors (Basel). 2023 Nov 17;23(22):9240. doi: 10.3390/s23229240.
3
A Distributed Anti-Jamming Algorithm Based on Actor-Critic Countering Intelligent Malicious Jamming for WSN.一种基于智能体-评论家的分布式抗干扰算法,用于应对无线传感器网络中的智能恶意干扰。
Sensors (Basel). 2022 Oct 25;22(21):8159. doi: 10.3390/s22218159.
4
GA-Dueling DQN Jamming Decision-Making Method for Intra-Pulse Frequency Agile Radar.基于遗传算法-双决斗深度Q网络的脉内频率捷变雷达干扰决策方法
Sensors (Basel). 2024 Feb 19;24(4):1325. doi: 10.3390/s24041325.
5
Airborne Radar Anti-Jamming Waveform Design Based on Deep Reinforcement Learning.基于深度强化学习的空基雷达抗干扰波形设计。
Sensors (Basel). 2022 Nov 10;22(22):8689. doi: 10.3390/s22228689.
6
On Performance Analysis of Protective Jamming Schemes in Wireless Sensor Networks.无线传感器网络中保护干扰方案的性能分析
Sensors (Basel). 2016 Nov 24;16(12):1987. doi: 10.3390/s16121987.
7
The Threat of Disruptive Jamming to Blockchain-Based Decentralized Federated Learning in Wireless Networks.无线网络中破坏性干扰对基于区块链的去中心化联邦学习的威胁
Sensors (Basel). 2024 Jan 15;24(2):535. doi: 10.3390/s24020535.
8
Robust multiple frequency multiple power localization schemes in the presence of multiple jamming attacks.存在多重干扰攻击时的稳健多频多功率定位方案。
PLoS One. 2017 May 11;12(5):e0177326. doi: 10.1371/journal.pone.0177326. eCollection 2017.
9
Deep Reinforcement Learning Based Decision Making for Complex Jamming Waveforms.基于深度强化学习的复杂干扰波形决策
Entropy (Basel). 2022 Oct 10;24(10):1441. doi: 10.3390/e24101441.
10
Approximating Nash equilibrium for anti-UAV jamming Markov game using a novel event-triggered multi-agent reinforcement learning.基于新型事件触发多智能体强化学习的反无人机干扰马尔可夫博弈的纳什均衡逼近
Neural Netw. 2023 Apr;161:330-342. doi: 10.1016/j.neunet.2022.12.022. Epub 2023 Feb 2.

本文引用的文献

1
Anti-Jamming Strategy for Federated Learning in Internet of Medical Things: A Game Approach.
IEEE J Biomed Health Inform. 2023 Feb;27(2):888-899. doi: 10.1109/JBHI.2022.3183644. Epub 2023 Feb 3.