一种基于深度强化学习的协同通信干扰决策算法

宋佰霖; 许华; 齐子森; 饶宁; 彭翔

doi:10.12263/DZXB.20210814

您当前的位置：

首页 >

文章列表页 >

一种基于深度强化学习的协同通信干扰决策算法

电磁频谱智能+ | 更新时间：2025-12-08

- 一种基于深度强化学习的协同通信干扰决策算法
- A Collaborative Communication Jamming Decision Algorithm Based on Deep Reinforcement Learning
- 电子学报 2022年50卷第6期页码：1301-1309
- 作者机构：
  
  空军工程大学信息与导航学院，陕西西安 710077
- 作者简介：
  
  [ "宋佰霖男，1997年出生，辽宁沈阳人.现为空军工程大学硕士研究生.主要研究方向为通信对抗智能决策和深度强化学习.E-mail: songbail@126.com" ]
  [ "许华男，1976年出生，湖北宜昌人.现为空军工程大学信息与导航学院教授、博士生导师.主要研究方向为通信对抗、信号盲处理.E-mail: 13720720010@139.com" ]
- 基金信息：
  
  国家自然科学基金青年基金(6190656)
- DOI：10.12263/DZXB.20210814
  中图分类号： TN975;
- 收稿：2021-06-30，
  
  修回：2022-01-05，
  
  纸质出版：2022-06-25
- 稿件说明：
移动端阅览
宋佰霖,许华,齐子森等.一种基于深度强化学习的协同通信干扰决策算法[J].电子学报,2022,50(06):1301-1309.

SONG Bai-lin,XU Hua,QI Zi-sen,et al.A Collaborative Communication Jamming Decision Algorithm Based on Deep Reinforcement Learning[J].ACTA ELECTRONICA SINICA,2022,50(06):1301-1309.
宋佰霖,许华,齐子森等.一种基于深度强化学习的协同通信干扰决策算法[J].电子学报,2022,50(06):1301-1309. DOI： 10.12263/DZXB.20210814.

SONG Bai-lin,XU Hua,QI Zi-sen,et al.A Collaborative Communication Jamming Decision Algorithm Based on Deep Reinforcement Learning[J].ACTA ELECTRONICA SINICA,2022,50(06):1301-1309. DOI： 10.12263/DZXB.20210814.

摘要

针对协同电子战中跳频通信干扰协同决策难题，通过构建“整体优化、逐站决策”的协同决策模型，基于深度强化学习技术，设计了在Actor-Critic算法架构下融合优势函数的决策算法，并在奖励函数中嵌入专家激励机制以提高算法的探索能力，采用集中式训练方法优化决策网络，使算法能够输出资源利用率最高的干扰方案，并大幅提高决策效率.仿真结果表明，相比于现有智能决策算法，本文算法给出的干扰方案能够节约8%干扰资源，决策效率提高50%以上，具有较大实用价值.

Abstract

In order to solve the problem of collaborative decision-making of frequency-hopping communication jamming in collaborative electronic warfare

based on deep reinforcement learning

a collaborative jamming decision-making algorithm based on actor-critic algorithm framework is proposed

which fuses dominant functions by building a collaborative decision-making model of "overall optimization and making decision station by station". An expert experience mechanism is embedded in the reward function to improve the exploration ability of the algorithm

and the decision network is optimized by the distributed execution-centralized training method

so that the algorithm can output the jamming scheme with the highest resource utilization rate and greatly improve the efficiency of decision-making. The simulation results show that

compared with the existing intelligent decision algorithms

the jamming scheme presented in this paper can save 8% of the interference resources and improve the decision efficiency by more than 50%

which is of great practical value.

关键词

Keywords

references

XIAO L , LIU J L , LI Q D , et al . User-centric view of jamming games in cognitive radio networks [J]. IEEE Transactions on Information Forensics and Security , 2015 , 10 ( 12 ): 2578 - 2590 .

AMURU S , DHILLON H S , BUEHRER R M . On jamming against wireless networks [J]. IEEE Transactions on Wireless Communications , 2017 , 16 ( 1 ): 412 - 428 .

ZHOU P , WANG Q , WANG W , et al . Near-optimal and practical jamming-resistant energy-efficient cognitive radio communications [J]. IEEE Transactions on Information Forensics and Security , 2017 , 12 ( 11 ): 2807 - 2822 .

JIANG H Q , ZHANG Y R , XU H Y . Optimal allocation of cooperative jamming resource based on hybrid quantum-behaved particle swarm optimization and genetic algorithm [J]. IET Radar , Sonar & Navigation, 2017 , 11 ( 1 ): 185 - 192 .

施伟 , 冯旸赫 , 程光权 , 等 . 基于深度强化学习的多机协同空战方法研究 [J]. 自动化学报 , 2021 , 47 ( 7 ): 1610 - 1623 .

SHI W , FENG Y H , CHENG G Q , et al . Research on multi-aircraft cooperative air combat method based on deep reinforcement learning [J]. Acta Automatica Sinica , 2021 , 47 ( 7 ): 1610 - 1623 . (in Chinese)

ZHUANSUN S S , YANG J N , LIU H . An algorithm for jamming strategy using OMP and MAB [J]. EURASIP Journal on Wireless Communications and Networking , 2019 , 2019( 1 ): 1 - 11 .

AMURU S , TEKIN C , SCHAAR M VAN DER , et al . Jamming bandits—A novel learning method for optimal jamming [J]. IEEE Transactions on Wireless Communications , 2016 , 15 ( 4 ): 2792 - 2808 .

颛孙少帅 , 杨俊安 , 刘辉 , 等 . 采用双层强化学习的干扰决策算法 [J]. 西安交通大学学报 , 2018 , 52 ( 2 ): 63 - 69 .

ZHUANSUN S S , YANG J N , LIU H , et al . An algorithm for jamming decision using dual reinforcement learning [J]. Journal of Xi'an Jiaotong University , 2018 , 52 ( 2 ): 63 - 69 . (in Chinese)

许华 , 宋佰霖 , 蒋磊 , 等 . 一种通信对抗干扰资源分配智能决策算法 [J]. 电子与信息学报 , 2021 , 43 ( 11 ): 3086 - 3095 .

XU H , SONG B L , JIANG L , et al . An intelligent decision-making algorithm for communication countermeasure jamming resource allocation [J]. Journal of Electronics & Information Technology , 2021 , 43 ( 11 ): 3086 - 3095 . (in Chinese)

MNIH V , BADIA A P , MIRZA M , et al . Asynchronous methods for deep reinforcement learning [C]// Proceedings of the 33rd International Conference on Machine Learning(ICML) . New York : ACM , 2016 : 1928 - 1937 .

MNIH V , KAVUKCUOGLU K , SILVER D , et al . Human-level control through deep reinforcement learning [J]. Nature , 2015 , 518 ( 7540 ): 529 - 533 .

HUYNH N VAN , NGUYEN D N , HOANG D T , et al . " Jam me if You can: " defeating jammer with deep dueling neural network architecture and ambient backscattering augmented communications [J]. IEEE Journal on Selected Areas in Communications , 2019 , 37 ( 11 ): 2603 - 2620 .

陈思光 , 陈佳民 , 赵传信 . 基于深度强化学习的云边协同计算迁移研究 [J]. 电子学报 , 2021 , 49 ( 1 ): 157 - 166 .

CHEN S G , CHEN J M , ZHAO C X . Deep reinforcement learning based cloud-edge collaborative computation offloading mechanism [J]. Acta Electronica Sinica , 2021 , 49 ( 1 ): 157 - 166 . (in Chinese)

浏览量

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

一种基于深度强化学习的动态自适应干扰功率分配方法

基于因果思维树的电动汽车电池SOC预测模型

车联网边缘计算环境下基于流量预测的高效任务卸载策略研究