Welcome to Acta Armamentarii ! Today is

Acta Armamentarii ›› 2025, Vol. 46 ›› Issue (3): 240357-.doi: 10.12382/bgxb.2024.0357

Previous Articles     Next Articles

A Reinforcement Learning-based Radar Jamming Decision-making Method with Adaptive Setting of Exploration Rate

ZHANG Wang, SHAO Xuehui, TANG Huilong, WEI Jianlin, WANG Wei*()   

  1. School of Intelligent Science and Engineering, Harbin Engineering University, Harbin 150001, Heilongjiang, China
  • Received:2024-05-10 Online:2025-03-26
  • Contact: WANG Wei

Abstract:

The current radar jamming decision-making method based on reinforcement learning sets the exploration rate parameter according to a single factor and fixed law,which leads to the increase in the number of confrontation rounds required for algorithm convergence.A reinforcement learning-based radar jamming decision-making method with adaptive setting of exploration rate is proposed.Based on the Metropolis parameter adjustment criterion of simulated annealing method,an adaptive setting criterion of exploration rate is derived from the number of radar operating states recognized by jammers,the number of jamming successes,the change rate of algorithm convergence curve and the jammer’s cognition of radar in the process of countermeasures.According to the effectiveness of jamming action,a jamming action space clipping strategy is designed to reduce the dimension of jamming action space and further improve the convergence speed of the algorithm.In the simulation experiment,two different radar working state diagrams are designed and compared by using the Q-learning algorithm.The simulated results show that the proposed method can achieve the adaptive setting of exploration rate when the radar working state transition relationship changes.Compared with the exploration rate setting scheme based on simulated annealing method,single factor and fixed law,the number of confrontation rounds required for the convergence of the proposed method in the two state diagrams is reduced by 18%,26%,45% and 42%,44%,48%,respectively.At the same time,it can also obtain greater benefits and higher jamming success rate,which provides a new idea of exploration rate setting for multi-functional radar jamming decision-making based on reinforcement learning.

Key words: multi-functional radar, radar jamming decision-making, reinforcement learning, exploration rate