Welcome to Acta Armamentarii ! Today is

Acta Armamentarii ›› 2023, Vol. 44 ›› Issue (S2): 126-134.doi: 10.12382/bgxb.2023.0877

Special Issue: 群体协同与自主技术

Previous Articles     Next Articles

Simulation of Reinforcement Learning-based UAV Swarm Adversarial Strategy Deduction

CAO Zijian1, SUN Zelong2, YAN Guochuang3, FU Yanfang1,*(), YANG Bo1, LI Qinjie1, LEI Kailin1, GAO Linghang1   

  1. 1 School of Computer Science and Engineering, Xi’an Technological University, Xi’an 710021, Shaanxi, China
    2 School of Armament Science and Technology, Xi’an Technological University, Xi’an 710021, Shaanxi, China
    3 Test and Measuring Academy of Norinco Group, Huayin 714200, Shaanxi, China
  • Received:2023-09-05 Online:2024-01-10
  • Contact: FU Yanfang

Abstract:

The application of drone clusters in military warfare, public safety, and commercial fields is becoming increasingly widespread. But it is a challenge to develop the efficient strategiesin complex and ever-changing adversarial environments. In order to enable the drone clusters to autonomously learn and adapt to the change in adversarial environment, and improve the efficiency and success rate of task execution, a multi-agent reinforcement learning algorithm framework based on value decomposition is proposed. The behavior of drone clusters in different adversarial scenarios is simulated on a simulation platform, and the ability of drone clusters to make decisions in different situations is cultivated to achieve the optimal task objectives through reinforcement learning algorithms. The application and performance comparison of different reinforcement learning algorithms in drone swarm adversarial strategies are discussed. The experimental results show that the proposed algorithm shows good performance in various cluster confrontation environments, demonstrating its strong support in military drone cluster confrontation.

Key words: drone swarm, adversarial strategy, deep reinforcement learning, value decomposition

CLC Number: