Welcome to Acta Armamentarii ! Today is Share:

Acta Armamentarii ›› 2022, Vol. 43 ›› Issue (S1): 74-81.doi: 10.12382/bgxb.2022.A005

• Paper • Previous Articles     Next Articles

Simulation of Ground-air Cooperative Combat Based on Reinforcement Learning in Localization Environment

LI Li, LI Xuguang, GUO Kaijie, SHI Chao, CHEN Zhaowen   

  1. (Departament of Vehicle Integrated Electronics Research and Development,Institute of Computer Application Technology, Norinco Group, Beijing 100089, China)
  • Online:2022-06-28

Abstract: For the actual problems of lack of actual combat scenes and insufficient training data in the military field, the deep reinforcement learning method is used to realize a multi-agent decision-making model for the unmanned ground and air cooperative combat simulation. A virtual simulation environment is built using Phytium CPU and Kunlun K200 NPU as hardware platform and Kylin V10 operating system as software environment. The simulation environment state representation,agents’ action space and rewards mechanism are set, and a multi-agent decision-making model based on the deep deterministic policy gradient (MADDPG) algorithm is established. Simulation experiments verified that MADDPG algorithm can make reward value converge gradually in ground-air cooperative combat simulation scenarios, thus proving the effectiveness of MADDPG algorithm in the simulation scene of the ground-air cooperative combat.

Key words: ground-aircooperativecombat, reinforcementlearning, deepdeterministicpolicygradientalgorithm, multi-agentmodel, localizationenvironment

CLC Number: