Welcome to Acta Armamentarii ! Today is

Acta Armamentarii ›› 2024, Vol. 45 ›› Issue (12): 4423-4434.doi: 10.12382/bgxb.2023.1081

Previous Articles     Next Articles

Collaborative Regional Information Collection Strategy Based on MLAT-DRL Algorithm

LOU Shuhan1, WANG Chongchong1, GONG Wei1,2,*(), DENG Liyuan1, LI Li1,2   

  1. 1 School of Electronics and Information Engineering, Tongji University, Shanghai 201804, China
    2 Shanghai Research Institute for Intelligent Autonomous Systems, Tongji University, Shanghai 201804, China
  • Received:2023-11-06 Online:2024-02-19
  • Contact: GONG Wei

Abstract:

Aiming at the difficulties faced by UAV swarm collaborative regional information collection in adversarial environment (e.g., complex environment structure and blocked swarm communication), a multi-level hybrid observation space with attention-deep reinforcement learning (MLAT-DRL) is proposed for decision making of UAV in information collection task. The proposed algorithm adopts a centralized training with decentralized execution paradigm, which realizes the efficient collaboration of UAV swarm in the absence of communications. In addition, a multi-level hybrid observation space method is proposed to develop the multi-scale representations of environmental features and realize the efficient use of global information and local observation. Moreover, the algorithm introduces a recurrent neural network incorporating an attention mechanism in the network, which improves the risk perception ability of UAV swarm. A prioritized experience replay strategy is employed to improve the utilization rate of samples and reduces the difficulty of training. It is verified from simulations that the proposed MLAT-DRL algorithm outperforms baseline algorithms in terms of data collection and risk aversion.

Key words: unmanned aerial vehicle swarm, regional information collection, multi-agent reinforcement learning, multi-level hybrid observation space, attention mechanism

CLC Number: