Welcome to Acta Armamentarii ! Today is

Acta Armamentarii ›› 2025, Vol. 46 ›› Issue (S1): 250606-.doi: 10.12382/bgxb.2025.0606

Previous Articles     Next Articles

A Reinforcement Learning Method for Optimizing Air Combat Threat Assessment via Cross-attention Mechanisms and Expert-guided Reward Shaping

SUN Kang1, XUE Dingrui1,2, FAN Ji1, LIN Yuqing2,*(), LI Bo1, WANG Kexin1, LIU Jiancheng1, WEI Siwen3,**()   

  1. 1 Northwest Institute of Mechanical and Electrical Engineering, Xianyang 712099,Shannxi, China
    2 School of Information EngineeringChang’an University, Xi’an 710018,Shannxi, China
    3 School of Computer Science and TechnologyXidian University, Xi’an 710071,Shannxi, China
  • Received:2025-07-08 Online:2025-11-06
  • Contact: LIN Yuqing, WEI Siwen

Abstract:

Aerial target threat assessment remains a critical component in modern military operations,particularly in highly dynamic and uncertain combat environments.The traditional methods are difficult to effectively handle multi-target threats,real-time decision-making,and environmental uncertainties.To address these limitations,this paper proposed a DCA-AEST framework which combines two novel modules:dynamic cross-attention feature extraction (DCAFE) and adaptive entropy SAC-Twin (AEST).The DCAFE module utilizes a hierarchical cross-attention mechanism to dynamically extract high-order feature interactions from complex multi-source battlefield data,thereby enhancing the accuracy of threat detection and prioritization.The AEST module integrates the reinforcement learning with the expert-guided reward shaping and adaptive entropy regularization,allowing the module to adaptively optimize its threat evaluation strategy in real-time.The proposed DCA-AEST framework is rigorously validated through extensive experiments in a high-fidelity adversarial combat simulation environment.The results demonstrate that DCA-AEST framework has superior performance in comparison to state-of-the-art models,showcasing significant improvements in adaptability,decision-making stability,and threat assessment accuracy in dynamic and uncertain combat scenarios.

Key words: threat assessment, dynamic cross-attention, expert-guided reward shaping, reinforcement learning