北京理工大学 宇航学院, 北京 100081
*邮箱: wangxf@bit.edu.cn
收稿:2024-01-26,
网络出版:2024-11-26,
纸质出版:2024-11-30
移动端阅览
李加申, 王晓芳, 林海. 引入虚拟目标的高超声速巡航导弹智能机动突防策略[J]. 兵工学报, 2024,45(11):3856-3867.
Jiashen LI, Xiaofang WANG, Hai LIN. Intelligent Penetration Policy for Hypersonic Cruise Missiles Based on Virtual Targets[J]. Acta Armamentarii, 2024, 45(11): 3856-3867.
李加申, 王晓芳, 林海. 引入虚拟目标的高超声速巡航导弹智能机动突防策略[J]. 兵工学报, 2024,45(11):3856-3867. DOI: 10.12382/bgxb.2023.1048.
Jiashen LI, Xiaofang WANG, Hai LIN. Intelligent Penetration Policy for Hypersonic Cruise Missiles Based on Virtual Targets[J]. Acta Armamentarii, 2024, 45(11): 3856-3867. DOI: 10.12382/bgxb.2023.1048.
针对高超声速巡航导弹机动突防时弹道偏离难以约束、突防策略对不同作战场景的泛化性能较差等问题
提出一种基于虚拟目标和上下文马尔可夫决策过程的智能机动突防决策算法。在以预定弹道为轴线的管状弹道包络面内选定多个静止的虚拟目标
采用深度强化学习算法对其相对预定弹道的位置参数进行决策;用比例导引律引导巡航弹依次攻击这些虚拟目标
在包络面内塑造出能满足突防要求的机动弹道。基于上下文马尔可夫决策过程
将针对单个作战场景的最优突防策略拓展到作战场景的概率分布上
提升突防策略对不同作战场景的适应性。仿真结果表明:该智能机动突防策略能在突防的同时约束弹道偏离
在拦截弹发射位置和机动能力发生变化时仍能保持良好性能。
An intelligent penetration policy using virtual targets and contextual Markov decision process (CMDP) for hypersonic cruise missiles is proposed to constrain the trajectory deviation and improve the generalization performance in different combat scenarios. The stationary virtual targets are chosen within a tubular envelope with the planned trajectory as axis
and the deep reinforcement learning algorithm is applied to decide their position relative to the axis. Then the proportional guidance law is used to guide the cruise missile to attack these virtual targets one by one with proportional guidance law
thus shaping a maneuvering trajectory meeting the requirements of penetration within the given envelope. The optimal penetration policy for a combat scenario is extended to the probability distribution of combat scenarios using CMDP to improve the generalization performance. The results demonstrate that the penetration policy constrains the trajectory deviation during penetraton and exhibits adaptability to variations of interceptor’s launch position and maneuvering capability.
雷虎民 , 骆长鑫 , 周池军 , 等 . 临近空间防御作战拦截弹制导与控制关键技术综述 [J ] . 航空兵器 , 2021 , 28 ( 2 ): 1 - 10 .
LEI H M , LUO C X , ZHOU C J , et al. Summary of key technologies of interceptor guidance and control in near space defense operations [J ] . Aero Weaponry , 2021 , 28 ( 2 ): 1 - 10 . (in Chinese)
汪丰麟 , 李沁远 , 范博 , 等 . 高超声速武器防御体系的发展现状与演进趋势 [J ] . 指挥与控制学报 , 2022 , 8 ( 4 ): 378 - 388 .
WANG F L , LI Q Y , FAN B , et al. Development status and trends of hypersonic weapon defense system [J ] . Journal of Command and Control , 2022 , 8 ( 4 ): 378 - 388 . (in Chinese)
张荣升 , 陈万春 . THAAD增程型拦截弹预测制导方法 [J ] . 北京航空航天大学学报 , 2021 , 47 ( 4 ): 863 - 874 .
ZHANG R S , CHEN W C . Predictive guidance method of THAAD-ER interceptor [J ] . Journal of Beijing University of Aeronautics and Astronautics , 2021 , 47 ( 4 ): 863 - 874 . (in Chinese)
石安华 , 李海燕 , 石卫波 , 等 . 临近空间高超声速巡航飞行器红外特征 [J ] . 兵工学报 , 2022 , 43 ( 4 ): 796 - 803 .
SHI A H , LI H Y , SHI W B , et al. Infrared radiation feature of near space hypersonic cruise vehicle [J ] . Acta Armamentarii , 2022 , 43 ( 4 ): 796 - 803 . (in Chinese) DOI: 10.12382/bgxb.2021.0105 http://doi.org/10.12382/bgxb.2021.0105 To meet the demand for optical characteristic data of near-space hypersonic cruise vehicles for the detectability analysis of space-based infrared early warning satellites and the design of future space-based infrared early warning satellitesthe flow-field parameters, body temperature, infrared radiation spectrum, infrared radiation luminance, and infrared radiation intensity of the hypersonic cruise vehicle like X-51A under the typical flight condition are studied based on the combined computation and analysis methods of hypersonic aerodynamics, combustion theory, heat transfer theory, gas radiation theory, and so on. The calculated results show that the radiation features of the hypersonic cruise vehicle like X-51A are influenced by the flight time and ballistic. The infrared radiation of the body is influenced obviously by the aerodynamic heating. The jet flame radiation of the vehicle using aviation kerosene as fuel has strong radiation band of CO<sub>2</sub>, H<sub>2</sub>O and CO with wavelength of 1.52 μm, 2.68 μm and 4.39 μm. The jet flame radiation contributes most to the total narrow-band radiation, including the strong characteristic radiation spectra of CO<sub>2</sub>, H<sub>2</sub>O and CO molecule. The body radiation of the vehicle dominates the total radiation at the wavelength band which doesn’t include the characteristic radiation spectra of CO<sub>2</sub>, H<sub>2</sub>O and CO molecule.
AN H , WU Q Q . Adaptive control of variable geometry inlet-configured air-breathing hypersonic vehicles [J ] . Journal of Spacecraft and Rockets , 2019 , 56 ( 5 ): 1520 - 1530 .
DALLE D , TORREZ S , DRISCOLL J . Turn performance of an air-breathing hypersonic vehicle [C ] // Proceedings of AIAA Atmospheric Flight Mechanics Conference. Oregon , Portland : AIAA , 2011 .
郭行 , 符文星 , 付斌 , 等 . 吸气式高超声速飞行器巡航段突防弹道规划 [J ] . 宇航学报 , 2017 , 38 ( 3 ): 287 - 295 .
GUO H , FU W X , FU B , et al. Penetration trajectory programming for air breathing hypersonic vehicles during the cruise phase [J ] . Journal of Astronautics , 2017 , 38 ( 3 ): 287 - 295 . (in Chinese)
王雨琪 , 宁国栋 , 王晓峰 , 等 . 基于微分对策的临近空间飞行器机动突防策略 [J ] . 航空学报 , 2020 , 41 ( 增刊2 ):724276.
WANG Y Q , NING G D , WANG X F , et al. Maneuver penetration strategy of near space vehicle based on differential game [J ] . Acta Aeronautica et Astronautica Sinica , 2020 , 41 ( S2 ): 724276. (in Chinese)
ENGLISH J T , WILHELM J P . Defender-Aware attacking guidance policy for the target-attacker-defender differential game [J ] . Journal of Aerospace Information Systems , 2021 , 18 ( 6 ): 366 - 376 .
HAREL M , MOSHAIOV A , ALKAHER D . Rationalizable strategies for the navigator-target-missile game [J ] . Journal of Guidance, Control, and Dynamics , 2020 , 43 ( 6 ): 1129 - 1142 .
王芳 , 林涛 , 张克 . 基于控制变量参数化的主动反拦截突防最优控制计算方法 [J ] . 航空学报 , 2015 , 36 ( 6 ): 2037 - 2046 . DOI: 10.7527/S1000-6893.2014.0359 http://doi.org/10.7527/S1000-6893.2014.0359 针对由于敌防空系统防御能力不断提高所带来的进攻导弹突防难题,提出主动反拦截突防(IAIP)的概念,以弥补传统机动突防仅考虑进攻导弹的逃逸而忽略其攻击任务的缺陷。根据IAIP制导的内涵,在综合考虑目标的机动性能、拦截导弹末段的拦截特性及进攻导弹的控制系统性能的基础上,建立进攻导弹-目标-拦截导弹的三体运动模型。将突防制导指令的设计等效为最优控制的求解,其中突防指令为实现燃料最省目标的最优解,进攻导弹的过载、拦截导弹的脱靶量、进攻导弹的攻击角、打击精度和突防后的视线角,分别为控制约束、路径约束和末端约束。借鉴控制变量参数化(CVP)方法将最优控制问题转化为非线性数学规划问题,并将路径约束离散化后采用序列二次规划(SQP)算法得到突防时机给定条件下制导指令的数值解。提出基于CVP的混合遗传算法(CVP-GA),用于求解最优突防时机及制导指令。仿真结果显示,采用IAIP最优控制算法的进攻导弹在成功突防后的打击精度仍可满足任务要求,且其燃料消耗相对于传统串联式突防方法降低了23.7%,验证了该方法的有效性及优越性。
WANG F , LIN T , ZHANG K . Control variable parameterization-based computational method for optimal control of initiative anti-interception penetration [J ] . Acta Aeronautica et Astronautica Sinica , 2015 , 36 ( 6 ): 2037 - 2046 . (in Chinese) DOI: 10.7527/S1000-6893.2014.0359 http://doi.org/10.7527/S1000-6893.2014.0359 To solve the problem caused by experiencing increases in defense capability of enemy antiaircraft system, the concept of initiative anti-interception penetration (IAIP) is proposed, to make up for the defects of the traditional penetration that only takes escape into account, while ignoring attack mission. According to the connotation of IAIP guidance, and considering the maneuver performance of target, terminal intercepting characteristics of interceptor missile and control system performance of attack missile, the three-body motion model, named attacker-target-interceptor, has been established, and the design of penetration guided command has been equivalent to solving nonlinear optimal control problem, where the penetration command is the solution of optimal control for minimizing fuel consumption, overload of attacker, miss distance of interceptor, attack angle, hit precision and angle of sight after penetration of attacker, are control constraint, path constraint and terminal constraint. After the optimal control problem is transformed into nonlinear integer programming model based on control variable parameterization (CVP) method and the continuous path constraint is simulated by scatter point, sequential quadratic program (SQP) algorithm is used to get the numerical solution of guidance commands under the condition of penetration occasion is given. An hydride genetic algorithm based on CVP (CVP-GA), is proposed to obtain the optimal penetration occasion and guided command. Simulation results show that hit accuracy of attacker used the optimal control of IAIP can still meet the mission requirements after successful penetration its fuel consumption is reduced by 23.7% compared with the traditional tandem penetration method, demonstrating the efficiency and superiority of the proposed method.
樊博璇 , 陈桂明 , 林洪涛 . 弹道导弹中段反应式机动突防规避策略 [J ] . 兵工学报 , 2022 , 43 ( 1 ): 69 - 78 .
FAN B X , CHEN G M , LIN H T . Mid-course reactive maneuver penetration and evading strategy of ballistic missile [J ] . Acta Armamentarii , 2022 , 43 ( 1 ): 69 - 78 . (in Chinese) DOI: 10.3969/j.issn.1000-1093.2022.01.008 http://doi.org/10.3969/j.issn.1000-1093.2022.01.008 The optimal maneuver penetration of ballistic missiles is studied to improve their penetration capability. Through the simplification and analysis of the motion model, it is determined whether the optimal penetration direction is located in the normal plane connecting the positions of ballistic missile and exoatmospheric kill vehicle. A multi-stage first-order differential motion equation is established based on the motion models of both offensive and defensive parties. The analytical expression of the optimal ballistic missile penetration time is obtained by solving this equation. The error limit of the model parameters that the penetration strategy can support is analyzed. In order to verify the correctness of the obtained optimal penetration time and optimal penetration direction, the two-dimensional and three-dimensional engagement simulations were carried out. The simulated results show that the evading strategy of short-time large pulse maneuver in the direction of the opposite position can be used to effectively penetrate the defense and achieve the maximum miss distance.
QIU X Q , GAO C S , JING W X . Maneuvering penetration strategies of ballistic missiles based on deep reinforcement learning [J ] . Proceedings of the Institution of Mechanical Engineers, Part G: Journal of Aerospace Engineering , 2022 , 236 ( 16 ): 3494 - 3504 .
张晚晴 , 余文斌 , 李静琳 , 等 . 基于纵程解析解的飞行器智能横程机动再入协同制导 [J ] . 兵工学报 , 2021 , 42 ( 7 ): 1400 - 1411 .
ZHANG W Q , YU W B , LI J L , et al. Cooperative reentry guidance for intelligent lateral maneuver of hypersonic vehicle based on downrange analytical solution [J ] . Acta Armamentarii , 2021 , 42 ( 7 ): 1400 - 1411 . (in Chinese) DOI: 10.3969/j.issn.1000-1093.2021.07.007 http://doi.org/10.3969/j.issn.1000-1093.2021.07.007 A cooperative guidance law based on deep Q-learning network (DQN) algorithm for lateral maneuver of hypersonic vehicles is proposed to meet the requirement of cooperative saturation attack. The longitudinal and lateral guidance laws are deigned. The longitudinal lift-to-drag ratio is calculated to obtain the modulus of bank angle based on the high-precision longitudinal analytical solution. The lateral bank reversal logic is abstracted as a Markov decision process (MDP), and the reinforcement learning method can be used. A lateral intelligent maneuver decision-making device based on DQN algorithm is designed. The intelligent agent can be generated by offline training according to the mission requirements, and is called online to generate the sign of bank angle. Simulated results show that the proposed guidance law can autonomously generate bank angle reversals on-line, strictly meets the requirements of flight time constraints and energy management, and has high guidance accuracy and good mission adaptability. Compared with the coordinative reentry guidance law based on three-dimensional analytical solutions, the proposed guidance law is used to fully achieve the lateral maneuverability of vehicle and make it have higher penetration potential.
WANG Y K , ZHAO K , GUIRAO J L G , et al. Online intelligent maneuvering penetration methods of missile with respect to unknown intercepting strategies based on reinforcement learning [J ] . Electronic Research Archive , 2022 , 30 ( 12 ): 4366 - 4381 .
吴杰 , 张成 , 李淼 , 等 . 基于凸优化和LQR的火箭返回轨迹跟踪制导 [J ] . 北京航空航天大学学报 , 2022 , 48 ( 11 ): 2270 - 2280 .
WU J , ZHANG C , LI M , et al. Rocket return trajectory tracking guidance based on convex optimization and LQR [J ] . Journal of Beijing University of Aeronautics and Astronautics , 2022 , 48 ( 11 ): 2270 - 2280 . (in Chinese)
KIRK R , ZHANG A , GREFENSTETTE E , et al. A survey of zero-shot generalization in deep reinforcement learning [J ] . Journal of Artificial Intelligence Research , 2023 , 76 : 201 - 264 .
王琦 , 杨毅远 , 江季 . Easy RL:强化学习教程 [M ] . 北京 : 人民邮电出版社 , 2022 : 37 - 98 .
WANG Q , YANG Y Y , JIANG J . Easy RL: reinforcement learning tutorial [M ] . Beijing : Posts & Telecom Press , 2022 : 37 - 98 . (in Chinese)
CHOU P W , MATURANA D , SCHERER S . Improving stochastic policy gradients in continuous control with deep reinforcement learning using the beta distribution [C ] // Proceedings of the 34th International Conference on Machine Learning.Sydney , Australia : JMLR .org, 2017 .
HAARNOJA T , TANG H , ABBEEL P , et al. Reinforcement Learning with Deep Energy-Based Policies [C ] // Proceedings of the 34th International Conference on Machine Learning.Sydney, Australia:JMLR.org , 2017 .
HUANG W , DU W , XU R Y D . On the neural tangent kernel of deep networks with orthogonal initialization [C ] // Proceedings of the 30th International Joint Conference on Artificial Intelligence.Montreal,Canada:IJCAI , 2021 .
0
浏览量
237
下载量
0
CNKI被引量
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024360号