一种深度强化学习的雷达辐射源个体识别方法

doi:10.3969/j.issn.1000-1093.2018.12.016

摘要/Abstract

摘要： 针对传统依赖于人工经验提取辐射源个体特征的不足，提出一种基于深度强化学习的雷达辐射源个体识别方法。利用发射机非理想信道造成的辐射源信号包络在信号变化时呈现的不同瞬态信息，以信号包络前沿作为深度神经网络的输入状态，以辐射源类别作为当前输入状态的可选动作，通过卷积神经网络自动提取辐射源包络个体特征，并拟合当前状态动作对的Q值，进而以强化学习模型完成雷达辐射源个体识别任务。讨论了深度Q网络模型、深度双Q网络模型以及Dueling Network模型3种深度强化学习模型在辐射源识别任务中的应用。实测数据仿真实验表明：传统机器学习算法的识别率不足80%，而深度强化学习网络的识别率高达98.42%.

关键词: 雷达, 辐射源个体识别, 深度神经网络, 强化学习

Abstract: A specific emitter identification (SEI) method based on deep reinforcement learning is proposed on account of the deficiency of emitter individual feature extraction depending on artificial experience. Due to the differences of the transient information of signal envelope, which results from the change of the signal owing to a nonideal transmitter channel, an envelope rising edge is used as the input state of deep neural network, and the emitter classifications are used as the optional actions of the current input state. The envelope features are extracted automatically through the convolutional neural network (CNN), and Q values of the current state action pairs are fitted, thus completing the specific emitter identification task based on the reinforcement learning model. The applications of deep Q network (DQN), deep double Q network (DDQN) and Dueling network in the specific emitter identification are discussed. The measured results show that the recognition rate of traditional machine learning algorithm is less than 80%, but the deep reinforcement learning model can achieve the high recognition rate of 98.42%. Key

Key words: radar, specificemitteridentification, deepneuralnetwork, reinforcementlearning

中图分类号:

TN971⁺.1

冷鹏飞，徐朝阳. 一种深度强化学习的雷达辐射源个体识别方法[J]. 兵工学报, 2018, 39(12): 2420-2426.

LENG Peng-fei， XU Chao-yang. Specific Emitter Identification Based on Deep Reinforcement Learning[J]. Acta Armamentarii, 2018, 39(12): 2420-2426.

参考文献

［1］陈昌孝, 何明浩, 徐璟,等. 雷达辐射源识别技术研究进展［J］. 空军预警学院学报, 2014, 28(1):1-5,9.
CHEN Chang-xiao, HE Ming-hao, XU Jing, et al. Progress of study on recognition technology of radar emitter ［J］. Journal of Air Force Early Warning Academy, 2014, 28(1):1-5,9. (in Chinese)
［2］周志文, 黄高明, 陈海洋,等. 雷达辐射源识别算法综述［J］. 电讯技术, 2017, 57(8):973-980.
ZHOU Zhi-wen, HUANG Gao-ming, CHEN Hai-yang, et al. A review of radar emitter recognition algorithm［J］. Telecommunication Engineering, 2017, 57(8):973-980. (in Chinese)
［3］周志文, 黄高明, 高俊,等. 一种深度学习的雷达辐射源识别算法［J］. 西安电子科技大学学报, 2017, 44(3):77-82.
ZHOU Zhi-wen, HUANG Gao-ming, GAO Jun, et al. Radar emitter identification algorithm based on deep learning［J］. Journal of Xidian University, 2017, 44(3):77-82. (in Chinese)
［4］ Kawalec A, Owczarek R. Specific emitter identification using intrapulse data［C］∥Proceedings of European Radar Conference. Amsterdam, the Netherlands: IEEE, 2004:249-252.
［5］王宏伟, 赵国庆, 王玉军. 基于脉冲包络前沿高阶矩特征的辐射源个体识别［J］. 现代雷达, 2010, 32(10):42-45.
WANG Hong-wei, ZHAO Guo-qing, WANG Yu-jun. Specific emitter identification based on higher order moment of the envelope's front edge［J］. Modern Radar, 2010, 32(10):42-45. (in Chinese)
［6］程吉祥, 张葛祥, 李志丹. 基于时频原子方法的雷达辐射源个体识别［J］. 航天电子对抗, 2011, 27(1):54-57.
CHENG Ji-xiang, ZHANG Ge-xiang, LI Zhi-dan. A novel specific emitter identification method based on time-frequency atom approach［J］. Aerospace Electronic Warfare, 2011, 27(1):54-57.(in Chinese)
［7］ Wang L, Ji H B, Shi Y. Feature extraction and optimization of representative-slice in ambiguity function for moving radar emitter recognition［C］∥Proceedings of IEEE International Conference on Acoustics Speech and Signal Processing. Dallas, TX, US: IEEE, 2010:2246-2249.
［8］ Schmidhuber J. Deep learning in neural networks: an overview［J］. Neural Network, 2014, 61(1):85-117.
［9］周志华. 机器学习［M］. 北京:清华大学出版社, 2016: 380-381.
ZHOU Zhi-hua. Machine learning［M］. Beijing: Tsinghua University Press, 2016:380-381. (in Chinese)

［10］ Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks［C］∥Proceedings of International Conference on Neural Information Processing Systems. Spain: Neural Information Processing Systems Foundation, 2012:1097-1105.
［11］ Mnih V, Kavukcuoglu K, Silver D, et al. Human-level control through deep reinforcement learning［J］. Nature, 2015, 518(7540): 529-541.
［12］ Bengio Y I, Goodfellow J, Courville A. Deep learning ［M］. Cambridge, MA, US: MIT Press, 2016:340-341.
［13］ van Hasselt H, Guez A, Silver D. Deep reinforcement learning with double Q-learning［C］∥Proceedings of the 30th AAAI Conference on Artificial Intelligence. Phoenix, AZ, US: AAAI, 2016:2094-2100.
［14］ Wang Z, Schaul T, Hessel M, et al. Dueling network architectures for deep reinforcement learning［C］∥Proceedings of International Conference on Machine Learning. New York,NY, US: PMLR, 2016:1995-2003.
［15］ Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks［J］. Journal of Machine Learning Research, 2010, 9: 249-256.

第39卷第12期
2018 年12月兵工学报ACTA
ARMAMENTARIIVol.39No.12Dec. 2018

[1]	李正杰，陈红印，谢军伟，张浩为，刘斌. 一种防空服务质量模型下的集中式多输入多输出雷达三维机动跟踪功率分配方法[J]. 兵工学报, 2024, 45(4): 1321-1331.
[2]	熊光明, 罗震, 孙冬, 陶俊峰, 唐泽月, 吴超. 基于红外相机和毫米波雷达融合的烟雾遮挡无人驾驶车辆目标检测与跟踪[J]. 兵工学报, 2024, 45(3): 893-906.
[3]	张亮, 杜庆磊, 张昭建, 王永良. 一种慢时间维的密集假目标干扰主瓣匿影算法[J]. 兵工学报, 2024, 45(2): 606-617.
[4]	李松, 麻壮壮, 张蕴霖, 邵晋梁. 基于安全强化学习的多智能体覆盖路径规划[J]. 兵工学报, 2023, 44(S2): 101-113.
[5]	曹子建, 孙泽龙, 闫国闯, 傅妍芳, 杨博, 李秦洁, 雷凯麟, 高领航. 基于强化学习的无人机集群对抗策略推演仿真[J]. 兵工学报, 2023, 44(S2): 126-134.
[6]	刘畅, 雷红波, 林时尧, 范世鹏, 王江. 基于多模型网络的激光末制导炮弹诸元解算方法[J]. 兵工学报, 2023, 44(9): 2745-2755.
[7]	杨加秀, 李新凯, 张宏立, 王昊. 基于积分强化学习的四旋翼无人机鲁棒跟踪[J]. 兵工学报, 2023, 44(9): 2802-2813.
[8]	柳斌, 李雪梅. 一种基于激光雷达点云的自适应双半径滤波算法[J]. 兵工学报, 2023, 44(9): 2768-2777.
[9]	张凯歌, 卢志刚, 聂天常, 李志伟, 郭宇强. 面向无人装备的智能边缘计算软技术分析[J]. 兵工学报, 2023, 44(9): 2611-2621.
[10]	李超, 王瑞星, 黄建忠, 江飞龙, 魏雪梅, 孙延鑫. 稀疏奖励下基于强化学习的无人集群自主决策与智能协同[J]. 兵工学报, 2023, 44(6): 1537-1546.
[11]	张建东, 王鼎涵, 杨啟明, 史国庆, 陆屹, 张耀中. 基于分层强化学习的无人机空战多维决策[J]. 兵工学报, 2023, 44(6): 1547-1563.
[12]	郑泽新, 李伟, 邹鲲, 李艳福. 基于强化学习的对空雷达抗干扰波形设计[J]. 兵工学报, 2023, 44(5): 1422-1430.
[13]	霍健, 陈慧敏, 马云飞, 郭鹏宇, 杨旭, 孟祥盛. 基于MEMS激光雷达的车辆目标识别算法[J]. 兵工学报, 2023, 44(4): 940-948.
[14]	孔亚盟, 王国玉, 冯德军, 王俊杰. 相位调制表面周期非均匀间歇调制方法[J]. 兵工学报, 2023, 44(4): 1209-1216.
[15]	赵文飞, 陈健, 王, 滕克难. 基于强化学习的海上要地群协同防空动态火力分配[J]. 兵工学报, 2023, 44(11): 3516-3528.