欢迎访问《兵工学报》官方网站,今天是

兵工学报

• •    下一篇

面向有限缓冲区的多信道路由抗干扰决策研究

许江, 任国春*, 徐逸凡, 徐煜华, 龚玉萍, 郑学强, 崔丽, 韩昊   

  1. 陆军工程大学 通信工程学院,江苏 南京 210000
  • 收稿日期:2025-05-27 修回日期:2025-08-26
  • 基金资助:
    国家自然科学基金联合基金项目(U22B2002);江苏省自然科学基金青年基金项目(BK20231027)

A Finite Buffer Oriented Study of Anti-jamming Decisions for Multi-channel Routing

XU Jiang, REN Guochun*, XU Yifan, XU Yuhua, GONG Yupin, ZHENG Xueqiang, CUI Li, HAN Hao   

  1. College of Communications Engineering, Army Engineering University of PLA, Nanjing 210000, Jiangsu, China
  • Received:2025-05-27 Revised:2025-08-26

摘要: 该文章研究多源并发下无线自组织网络的多信道路由抗干扰问题。由于节点缓冲区有限且无线信道开放易受干扰,关键节点在承担多路数据转发任务时易发生缓冲区溢出和链路中断,加剧网络拥堵。提出一种面向有限缓冲区的多信道路由抗干扰决策方法,旨在解决动态干扰和高负载环境下的路由规划、信道选择以及拥塞控制的协同优化问题。将路由规划与信道决策问题建模为部分可观测随机博弈模型,并提出基于分层深度Q学习的多信道路由抗干扰决策算法:上层网络基于邻居缓冲区状态与目的地址优化路由路径以避免拥塞,下层网络结合路由决策与频谱感知结果动态规避干扰信道,通过融合跳数代价、拥塞惩罚与干扰规避的奖励函数设计,实现路由与抗干扰信道接入的协同优化。仿真结果表明,相较于对比算法,新算法在数据包传输成功率方面提升15%,有效增强了无线自组织网络在干扰环境下的可靠性和抗干扰能力。

关键词: 有限缓冲区, 多信道路由抗干扰, 分布式决策, 分层深度Q学习

Abstract: This paper investigates the multi-path routing anti-jamming problem in wireless self-organizing networks under multi-source concurrency. Due to limited node buffers and open wireless channels that are susceptible to jamming, critical nodes are prone to buffer overflow and link interruptions when undertaking multi-path data forwarding tasks, exacerbating network congestion. To this end, this article proposes a finite buffer-oriented multi-channel routing anti-jamming decision-making method to address the co-optimization of route planning, channel selection, and congestion control in dynamic jamming and high load environments. On this basis, the route planning and channel decision problem is modeled as a partially observable stochastic game model, and a multichannel routing anti-jamming decision algorithm based on hierarchical deep Q-learning is proposed: the upper layer network optimizes routing paths based on the neighbor buffer states and destination addresses to avoid congestion, the lower layer network combines the route decision-making with the spectral sensing results to dynamically avoid jamming channels, and a reward function is designed to achieve the optimal routing decision based on a fusion of the hop cost, the congestion penalty and the jamming avoidance. Through the design of reward function that combines hop cost, congestion penalty and jamming avoidance, the cooperative optimization of routing and anti-jamming channel access is realized. Simulation results show that compared with the comparison algorithm, the proposed algorithm improves the success rate of packet transmission by 15%, which effectively enhances the reliability and anti-jamming capability of wireless self-organized network in the jamming environment.

Key words: finite buffer, multi-channel routing anti-jamming, distributed decision making, hierarchical deep q-network

中图分类号: