Xuguang TIAN, Qinwen ZUO, Ding WANG, et al. A Path Planning Algorithm based on Deep Reinforcement Learning with Integrated A* Heuristic Search[J]. Acta Armamentarii, 2025, 46(S2): 250493.
DOI:
Xuguang TIAN, Qinwen ZUO, Ding WANG, et al. A Path Planning Algorithm based on Deep Reinforcement Learning with Integrated A* Heuristic Search[J]. Acta Armamentarii, 2025, 46(S2): 250493. DOI: 10.12382/bgxb.2025.0493.
A Path Planning Algorithm based on Deep Reinforcement Learning with Integrated A* Heuristic Search
To address the challenge of efficient path planning for agents in complex environments
this paper proposes a deep reinforcement learning path planning algorithm that integrates A
*
heuristic search. An enhanced deep Q-learning network (DQN) model is constructed by combining target networks
advantage networks and noise networks.A priority experience cache mechanism that incorporates A
*
prior knowledge is presented
enabling the agents to leverage the historical experience more effectively and reduce the initial exploration blindness.Additionally
an action selection strategy that incorporates A
*
heuristic search is designed.The A
*
algorithm’s heuristic search is introduced during action selection to avoiding local optima
thereby improving the efficiency and stability of path planning.The proposed algorithm is experimentally verified in multiple simulation environments
and compared with the traditional path planning algorithms.The results demonstrate that the proposed algorithm has significant improvements in path planning efficiency and stability compared to the conventional DQN algorithms.
LIU X L , LI H , GUO Z G . Workshop dynamic production capacity evaluation and implementation based on Dijkstra’s algorithm [J ] . Microcomputer Information , 2006 ( 12 ): 96 - 98 . (in Chinese)
ZHANG Y M , WANG J , FU Z X . Research on AGV path planning based on improved A * algorithm [J ] . Journal of Qingdao University (Engineering & Technology Edition ), 2024 , 39 ( 3 ): 13 - 19 . (in Chinese)
CHOSET H , LYNCH K , HUTCHINSON S , et al. Principles of robot motion:theory,algorithms,and implementations [M ] . Cambridge,MA ,US:MIT Press:512.
ZHAO X P , WANG G Q . Research on vehicle formation obstacle avoidance based on the improved artificial potential field method [J ] . Electronic Measurement Technology , 2025 , 48 ( 18 ): 13 - 19 . (in Chinese)
WANG H Y , CAO Y G , WANG X W . Research on motion planning of pomelo-picking dual- arm robot based on probability sampling EIT * algorithm and artificial potential field method [J ] . Journal of Zhejiang Sci-Tech University (Natural Science Edition) , 2025 , 53 ( 5 ): 720 - 730 . (in Chinese)
LAVALLE S . Rapidly-exploring random trees:a new tool for path planning [J ] . The Annual Research Report , 1998 : 11 - 39 .
CHEN M H . Research on automated terminal path planning based on hybrid genetic ant colony algorithm [D ] . Shanghai : East China Normal University , 2024 . (in Chinese)
SUTTON R S , BARTO A G . Introduction to reinforcement learning [M ] . Cambridge,MA , US : MIT Press ,1998.
LIU W T , SU S , TANG T , et al. A DQN-based intelligent control method for heavy haul trains on long steep downhill section [J ] . Transportation Research Part C:Emerging Technologies , 2021 , 129 : 103249 .
KOVARI B , ANGYAL B G , BECSI T . Deep reinforcement learning combined with RRT for trajectory tracking of autonomous vehicles [J ] . Transportation Research Procedia , 2024 , 78 : 246 - 253 .
ZHOU S Y , LIU X , XU Y F , et al. A deep Q-network (DQN) based path planning method for mobile robots [C ] //Proceedings of the International Conference on Information and Automation.Wuyishan, China:IEEE , 2018 : 366 - 371 .
VAN HASSELT H , GUEZ A , SIlVER D . Deep reinforcement learning with double Q-learning [J ] . Proceedings of the AAAI Conference on Artificial Intelligence.Phoenix,AZ US:AAAI Press , 2016 , 30 ( 1 ): 2094 - 2100 .
LUO L , ZHAO N , ZHU Y , et al. A * guiding DQN algorithm for automated guided vehicle pathfinding problem of robotic mobile fulfillment systems [J ] . Computers & Industrial Engineering , 2023 , 178 : 109112 .
ZHENG C W , HOU L Y . Dynamic obstacle avoidance path planning based on improved DQN [J ] . Journal of Beijing Information Science & Technology University (Natural Science Edition) , 2024 , 39 ( 5 ): 14 - 22 . (in Chinese)
LI L H , ZHANG S J , DU D L . Path planning via an improved DQN-based learning policy [J ] . IEEE Access , 2019 , 7 : 67319 - 67330 .
ZHANG F , GU C C , YANG F . An improved algorithm of robot path planning in complex environment based on double DQN [J ] . Lecture Notes in Electrical Engineering Advances in Guidance,Navigation and Control , 2021 , 644 : 303 - 313 .
SCHAUL T , QUAN J I , ANTONOGLOU L , et al. Prioritized experience replay:arXiv:1511.05952 [R ] . Ithaca,NY , US : Cornell University , 2015 : 1511 .05952.
MNIH V , KAVUKCUOGLU K , SILVER D , et al. Human-level control through deep reinforcement learning [J ] . Nature , 2015 , 518 : 529 - 533 .
WANG Z Y , SCHAUL T , HESSEL M . Dueling network architectures for deep reinforcement learning:arXiv:1511.06581 [R ] . Ithaca,NY , US : Cornell University , 2015 : 1511 .06581.
FORTUNATO M , AZAR M G , PIOT B , et al. Noisy networks for exploration [C ] //Proceedings of the 6th International Conference on Learning Representations.Vancouver,BC,Canada:OpenReview. net , 2018 .
HESSEL M , MODAYIL J , VAN HASSELT H , et al. Rainbow:Combining improvements in deep reinforcement learning arXiv:1710.02298 [R ] . Ithaca,NY , US : Cornell University , 2017 : 1710 .02298.
LIN L J Reinforcement learning for robots using neural networks [R ] . Pittsburgh,PA , US : Carnegie-Mellon University ,1993.