Acta Armamentarii ›› 2021, Vol. 42 ›› Issue (12): 2655-2663. doi: 10.3969/j.issn.1000-1093.2021.12.013

• Paper

A Deep Deterministic Policy Gradient Optimization Approach for Multi-users Data Offloading in Wireless Powered Communication Network

GENG Tianli 1,2, GAO Ang 1,2, WANG Qi 1,2, DUAN Weijun 1,2, HU Yansu 3

  1. School of Electronics and Information, Northwestern Polytechnical University, Xi'an 710072, Shaanxi, China; 2. State-Province Joint Engineering Laboratory of IoT Technology and Application, Xi'an 710072, Shaanxi, China; 3. School of Electronic Control, Chang'an University, Xi'an 710072, Shaanxi, China
  • Online: 2022-01-15

Abstract: In a wireless powered communication network (WPCN), wireless devices can offload data through wireless backscattering and active radio frequency (RF) transmission. Choosing the working mode and allocating time properly between ambient backscattering and active RF transmission is a major challenge in reducing system transmission delay and improving transmission efficiency. A deep deterministic policy gradient (DDPG) algorithm is proposed to search for the best time allocation in a continuous domain, taking into account the data size, the channel conditions, and the fairness among wireless devices. The experimental results show that the DDPG algorithm converges within a finite number of time steps, and that introducing the Jain fairness index allows all wireless devices to complete data offloading at the same time. Compared with the traditional Round-Robin and Greedy algorithms, the DDPG algorithm reduces the average transmission delay by 77.7% and 24.2%, respectively, and energy efficiency improves substantially, especially for wireless devices with a small amount of data to offload.
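The abstract names the Jain fairness index without stating it. As a minimal sketch, the standard definition for n non-negative values x_1..x_n is J = (sum x_i)^2 / (n * sum x_i^2), which equals 1 when all values are equal and falls toward 1/n as one value dominates. The function name, the choice of per-device completion time as the measured quantity, and the sample numbers below are assumptions for illustration only; the abstract does not say which quantity the index is applied to or how it enters the DDPG reward.

```python
from typing import Sequence


def jain_fairness_index(values: Sequence[float]) -> float:
    """Standard Jain fairness index: (sum x)^2 / (n * sum x^2)."""
    n = len(values)
    if n == 0:
        raise ValueError("need at least one value")
    total = sum(values)
    sum_sq = sum(v * v for v in values)
    if sum_sq == 0:
        # All values are zero: treated as perfectly equal (a convention assumed here).
        return 1.0
    return (total * total) / (n * sum_sq)


# Hypothetical completion times in seconds for four devices (not from the paper).
equal_finish = [2.0, 2.0, 2.0, 2.0]
uneven_finish = [0.5, 1.0, 2.0, 4.5]

print(jain_fairness_index(equal_finish))   # devices finish together
print(jain_fairness_index(uneven_finish))  # devices finish at different times
```

An index close to 1 corresponds to the behaviour the abstract reports, where all devices complete offloading at the same time.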

Key words: backscattering, data offloading, deep deterministic policy gradient, reinforcement learning
