Resource Allocation in Wireless Control Systems via Deep Policy Gradient
Vinicius Lima, Mark Eisen, Konstantinos Gatsis, Alejandro Ribeiro
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 12:29
In wireless control systems, remote control of plants is achieved through closing of the control loop over a wireless channel. As wireless communication is noisy and subject to packet dropouts, proper allocation of limited resources, e.g. transmission power, across plants is critical for maintaining reliable operation. In this paper, we formulate the design of an optimal resource allocation policy that uses current plant states and wireless channel conditions to assign resources used to send control actuation information back to plants. While this problem is challenging due to its infinite dimensionality and need for explicit system model and state knowledge, we propose the use of deep reinforcement learning techniques to find data-driven resource allocation policies. In particular, we use model-free policy gradient methods to directly learn continuous power allocation policies without knowledge of plant dynamics or communication models. Numerical simulations demonstrate the strong performance of learned policies relative to baseline resource allocation methods.