Linear Feature Encoding for Reinforcement Learning
Zhao Song · Ronald Parr · Xuejun Liao · Lawrence Carin

Tue Dec 06 09:00 AM -- 12:30 PM (PST) @ Area 5+6+7+8 #99

Feature construction is of vital importance in reinforcement learning, as the quality of a value function or policy is largely determined by the corresponding features. The recent successes of deep reinforcement learning (RL) only increase the importance of understanding feature construction. Typical deep RL approaches use a linear output layer, which means that deep RL can be interpreted as a feature construction/encoding network followed by linear value function approximation. This paper develops and evaluates a theory of linear feature encoding. We extend theoretical results on feature quality for linear value function approximation from the uncontrolled case to the controlled case. We then develop a supervised linear feature encoding method that is motivated by insights from linear value function approximation theory, as well as empirical successes from deep RL. The resulting encoder is a surprisingly effective method for linear value function approximation using raw images as inputs.
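The claim that value function quality is largely determined by the features can be illustrated with a small sketch of linear value function approximation in the uncontrolled case. This is not the encoder proposed in the paper: the 3-state Markov chain, its rewards, the two feature sets, and the helper names (`dot`, `values`, `expected_td0`, `bellman_residual`) are all assumptions made up for illustration. The sketch fits V(s) = phi(s) . w with expected TD(0) updates and compares the Bellman residual obtained with one-hot features against a single constant feature.

```python
# Illustrative only: generic linear value function approximation on fixed
# features. The chain, rewards, and features below are hypothetical.

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def values(features, w):
    """V(s) = phi(s) . w for every state s."""
    return [dot(phi, w) for phi in features]

def expected_td0(features, P, R, gamma, alpha=0.5, sweeps=2000):
    """Expected (model-based) TD(0) updates, states weighted uniformly."""
    n, k = len(features), len(features[0])
    w = [0.0] * k
    for _ in range(sweeps):
        v = values(features, w)
        # TD error per state: r(s) + gamma * E[V(s')] - V(s)
        delta = [R[s] + gamma * dot(P[s], v) - v[s] for s in range(n)]
        # Semi-gradient step: w += alpha * sum_s delta(s) * phi(s)
        for j in range(k):
            w[j] += alpha * sum(delta[s] * features[s][j] for s in range(n))
    return w

def bellman_residual(features, w, P, R, gamma):
    """Largest absolute Bellman error of the fitted value function."""
    v = values(features, w)
    return max(abs(R[s] + gamma * dot(P[s], v) - v[s]) for s in range(len(R)))

# Hypothetical 3-state chain. P is doubly stochastic, so the uniform state
# weighting used above matches the chain's stationary distribution.
P = [[0.5, 0.5, 0.0],
     [0.0, 0.5, 0.5],
     [0.5, 0.0, 0.5]]
R = [0.0, 0.0, 1.0]
gamma = 0.9

# Rich features: one indicator per state (can represent any value function).
one_hot = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
# Poor features: one constant feature (all states share a single value).
constant = [[1.0], [1.0], [1.0]]

w_one_hot = expected_td0(one_hot, P, R, gamma)
w_constant = expected_td0(constant, P, R, gamma)

residual_one_hot = bellman_residual(one_hot, w_one_hot, P, R, gamma)
residual_constant = bellman_residual(constant, w_constant, P, R, gamma)

print("one-hot residual: ", residual_one_hot)
print("constant residual:", residual_constant)
```

In this toy setting the one-hot features drive the Bellman residual to essentially zero, while the constant feature leaves a large residual no matter how the weight is fit. In the interpretation given above, a deep RL network's layers before the linear output play the role of `features` here.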

Author Information

Zhao Song (Duke University)
Ronald Parr (Duke University)
Xuejun Liao (Duke University)
Lawrence Carin (KAUST)
