Skip to yearly menu bar Skip to main content


Poster

PAC-Bayesian Model Selection for Reinforcement Learning

Mahdi Milani Fard · Joelle Pineau


Abstract:

This paper introduces the first set of PAC-Bayesian bounds for the batch reinforcement learning problem in finite state spaces. These bounds hold regardless of the correctness of the prior distribution. We demonstrate how such bounds can be used for model-selection in control problems where prior information is available either on the dynamics of the environment, or on the value of actions. Our empirical results confirm that PAC-Bayesian model-selection is able to leverage prior distributions when they are informative and, unlike standard Bayesian RL approaches, ignores them when they are misleading.

Live content is unavailable. Log in and register to view live content