Poster

Nonparametric Bayesian Inverse Reinforcement Learning for Multiple Reward Functions

Jaedeug Choi · Kee-Eung Kim

Harrah’s Special Events Center 2nd Floor

Abstract:

We present a nonparametric Bayesian approach to inverse reinforcement learning (IRL) for multiple reward functions. Most previous IRL algorithms assume that the behaviour data is obtained from an agent optimizing a single reward function, but this assumption is rarely met in practice. Our approach integrates the Dirichlet process mixture model into Bayesian IRL. We provide an efficient Metropolis-Hastings sampling algorithm that utilizes the gradient of the posterior to estimate the underlying reward functions, and demonstrate through experiments on a number of problem domains that our approach outperforms previous methods.
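The abstract mentions a Metropolis-Hastings sampler that uses the gradient of the posterior. As a rough illustration of that idea only (the page does not include the authors' actual algorithm), the sketch below shows a generic Langevin-style MH update in Python; `log_post`, `grad_log_post`, and the step size are hypothetical placeholders for the reward posterior the paper derives.

```python
import numpy as np

def mala_step(reward, log_post, grad_log_post, step=0.01, rng=None):
    """One Metropolis-Hastings update with a gradient-informed (Langevin)
    proposal. `log_post` / `grad_log_post` stand in for the reward
    posterior and its gradient, which this page does not spell out."""
    rng = np.random.default_rng() if rng is None else rng
    # Langevin proposal: drift along the posterior gradient, plus noise.
    prop = (reward + 0.5 * step * grad_log_post(reward)
            + np.sqrt(step) * rng.normal(size=reward.shape))

    # The drifted proposal is asymmetric, so the MH acceptance ratio
    # needs both the forward and reverse proposal densities.
    def log_q(dst, src):
        diff = dst - src - 0.5 * step * grad_log_post(src)
        return -np.dot(diff, diff) / (2.0 * step)

    log_alpha = (log_post(prop) - log_post(reward)
                 + log_q(reward, prop) - log_q(prop, reward))
    if np.log(rng.uniform()) < log_alpha:
        return prop, True   # accept the proposed reward
    return reward, False    # keep the current reward

# Toy check: sample from a standard-normal "posterior" over a 3-dim reward.
if __name__ == "__main__":
    log_post = lambda r: -0.5 * np.dot(r, r)
    grad_log_post = lambda r: -r
    r = np.zeros(3)
    for _ in range(1000):
        r, _ = mala_step(r, log_post, grad_log_post, step=0.1)
    print("final sample:", r)
```

The gradient drift steers proposals toward high-posterior reward functions, which is what makes such a sampler efficient relative to a plain random-walk proposal.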
