Timezone: »
Poster
Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees
Gregory Dexter · Kevin Bello · Jean Honorio
Inverse Reinforcement Learning (IRL) is the problem of finding a reward function which describes observed/known expert behavior. The IRL setting is remarkably useful for automated control, in situations where the reward function is difficult to specify manually or as a means to extract agent preference. In this work, we provide a new IRL algorithm for the continuous state space setting with unknown transition dynamics by modeling the system using a basis of orthonormal functions. Moreover, we provide a proof of correctness and formal guarantees on the sample and time complexity of our algorithm. Finally, we present synthetic experiments to corroborate our theoretical guarantees.
Author Information
Gregory Dexter (Purdue University)
Kevin Bello (University of Chicago & Carnegie Mellon University)
Jean Honorio (Purdue University)
More from the Same Authors
-
2021 Spotlight: Fair Sparse Regression with Clustering: An Invex Relaxation for a Combinatorial Problem »
Adarsh Barik · Jean Honorio -
2021 Poster: Fair Sparse Regression with Clustering: An Invex Relaxation for a Combinatorial Problem »
Adarsh Barik · Jean Honorio -
2020 Poster: Fairness constraints can help exact inference in structured prediction »
Kevin Bello · Jean Honorio -
2019 Poster: Learning Bayesian Networks with Low Rank Conditional Probability Tables »
Adarsh Barik · Jean Honorio -
2019 Poster: On the Correctness and Sample Complexity of Inverse Reinforcement Learning »
Abi Komanduru · Jean Honorio -
2019 Poster: Exact inference in structured prediction »
Kevin Bello · Jean Honorio -
2018 Poster: Learning latent variable structured prediction models with Gaussian perturbations »
Kevin Bello · Jean Honorio -
2018 Poster: Information-theoretic Limits for Community Detection in Network Models »
Chuyang Ke · Jean Honorio -
2018 Poster: Computationally and statistically efficient learning of causal Bayes nets using path queries »
Kevin Bello · Jean Honorio -
2017 Poster: Learning Identifiable Gaussian Bayesian Networks in Polynomial Time and Sample Complexity »
Asish Ghoshal · Jean Honorio