IL-flOw: Imitation Learning from Observation using Normalizing Flows
Wei-Di Chang · Juan Camilo Gamboa Higuera · Scott Fujimoto · David Meger · Gregory Dudek

We present an algorithm for Inverse Reinforcement Learning (IRL) from expert state observations only. Unlike state-of-the-art adversarial methods, which must update the reward model during policy search and are known to be unstable and difficult to optimize, our algorithm decouples reward modelling from policy learning. Our method, IL-flOw, recovers the expert policy by modelling state-to-state transitions: it generates rewards using deep density estimators trained on the demonstration trajectories, which avoids the instability of adversarial methods. We show that using the log-probability density of state transitions as the reward signal for forward reinforcement learning amounts to matching the trajectory distribution of the expert demonstrations. Experimentally, we show good recovery of the true reward signal as well as state-of-the-art results for imitation from observation on locomotion and robotic continuous control tasks.
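The decoupling described in the abstract can be illustrated with a small sketch: fit a density model to expert state transitions once, then use its log-density as a fixed reward. This is not the authors' implementation. A full-covariance Gaussian over the concatenated pair (s, s') stands in for the normalizing flow, the toy trajectory is synthetic, and every name here (`fit_gaussian`, `log_density`, `make_reward_fn`) is hypothetical.

```python
import numpy as np

def fit_gaussian(transitions, ridge=1e-6):
    """Fit a full-covariance Gaussian to rows of [s, s_next].

    Assumption: this simple model replaces the normalizing flow
    used as the density estimator in the paper.
    """
    mean = transitions.mean(axis=0)
    cov = np.cov(transitions, rowvar=False)
    cov = cov + ridge * np.eye(transitions.shape[1])  # keep cov positive definite
    return mean, cov

def log_density(x, mean, cov):
    """Gaussian log-density evaluated at each row of x."""
    d = mean.shape[0]
    chol = np.linalg.cholesky(cov)
    z = np.linalg.solve(chol, (x - mean).T)  # whitened residuals, shape (d, n)
    maha = np.sum(z ** 2, axis=0)
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))
    return -0.5 * (maha + log_det + d * np.log(2.0 * np.pi))

def make_reward_fn(expert_states):
    """Build a fixed reward r(s, s') = log p(s, s') from one expert trajectory."""
    pairs = np.hstack([expert_states[:-1], expert_states[1:]])
    mean, cov = fit_gaussian(pairs)

    def reward(s, s_next):
        x = np.atleast_2d(np.concatenate([s, s_next], axis=-1))
        return log_density(x, mean, cov)

    return reward

# Synthetic "expert": a 2-D state that drifts forward by about 0.1 per step.
rng = np.random.default_rng(0)
steps = 0.1 + 0.01 * rng.standard_normal((500, 2))
expert_states = np.cumsum(steps, axis=0)

# The reward model is trained once, before any policy learning takes place.
reward_fn = make_reward_fn(expert_states)

# An observed expert transition versus a backward jump from the same state.
r_expert = reward_fn(expert_states[100], expert_states[101])[0]
r_backward = reward_fn(expert_states[100], expert_states[100] - 1.0)[0]
```

Because `reward_fn` stays fixed after fitting, any standard forward reinforcement learning algorithm could consume it without the alternating reward updates that adversarial methods need.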

Author Information

Wei-Di Chang (McGill University)
Juan Camilo Gamboa Higuera (McGill University)
Scott Fujimoto (McGill University)
David Meger (McGill University)
Gregory Dudek (McGill University & Samsung Research)

More from the Same Authors

  • 2021 Spotlight: A Minimalist Approach to Offline Reinforcement Learning »
    Scott Fujimoto · Shixiang (Shane) Gu
  • 2022 : Bayesian Q-learning With Imperfect Expert Demonstrations »
    Fengdi Che · Xiru Zhu · Doina Precup · David Meger · Gregory Dudek
  • 2022 : Learning Successor Feature Representations to Train Robust Policies for Multi-task Learning »
    Melissa Mozifian · Dieter Fox · David Meger · Fabio Ramos · Animesh Garg
  • 2022 Poster: Continuous MDP Homomorphisms and Homomorphic Policy Gradient »
    Sahand Rezaei-Shoshtari · Rosie Zhao · Prakash Panangaden · David Meger · Doina Precup
  • 2021 Poster: A Minimalist Approach to Offline Reinforcement Learning »
    Scott Fujimoto · Shixiang (Shane) Gu
  • 2020 Workshop: AI for Earth Sciences »
    Surya Karthik Mukkavilli · Johanna Hansen · Natasha Dudek · Tom Beucler · Kelly Kochanski · Mayur Mudigonda · Karthik Kashinath · Amy McGovern · Paul D Miller · Chad Frischmann · Pierre Gentine · Gregory Dudek · Aaron Courville · Daniel Kammen · Vipin Kumar
  • 2020 Poster: An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay »
    Scott Fujimoto · David Meger · Doina Precup
  • 2020 Poster: 3D Shape Reconstruction from Vision and Touch »
    Edward Smith · Roberto Calandra · Adriana Romero · Georgia Gkioxari · David Meger · Jitendra Malik · Michal Drozdzal
  • 2019 : Poster Session »
    Matthia Sabatelli · Adam Stooke · Amir Abdi · Paulo Rauber · Leonard Adolphs · Ian Osband · Hardik Meisheri · Karol Kurach · Johannes Ackermann · Matt Benatan · GUO ZHANG · Chen Tessler · Dinghan Shen · Mikayel Samvelyan · Riashat Islam · Murtaza Dalal · Luke Harries · Andrey Kurenkov · Konrad Żołna · Sudeep Dasari · Kristian Hartikainen · Ofir Nachum · Kimin Lee · Markus Holzleitner · Vu Nguyen · Francis Song · Christopher Grimm · Felipe Leno da Silva · Yuping Luo · Yifan Wu · Alex Lee · Thomas Paine · Wei-Yang Qu · Daniel Graves · Yannis Flet-Berliac · Yunhao Tang · Suraj Nair · Matthew Hausknecht · Akhil Bagaria · Simon Schmitt · Bowen Baker · Paavo Parmas · Benjamin Eysenbach · Lisa Lee · Siyu Lin · Daniel Seita · Abhishek Gupta · Riley Simmons-Edler · Yijie Guo · Kevin Corder · Vikash Kumar · Scott Fujimoto · Adam Lerer · Ignasi Clavera Gilaberte · Nicholas Rhinehart · Ashvin Nair · Ge Yang · Lingxiao Wang · Sungryull Sohn · J. Fernando Hernandez-Garcia · Xian Yeow Lee · Rupesh Srivastava · Khimya Khetarpal · Chenjun Xiao · Luckeciano Carvalho Melo · Rishabh Agarwal · Tianhe Yu · Glen Berseth · Devendra Singh Chaplot · Jie Tang · Anirudh Srinivasan · Tharun Kumar Reddy Medini · Aaron Havens · Misha Laskin · Asier Mujika · Rohan Saphal · Joseph Marino · Alex Ray · Joshua Achiam · Ajay Mandlekar · Zhuang Liu · Danijar Hafner · Zhiwen Tang · Ted Xiao · Michael Walton · Jeff Druce · Ferran Alet · Zhang-Wei Hong · Stephanie Chan · Anusha Nagabandi · Hao Liu · Hao Sun · Ge Liu · Dinesh Jayaraman · John Co-Reyes · Sophia Sanborn
  • 2018 Poster: Multi-View Silhouette and Depth Decomposition for High Resolution 3D Object Representation »
    Edward Smith · Scott Fujimoto · David Meger
  • 2017 : Poster session »
    Xun Zheng · Tim G. J. Rudner · Christopher Tegho · Patrick McClure · Yunhao Tang · ASHWIN D'CRUZ · Juan Camilo Gamboa Higuera · Chandra Sekhar Seelamantula · Jhosimar Arias Figueroa · Andrew Berlin · Maxime Voisin · Alexander Amini · Thang Long Doan · Hengyuan Hu · Aleksandar Botev · Niko Suenderhauf · CHI ZHANG · John Lambert
  • 2013 Demonstration: Topic Modeling for Robots »
    Yogesh A Girdhar · Gregory Dudek