Timezone: »
Imitation from observation is a paradigm that consists of training agents using observations of expert demonstrations without direct access to the actions. Depending on the problem configuration, these demonstrations can be sequences of states or raw visual observations.One of the most common procedures adopted to solve this problem is to train a reward function from the demonstrations, but this task still remains a significant challenge.We approach this problem with a method of agent behavior representation in a latent space using demonstration videos.Our approach exploits recent algorithms of contrastive learning of image and video and uses a bootstrapping method to progressively train a trajectory encoding function with respect to the variation of the agent policy. This function is then used to compute the rewards provided to a standard Reinforcement Learning (RL) algorithm.Our method uses only a limited number of videos produced by an expert and we do not have access to the expert policy function.Our experiments show promising results on a set of continuous control tasks and demonstrate that learning a behavior encoder from videos allows building an efficient reward function for the agent.
Author Information
Medric Sonwa (University of Montréal)
Johanna Hansen (McGill University)
Eugene Belilovsky (Concordia University / Mila)
More from the Same Authors
-
2022 : Reducing Forgetting in Federated Learning with Truncated Cross-Entropy »
Gwen Legate · Lucas Page-Caccia · Eugene Belilovsky -
2022 : Imitation from Observation With Bootstrapped Contrastive Learning »
Medric Sonwa · Johanna Hansen · Eugene Belilovsky -
2022 : Imitation from Observation With Bootstrapped Contrastive Learning »
Medric Sonwa · Johanna Hansen · Eugene Belilovsky -
2022 : Deceiving the CKA Similarity Measure in Deep Learning »
MohammadReza Davari · Stefan Horoi · Amine Natik · Guillaume Lajoie · Guy Wolf · Eugene Belilovsky -
2022 : Adversarial Attacks on Feature Visualization Methods »
Michael Eickenberg · Eugene Belilovsky · Jonathan Marty -
2023 Poster: $\textbf{A}^2\textbf{CiD}^2$: Accelerating Asynchronous Communication in Decentralized Deep Learning »
Adel Nabli · Eugene Belilovsky · Edouard Oyallon -
2023 Poster: Guiding The Last Layer in Federated Learning with Pre-Trained Models »
Gwen Legate · Nicolas Bernier · Lucas Page-Caccia · Edouard Oyallon · Eugene Belilovsky -
2020 : Q/A and Discussion for Sensing & Sampling Session »
Johanna Hansen · Yogesh A Girdhar · Hannah Kerner · Renaud Detry -
2020 : Sensors and Sampling »
Johanna Hansen -
2020 Workshop: AI for Earth Sciences »
Surya Karthik Mukkavilli · Johanna Hansen · Natasha Dudek · Tom Beucler · Kelly Kochanski · Mayur Mudigonda · Karthik Kashinath · Amy McGovern · Paul D Miller · Chad Frischmann · Pierre Gentine · Gregory Dudek · Aaron Courville · Daniel Kammen · Vipin Kumar -
2020 Workshop: Differentiable computer vision, graphics, and physics in machine learning »
Krishna Murthy Jatavallabhula · Kelsey Allen · Victoria Dean · Johanna Hansen · Shuran Song · Florian Shkurti · Liam Paull · Derek Nowrouzezahrai · Josh Tenenbaum -
2020 : Opening remarks »
Krishna Murthy Jatavallabhula · Kelsey Allen · Johanna Hansen · Victoria Dean -
2019 Poster: Online Continual Learning with Maximal Interfered Retrieval »
Rahaf Aljundi · Eugene Belilovsky · Tinne Tuytelaars · Laurent Charlin · Massimo Caccia · Min Lin · Lucas Page-Caccia -
2018 : Coffee break + posters 1 »
Samuel Myer · Wei-Ning Hsu · Jialu Li · Monica Dinculescu · Lea Schönherr · Ehsan Hosseini-Asl · Skyler Seto · Oiwi Parker Jones · Imran Sheikh · Thomas Manzini · Yonatan Belinkov · Nadir Durrani · Alexander Amini · Johanna Hansen · Gabi Shalev · Jamin Shin · Paul Smolensky · Lisa Fan · Zining Zhu · Hamid Eghbal-zadeh · Benjamin Baer · Abelino Jimenez · Joao Felipe Santos · Jan Kremer · Erik McDermott · Andreas Krug · Tzeviya S Fuchs · Shuai Tang · Brandon Carter · David Gifford · Albert Zeyer · André Merboldt · Krishna Pillutla · Katherine Lee · Titouan Parcollet · Orhan Firat · Gautam Bhattacharya · JAHANGIR ALAM · Mirco Ravanelli