Timezone: »

Generalizable Imitation Learning from Observation via Inferring Goal Proximity
Youngwoon Lee · Andrew Szot · Shao-Hua Sun · Joseph Lim

Tue Dec 07 08:30 AM -- 10:00 AM (PST) @

Task progress is intuitive and readily available task information that can guide an agent closer to the desired goal. Furthermore, a task progress estimator can generalize to new situations. From this intuition, we propose a simple yet effective imitation learning from observation method for a goal-directed task using a learned goal proximity function as a task progress estimator for better generalization to unseen states and goals. We obtain this goal proximity function from expert demonstrations and online agent experience, and then use the learned goal proximity as a dense reward for policy training. We demonstrate that our proposed method can robustly generalize compared to prior imitation learning methods on a set of goal-directed tasks in navigation, locomotion, and robotic manipulation, even with demonstrations that cover only a part of the states.

Author Information

Youngwoon Lee (University of Southern California)
Andrew Szot (Georgia Institute of Technology)
Shao-Hua Sun (University of Southern California)
Joseph Lim (MIT)

More from the Same Authors