Timezone: »
Learning from demonstration methods usually leverage close to optimal demonstrations to accelerate training. By contrast, when demonstrating a task, human teachers deviate from optimal demonstrations and pedagogically modify their behavior by giving demonstrations that best disambiguate the goal they want to demonstrate. Analogously, human learners excel at pragmatically inferring the intent of the teacher, facilitating communication between the two agents. These mechanisms are critical in the few demonstrations regime, where inferring the goal is more difficult. In this paper, we implement pedagogy and pragmatism mechanisms by leveraging a Bayesian model of Goal Inference from demonstrations. We highlight the benefits of this model in multi-goal teacher-learner setups with two artificial agents that learn with goal-conditioned Reinforcement Learning. We show that combining BGI-agents (a pedagogical teacher and a pragmatic learner) results in faster learning and reduced goal ambiguity over standard learning from demonstrations, especially in the few demonstrations regime.
Author Information
Hugo Caselles-Dupré (ISIR (Sorbonne Université))
Postdoc working on Reinforcement Learning and Developmental Robotics.
Olivier Sigaud (Sorbonne University)
Mohamed CHETOUANI (ISIR, UMR 7222)
More from the Same Authors
-
2021 : Learning Collective Action under Risk Diversity »
Ramona Merhej · Fernando Santos · Francisco S. Melo · Mohamed CHETOUANI · Francisco Santos -
2022 : Overcoming Referential Ambiguity in language-guided goal-conditioned Reinforcement Learning »
Hugo Caselles-Dupré · Olivier Sigaud · Mohamed CHETOUANI -
2022 Poster: EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL »
Thomas Carta · Pierre-Yves Oudeyer · Olivier Sigaud · Sylvain Lamprier -
2020 : Poster Session »
Kwanyoung Park · Haizi Yu · Alban Laflaquière · Yizhou Zhang · Hugo Caselles-Dupré · Charlie Snell · Philip Ball · Jhoseph Shin · Jelena Sucevic · Kezhen Chen · Won-Seok Choi · Eon-Suk Ko · Xu Ji -
2019 Poster: Symmetry-Based Disentangled Representation Learning requires Interaction with Environments »
Hugo Caselles-Dupré · Michael Garcia Ortiz · David Filliat -
2019 Poster: Learning Compositional Neural Programs with Recursive Tree Search and Planning »
Thomas PIERROT · Guillaume Ligner · Scott Reed · Olivier Sigaud · Nicolas Perrin · Alexandre Laterre · David Kas · Karim Beguir · Nando de Freitas -
2019 Spotlight: Learning Compositional Neural Programs with Recursive Tree Search and Planning »
Thomas PIERROT · Guillaume Ligner · Scott Reed · Olivier Sigaud · Nicolas Perrin · Alexandre Laterre · David Kas · Karim Beguir · Nando de Freitas