Timezone: »

 
Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning
Chaitanya Kharyal · Tanmay Sinha · Vijaya Sai Krishna Gottipati · Srijita Das · Matthew Taylor
Event URL: https://openreview.net/forum?id=KEH4KSoJh2W »

A long-running challenge in the reinforcement learning (RL) community has been to train a goal-conditioned agent in a sparse reward environment such that it could also generalize to other unseen goals. Empirical results in Fetch-Reach and a novel driving simulator demonstrate that our proposed algorithm, Multi-Teacher Asymmetric Self-Play, allows one agent (i.e., a teacher) to create a successful curriculum for another agent (i.e., the student). Surprisingly, results also show that training with multiple teachers actually helps the student learn faster. Our analysis shows that multiple teachers can provide better coverage of the state space, selecting diverse sets of goals, and better helping a student learn. Moreover, results show that completely new students can learn offline from the goals generated by teachers that trained with a previous student. This is crucial in the context of industrial robotics where repeatedly training a teacher agent is expensive and sometimes infeasible.

Author Information

Chaitanya Kharyal (International Institute of Information Technology Hyderabad)
Tanmay Sinha (International Institute of Information Technology, Hyderabad)
Vijaya Sai Krishna Gottipati (AI Redefined)

RL Researcher at AI-Redefined.

Srijita Das (University of Alberta)
Matthew Taylor (U. of Alberta)

More from the Same Authors

  • 2021 : Structured Low-Rank Tensor Learning »
    Jayadev Naram · Tanmay Sinha · Pawan Kumar
  • 2021 : Safe Evaluation For Offline Learning: \\Are We Ready To Deploy? »
    Hager Radi · Josiah Hanna · Peter Stone · Matthew Taylor
  • 2021 : Safe Evaluation For Offline Learning: \\Are We Ready To Deploy? »
    Hager Radi · Josiah Hanna · Peter Stone · Matthew Taylor
  • 2022 Poster: Multiagent Q-learning with Sub-Team Coordination »
    Wenhan Huang · Kai Li · Kun Shao · Tianze Zhou · Matthew Taylor · Jun Luo · Dongge Wang · Hangyu Mao · Jianye Hao · Jun Wang · Xiaotie Deng
  • 2022 : Fifteen-minute Competition Overview Video »
    Tianpei Yang · Iuliia Kotseruba · Montgomery Alban · Amir Rasouli · Soheil Mohamad Alizadeh Shabestary · Randolph Goebel · Matthew Taylor · Liam Paull · Florian Shkurti
  • 2022 Workshop: Deep Reinforcement Learning Workshop »
    Karol Hausman · Qi Zhang · Matthew Taylor · Martha White · Suraj Nair · Manan Tomar · Risto Vuorio · Ted Xiao · Zeyu Zheng · Manan Tomar
  • 2022 Spotlight: Lightning Talks 5A-3 »
    Minting Pan · Xiang Chen · Wenhan Huang · Can Chang · Zhecheng Yuan · Jianzhun Shao · Yushi Cao · Peihao Chen · Ke Xue · Zhengrong Xue · Zhiqiang Lou · Xiangming Zhu · Lei Li · Zhiming Li · Kai Li · Jiacheng Xu · Dongyu Ji · Ni Mu · Kun Shao · Tianpei Yang · Kunyang Lin · Ningyu Zhang · Yunbo Wang · Lei Yuan · Bo Yuan · Hongchang Zhang · Jiajun Wu · Tianze Zhou · Xueqian Wang · Ling Pan · Yuhang Jiang · Xiaokang Yang · Xiaozhuan Liang · Hao Zhang · Weiwen Hu · Miqing Li · YAN ZHENG · Matthew Taylor · Huazhe Xu · Shumin Deng · Chao Qian · YI WU · Shuncheng He · Wenbing Huang · Chuanqi Tan · Zongzhang Zhang · Yang Gao · Jun Luo · Yi Li · Xiangyang Ji · Thomas Li · Mingkui Tan · Fei Huang · Yang Yu · Huazhe Xu · Dongge Wang · Jianye Hao · Chuang Gan · Yang Liu · Luo Si · Hangyu Mao · Huajun Chen · Jianye Hao · Jun Wang · Xiaotie Deng
  • 2022 Spotlight: Multiagent Q-learning with Sub-Team Coordination »
    Wenhan Huang · Kai Li · Kun Shao · Tianze Zhou · Matthew Taylor · Jun Luo · Dongge Wang · Hangyu Mao · Jianye Hao · Jun Wang · Xiaotie Deng
  • 2022 Competition: Driving SMARTS »
    Amir Rasouli · Matthew Taylor · Iuliia Kotseruba · Tianpei Yang · Randolph Goebel · Soheil Mohamad Alizadeh Shabestary · Montgomery Alban · Florian Shkurti · Liam Paull
  • 2022 Workshop: Reinforcement Learning for Real Life (RL4RealLife) Workshop »
    Yuxi Li · Emma Brunskill · MINMIN CHEN · Omer Gottesman · Lihong Li · Yao Liu · Zhiwei Tony Qin · Matthew Taylor
  • 2021 : Learning Representations for Pixel-based Control: What Matters and Why? »
    Manan Tomar · Utkarsh A Mishra · Amy Zhang · Matthew Taylor
  • 2021 Workshop: Deep Reinforcement Learning »
    Pieter Abbeel · Chelsea Finn · David Silver · Matthew Taylor · Martha White · Srijita Das · Yuqing Du · Andrew Patterson · Manan Tomar · Olivia Watkins
  • 2020 : Contributed Talk: Maximum Reward Formulation In Reinforcement Learning »
    Vijaya Sai Krishna Gottipati · Yashaswi Pathak · Rohan Nuttall · Sahir . · Raviteja Chunduru · Ahmed Touati · Sriram Ganapathi · Matthew Taylor · Sarath Chandar