Multi-agent imitation learning (MAIL) refers to the problem in which agents learn to perform a task interactively in a multi-agent system by observing and mimicking expert demonstrations, without any knowledge of a reward function from the environment. MAIL has received a lot of attention due to promising results achieved on synthesized tasks, with the potential to be applied to complex real-world multi-agent tasks. Key challenges for MAIL include sample efficiency and scalability. In this paper, we propose Bayesian multi-type mean field multi-agent imitation learning (BM3IL). Our method improves sample efficiency by establishing a Bayesian formulation for MAIL, and enhances scalability by introducing a new multi-type mean field approximation. We demonstrate the performance of our algorithm by benchmarking it against three state-of-the-art multi-agent imitation learning algorithms on several tasks, including solving a multi-agent traffic optimization problem in a real-world transportation network. Experimental results indicate that our algorithm significantly outperforms all other algorithms in all scenarios.
Fan Yang (University at Buffalo)
Alina Vereshchaka (University at Buffalo)
Changyou Chen (University at Buffalo)
Wen Dong (University at Buffalo)
Wen Dong is an Assistant Professor of Computer Science and Engineering at the State University of New York at Buffalo with a joint appointment in the Institute of Sustainable Transportation and Logistics. He focuses on modeling human interaction dynamics with stochastic process theory by combining the power of “big data” with the logic and reasoning power of agent-based models, to solve society's most challenging problems, such as transportation sustainability and efficiency. Wen Dong holds a Ph.D. in Media Arts and Sciences from the Massachusetts Institute of Technology. His email address is firstname.lastname@example.org.
Related Events (a corresponding poster, oral, or spotlight)
2020 Spotlight: Bayesian Multi-type Mean Field Multi-agent Imitation Learning
Tue. Dec 8th 04:00 -- 04:10 AM Room Orals & Spotlights: Reinforcement Learning
More from the Same Authors
2020 Poster: Learning Manifold Implicitly via Explicit Heat-Kernel Learning
Yufan Zhou · Changyou Chen · Jinhui Xu
2017 Poster: Expectation Propagation with Stochastic Kinetic Model in Complex Interaction Systems
Le Fang · Fan Yang · Wen Dong · Tong Guan · Chunming Qiao
2016 Poster: Using Social Dynamics to Make Individual Predictions: Variational Inference with a Stochastic Kinetic Model
Zhen Xu · Wen Dong · Sargur N Srihari