Timezone: »
Due to its high sample complexity, simulation is, as of today, critical for the successful application of reinforcement learning. Many real-world problems, however, exhibit overly complex dynamics, making their full-scale simulation computationally slow. In this paper, we show how to factorize large networked systems of many agents into multiple local regions such that we can build separate simulators that run independently and in parallel. To monitor the influence that the different local regions exert on one another, each of these simulators is equipped with a learned model that is periodically trained on real trajectories. Our empirical results reveal that distributing the simulation among different processes not only makes it possible to train large multi-agent systems in just a few hours but also helps mitigate the negative effects of simultaneous learning.
Author Information
Miguel Suau (Delft University of Technology)
Jinke He (Delft University of Technology)
Mustafa Mert Çelikok (Aalto University)
Matthijs Spaan (Delft University of Technology)
Frans Oliehoek (TU Delft)
More from the Same Authors
-
2021 : Offline Contextual Bandits for Wireless Network Optimization »
Miguel Suau -
2022 : Differentiable User Models »
Alex Hämäläinen · Mustafa Mert Çelikok · Samuel Kaski -
2020 Poster: Influence-Augmented Online Planning for Complex Environments »
Jinke He · Miguel Suau · Frans Oliehoek -
2020 Poster: MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning »
Elise van der Pol · Daniel E Worrall · Herke van Hoof · Frans Oliehoek · Max Welling -
2020 Poster: Multi-agent active perception with prediction rewards »
Mikko Lauri · Frans Oliehoek -
2019 : Poster Presentations »
Rahul Mehta · Andrew Lampinen · Binghong Chen · Sergio Pascual-Diaz · Jordi Grau-Moya · Aldo Faisal · Jonathan Tompson · Yiren Lu · Khimya Khetarpal · Martin Klissarov · Pierre-Luc Bacon · Doina Precup · Thanard Kurutach · Aviv Tamar · Pieter Abbeel · Jinke He · Maximilian Igl · Shimon Whiteson · Wendelin Boehmer · Raphaël Marinier · Olivier Pietquin · Karol Hausman · Sergey Levine · Chelsea Finn · Tianhe Yu · Lisa Lee · Benjamin Eysenbach · Emilio Parisotto · Eric Xing · Ruslan Salakhutdinov · Hongyu Ren · Anima Anandkumar · Deepak Pathak · Christopher Lu · Trevor Darrell · Alexei Efros · Phillip Isola · Feng Liu · Bo Han · Gang Niu · Masashi Sugiyama · Saurabh Kumar · Janith Petangoda · Johan Ferret · James McClelland · Kara Liu · Animesh Garg · Robert Lange -
2019 Poster: Machine Teaching of Active Sequential Learners »
Tomi Peltola · Mustafa Mert Çelikok · Pedram Daee · Samuel Kaski -
2011 Poster: Efficient Offline Communication Policies for Factored Multiagent POMDPs »
João V Messias · Matthijs Spaan · Pedro U Lima