Mean-Field Games (MFGs) serve as a crucial mathematical framework for modeling the collective behavior of individual agents interacting stochastically with a large population. In this work, we aim to solve a challenging class of MFGs in which the differentiability of the agents' interaction preferences may not be available to the solver, and the population is required to converge exactly to a desired distribution. These setups, despite being well motivated for practical purposes, are complicated enough to paralyze most (deep) numerical solvers. Nevertheless, we show that the Schrödinger Bridge, as an entropy-regularized optimal transport model, can be generalized to accept mean-field structures, and hence to solve these MFGs. This is achieved by applying Forward-Backward Stochastic Differential Equations theory, which, intriguingly, leads to a computational framework with a structure similar to Temporal Difference learning. As such, it opens up novel algorithmic connections to Deep Reinforcement Learning that we leverage to facilitate practical training. We show that our proposed objective function provides necessary and sufficient conditions for the mean-field problem. Our method, named Deep Generalized Schrödinger Bridge (DeepGSB), not only outperforms prior methods in solving classical population navigation MFGs, but is also capable of solving 1000-dimensional opinion depolarization, establishing a new state of the art among numerical solvers for high-dimensional MFGs. Our code will be made available at https://github.com/ghliu/DeepGSB.
Author Information
Guan-Horng Liu (Georgia Institute of Technology)
Tianrong Chen (Georgia Institute of Technology)
Oswin So (Massachusetts Institute of Technology)
Evangelos Theodorou (Georgia Institute of Technology)
More from the Same Authors
- 2021 Spotlight: Second-Order Neural ODE Optimizer
  Guan-Horng Liu · Tianrong Chen · Evangelos Theodorou
- 2021 : Likelihood Training of Schrödinger Bridges using Forward-Backward SDEs Theory
  Tianrong Chen · Guan-Horng Liu · Evangelos Theodorou
- 2022 : Data-driven discovery of non-Newtonian astronomy via learning non-Euclidean Hamiltonian
  Oswin So · Gongjie Li · Evangelos Theodorou · Molei Tao
- 2022 Panel: Panel 6B-3: Exponential Family Model-Based… & Deep Generalized Schrödinger…
  Guan-Horng Liu · Gene Li
- 2022 : Invited Talk: Guan-Horng Liu
  Guan-Horng Liu
- 2021 Poster: Second-Order Neural ODE Optimizer
  Guan-Horng Liu · Tianrong Chen · Evangelos Theodorou
- 2020 : Contributed talks in Session 2 (Zoom)
  Martin Takac · Samuel Horváth · Guan-Horng Liu · Nicolas Loizou · Sharan Vaswani
- 2020 : Contributed Video: DDPNOpt: Differential Dynamic Programming Neural Optimizer, Guan-Horng Liu
  Guan-Horng Liu
- 2020 : Poster Session 1 (gather.town)
  Laurent Condat · Tiffany Vlaar · Ohad Shamir · Mohammadi Zaki · Zhize Li · Guan-Horng Liu · Samuel Horváth · Mher Safaryan · Yoni Choukroun · Kumar Shridhar · Nabil Kahale · Jikai Jin · Pratik Kumar Jawanpuria · Gaurav Kumar Yadav · Kazuki Koyama · Junyoung Kim · Xiao Li · Saugata Purkayastha · Adil Salim · Dighanchal Banerjee · Peter Richtarik · Lakshman Mahto · Tian Ye · Bamdev Mishra · Huikang Liu · Jiajie Zhu