Timezone: »
The factorization of state-action value functions for Multi-Agent Reinforcement Learning (MARL) is important. Existing studies are limited by their representation capability, sample efficiency, and approximation error. To address these challenges, we propose, ResQ, a MARL value function factorization method, which can find the optimal joint policy for any state-action value function through residual functions. ResQ masks some state-action value pairs from a joint state-action value function, which is transformed as the sum of a main function and a residual function. ResQ can be used with mean-value and stochastic-value RL. We theoretically show that ResQ can satisfy both the individual global max (IGM) and the distributional IGM principle without representation limitations. Through experiments on matrix games, the predator-prey, and StarCraft benchmarks, we show that ResQ can obtain better results than multiple expected/stochastic value factorization methods.
Author Information
Siqi Shen (Xiamen University)
Mengwei Qiu (Xiamen University)
Jun Liu (Xiamen University)
Weiquan Liu (Xiamen University)
Yongquan Fu (National University of Defense Technology)
I am an associate professor (Master Supervisor) in National Key Laboratory for Parallel and Distributed Processing & College of Computer Science, at National University of Defense Technology. My research focuses on network measurement and performance optimization for data center and geo-distributed networking systems. I am particularly interested in solving problems motivated by Online Data-intensitve applications, online social networks and large-scale data.
Xinwang Liu (National University of Defense Technology)
Cheng Wang (Xiamen University, Tsinghua University)
Related Events (a corresponding poster, oral, or spotlight)
-
2022 Poster: ResQ: A Residual Q Function-based Approach for Multi-Agent Reinforcement Learning Value Factorization »
Thu. Dec 1st 05:00 -- 07:00 PM Room Hall J #402
More from the Same Authors
-
2022 Poster: Align then Fusion: Generalized Large-scale Multi-view Clustering with Anchor Matching Correspondences »
Siwei Wang · Xinwang Liu · Suyuan Liu · Jiaqi Jin · Wenxuan Tu · Xinzhong Zhu · En Zhu -
2023 Poster: E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning »
Xiuhong Lin · Changjie Qiu · zhipeng cai · Siqi Shen · Yu Zang · Weiquan Liu · Xuesheng Bian · Matthias Müller · Cheng Wang -
2023 Poster: On the Properties of Kullback-Leibler Divergence Between Multivariate Gaussian Distributions »
Yufeng Zhang · Jialu Pan · Wanwei Liu · Zhenbang Chen · Xinwang Liu · J Wang · Li Ken Li -
2023 Poster: RiskQ: Risk-sensitive Multi-Agent Reinforcement Learning Value Factorization »
Siqi Shen · Chennan Ma · Chao Li · Weiquan Liu · Yongquan Fu · Songzhu Mei · Xinwang Liu · Cheng Wang -
2022 Spotlight: Stability and Generalization of Kernel Clustering: from Single Kernel to Multiple Kernel »
Weixuan Liang · Xinwang Liu · Yong Liu · sihang zhou · Jun-Jie Huang · Siwei Wang · Jiyuan Liu · Yi Zhang · En Zhu -
2022 Spotlight: Lightning Talks 1A-4 »
Siwei Wang · Jing Liu · Nianqiao Ju · Shiqian Li · Eloïse Berthier · Muhammad Faaiz Taufiq · Arsene Fansi Tchango · Chen Liang · Chulin Xie · Jordan Awan · Jean-Francois Ton · Ziad Kobeissi · Wenguan Wang · Xinwang Liu · Kewen Wu · Rishab Goel · Jiaxu Miao · Suyuan Liu · Julien Martel · Ruobin Gong · Francis Bach · Chi Zhang · Rob Cornish · Sanmi Koyejo · Zhi Wen · Yee Whye Teh · Yi Yang · Jiaqi Jin · Bo Li · Yixin Zhu · Vinayak Rao · Wenxuan Tu · Gaetan Marceau Caron · Arnaud Doucet · Xinzhong Zhu · Joumana Ghosn · En Zhu -
2022 Spotlight: Align then Fusion: Generalized Large-scale Multi-view Clustering with Anchor Matching Correspondences »
Siwei Wang · Xinwang Liu · Suyuan Liu · Jiaqi Jin · Wenxuan Tu · Xinzhong Zhu · En Zhu -
2022 Poster: HSurf-Net: Normal Estimation for 3D Point Clouds by Learning Hyper Surfaces »
Qing Li · Yu-Shen Liu · Jin-San Cheng · Cheng Wang · Yi Fang · Zhizhong Han -
2022 Poster: Stability and Generalization of Kernel Clustering: from Single Kernel to Multiple Kernel »
Weixuan Liang · Xinwang Liu · Yong Liu · sihang zhou · Jun-Jie Huang · Siwei Wang · Jiyuan Liu · Yi Zhang · En Zhu -
2019 Poster: Effective End-to-end Unsupervised Outlier Detection via Inlier Priority of Discriminative Network »
Siqi Wang · Yijie Zeng · Xinwang Liu · En Zhu · Jianping Yin · Chuanfu Xu · Marius Kloft