Timezone: »
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
Yuandong Ding · Mingxiao Feng · Guozi Liu · Wei Jiang · Chuheng Zhang · Li Zhao · Lei Song · Houqiang Li · Yan Jin · Jiang Bian
In this paper, we consider the inventory management~(IM) problem where we need to make replenishment decisions for a large number of stock keeping units (SKUs) to balance their supply and demand. In our setting, the constraint on the shared resources (such as the inventory capacity) couples the otherwise independent control for each SKU. We formulate the problem with this structure as Shared-Resource Stochastic Game (SRSG) and propose an efficient algorithm called Context-aware Decentralized PPO (CD-PPO). Through extensive experiments, we demonstrate that CD-PPO can accelerate the learning procedure compared with standard MARL algorithms.
Author Information
Yuandong Ding (Huazhong University of Science and Technology)
Mingxiao Feng (University of Science and Technology of China)
Guozi Liu (Carnegie Mellon University)
Wei Jiang (University of Illinois at Urbana-Champaign)
Chuheng Zhang (Tsinghua University)
Li Zhao (Microsoft Research)
Lei Song (Microsoft)
Houqiang Li (University of Science and Technology of China)
Yan Jin (Huazhong University of Science & Technology)
Jiang Bian (Microsoft)
Related Events (a corresponding poster, oral, or spotlight)
-
2022 : Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management »
Dates n/a. Room
More from the Same Authors
-
2022 Poster: An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context »
Xiaoyu Chen · Xiangming Zhu · Yufeng Zheng · Pushi Zhang · Li Zhao · Wenxue Cheng · Peng CHENG · Yongqiang Xiong · Tao Qin · Jianyu Chen · Tie-Yan Liu -
2022 Poster: LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning »
Mingyu Yang · Jian Zhao · Xunhan Hu · Wengang Zhou · Jiangcheng Zhu · Houqiang Li -
2022 Poster: Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret »
Jiawei Huang · Li Zhao · Tao Qin · Wei Chen · Nan Jiang · Tie-Yan Liu -
2022 Spotlight: Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret »
Jiawei Huang · Li Zhao · Tao Qin · Wei Chen · Nan Jiang · Tie-Yan Liu -
2022 Spotlight: Lightning Talks 4A-1 »
Jiawei Huang · Su Jia · Abdurakhmon Sadiev · Ruomin Huang · Yuanyu Wan · Denizalp Goktas · Jiechao Guan · Andrew Li · Wei-Wei Tu · Li Zhao · Amy Greenwald · Jiawei Huang · Dmitry Kovalev · Yong Liu · Wenjie Liu · Peter Richtarik · Lijun Zhang · Zhiwu Lu · R Ravi · Tao Qin · Wei Chen · Hu Ding · Nan Jiang · Tie-Yan Liu -
2022 Spotlight: Lightning Talks 3A-3 »
Xu Yan · Zheng Dong · Qiancheng Fu · Jing Tan · Hezhen Hu · Fukun Yin · Weilun Wang · Ke Xu · Heshen Zhan · Wen Liu · Qingshan Xu · Xiaotong Zhao · Chaoda Zheng · Ziheng Duan · Zilong Huang · Xintian Shi · Wengang Zhou · Yew Soon Ong · Pei Cheng · Hujun Bao · Houqiang Li · Wenbing Tao · Jiantao Gao · Bin Kang · Weiwei Xu · Limin Wang · Ruimao Zhang · Tao Chen · Gang Yu · Rynson Lau · Shuguang Cui · Zhen Li -
2022 Spotlight: Hand-Object Interaction Image Generation »
Hezhen Hu · Weilun Wang · Wengang Zhou · Houqiang Li -
2022 Poster: Hand-Object Interaction Image Generation »
Hezhen Hu · Weilun Wang · Wengang Zhou · Houqiang Li -
2022 Poster: Efficient and Effective Multi-task Grouping via Meta Learning on Task Combinations »
Xiaozhuang Song · Shun Zheng · Wei Cao · James Yu · Jiang Bian -
2021 Poster: Curriculum Offline Imitating Learning »
Minghuan Liu · Hanye Zhao · Zhengyu Yang · Jian Shen · Weinan Zhang · Li Zhao · Tie-Yan Liu -
2021 Poster: PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning »
Tao Yu · Cuiling Lan · Wenjun Zeng · Mingxiao Feng · Zhizheng Zhang · Zhibo Chen -
2021 Poster: Distributional Reinforcement Learning for Multi-Dimensional Reward Functions »
Pushi Zhang · Xiaoyu Chen · Li Zhao · Wei Xiong · Tao Qin · Tie-Yan Liu -
2021 Poster: Dual Progressive Prototype Network for Generalized Zero-Shot Learning »
Chaoqun Wang · Shaobo Min · Xuejin Chen · Xiaoyan Sun · Houqiang Li -
2021 Poster: Contextual Similarity Aggregation with Self-attention for Visual Re-ranking »
Jianbo Ouyang · Hui Wu · Min Wang · Wengang Zhou · Houqiang Li -
2021 Poster: Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning »
Jongjin Park · Younggyo Seo · Chang Liu · Li Zhao · Tao Qin · Jinwoo Shin · Tie-Yan Liu -
2021 Poster: Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training »
Hongwei Xue · Yupan Huang · Bei Liu · Houwen Peng · Jianlong Fu · Houqiang Li · Jiebo Luo -
2020 Poster: Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method »
Qi Zhou · Yufei Kuang · Zherui Qiu · Houqiang Li · Jie Wang -
2020 Poster: MESA: Boost Ensemble Imbalanced Learning with MEta-SAmpler »
Zhining Liu · Pengfei Wei · Jing Jiang · Wei Cao · Jiang Bian · Yi Chang -
2020 Poster: RD$^2$: Reward Decomposition with Representation Decomposition »
Zichuan Lin · Derek Yang · Li Zhao · Tao Qin · Guangwen Yang · Tie-Yan Liu -
2019 Poster: Fully Parameterized Quantile Function for Distributional Reinforcement Learning »
Derek Yang · Li Zhao · Zichuan Lin · Tao Qin · Jiang Bian · Tie-Yan Liu -
2019 Poster: Distributional Reward Decomposition for Reinforcement Learning »
Zichuan Lin · Li Zhao · Derek Yang · Tao Qin · Tie-Yan Liu · Guangwen Yang