Timezone: »
Poster
Online Allocation and Learning in the Presence of Strategic Agents
Steven Yin · Shipra Agrawal · Assaf Zeevi
We study the problem of allocating $T$ sequentially arriving items among $n$ homogenous agents under the constraint that each agent must receive a prespecified fraction of all items, with the objective of maximizing the agents' total valuation of items allocated to them. The agents' valuations for the item in each round are assumed to be i.i.d. but their distribution is apriori unknown to the central planner.vTherefore, the central planner needs to implicitly learn these distributions from the observed values in order to pick a good allocation policy. However, an added challenge here is that the agents are strategic with incentives to misreport their valuations in order to receive better allocations. This sets our work apart both from the online auction mechanism design settings which typically assume known valuation distributions and/or involve payments, and from the online learning settings that do not consider strategic agents. To that end, our main contribution is an online learning based allocation mechanism that is approximately Bayesian incentive compatible, and when all agents are truthful, guarantees a sublinear regret for individual agents' utility compared to that under the optimal offline allocation policy.
Author Information
Steven Yin (Scriptus.app)
Shipra Agrawal (Columbia University)
Assaf Zeevi (Columbia University)
More from the Same Authors
-
2021 Spotlight: A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms »
Anand Kalvit · Assaf Zeevi -
2023 : Regret Bounds for Optimistic Follow The Leader: Applications in Portfolio Selection and Linear Regression »
Sudeep Raja Putta · Shipra Agrawal -
2023 Poster: Dynamic Pricing and Learning with Bayesian Persuasion »
Shipra Agrawal · Yiding Feng · Wei Tang -
2022 Poster: Dynamic Learning in Large Matching Markets »
Anand Kalvit · Assaf Zeevi -
2022 Poster: Optimal Efficiency-Envy Trade-Off via Optimal Transport »
Steven Yin · Christian Kroer -
2021 : Machine Learning for Combinatorial Optimization + Q&A »
Maxime Gasse · Simon Bowly · Chris Cameron · Quentin Cappart · Jonas Charfreitag · Laurent Charlin · Shipra Agrawal · Didier Chetelat · Justin Dumouchelle · Ambros Gleixner · Aleksandr Kazachkov · Elias Khalil · Pawel Lichocki · Andrea Lodi · Miles Lubin · Christopher Morris · Dimitri Papageorgiou · Augustin Parjadis · Sebastian Pokutta · Antoine Prouvost · Yuandong Tian · Lara Scavuzzo · Giulia Zarpellon -
2021 Poster: A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms »
Anand Kalvit · Assaf Zeevi -
2020 Poster: Towards Problem-dependent Optimal Learning Rates »
Yunbei Xu · Assaf Zeevi -
2020 Spotlight: Towards Problem-dependent Optimal Learning Rates »
Yunbei Xu · Assaf Zeevi -
2020 Poster: From Finite to Countable-Armed Bandits »
Anand Kalvit · Assaf Zeevi -
2019 : Learning in structured MDPs with convex cost function: improved regret bounds for inventory management »
Shipra Agrawal -
2017 Poster: Optimistic posterior sampling for reinforcement learning: worst-case regret bounds »
Shipra Agrawal · Randy Jia -
2017 Spotlight: Posterior sampling for reinforcement learning: worst-case regret bounds »
Shipra Agrawal · Randy Jia -
2014 Poster: Stochastic Multi-Armed-Bandit Problem with Non-stationary Rewards »
Omar Besbes · Yonatan Gur · Assaf Zeevi