Timezone: »
Spotlight
Distributed Exploration in Multi-Armed Bandits
Eshcar Hillel · Zohar Karnin · Tomer Koren · Ronny Lempel · Oren Somekh
Sun Dec 08 10:14 AM -- 10:18 AM (PST) @ Harvey's Convention Center Floor, CC
We study exploration in Multi-Armed Bandits (MAB) in a setting where~$k$ players collaborate in order to identify an $\epsilon$-optimal arm. Our motivation comes from recent employment of MAB algorithms in computationally intensive, large-scale applications. Our results demonstrate a non-trivial tradeoff between the number of arm pulls required by each of the players, and the amount of communication between them. In particular, our main result shows that by allowing the $k$ players to communicate \emph{only once}, they are able to learn $\sqrt{k}$ times faster than a single player. That is, distributing learning to $k$ players gives rise to a factor~$\sqrt{k}$ parallel speed-up. We complement this result with a lower bound showing this is in general the best possible. On the other extreme, we present an algorithm that achieves the ideal factor $k$ speed-up in learning performance, with communication only logarithmic in~$1/\epsilon$.
Author Information
Eshcar Hillel (Yahoo! Labs)
Zohar Karnin (Yahoo Research)
Tomer Koren (Technion)
Ronny Lempel (Yahoo! Labs)
Oren Somekh (Yahoo! Labs)
Related Events (a corresponding poster, oral, or spotlight)
-
2013 Poster: Distributed Exploration in Multi-Armed Bandits »
Sun. Dec 8th through Mon the 9th Room Harrah's Special Events Center, 2nd Floor
More from the Same Authors
-
2016 Poster: Online Pricing with Strategic and Patient Buyers »
Michal Feldman · Tomer Koren · Roi Livni · Yishay Mansour · Aviv Zohar -
2016 Poster: The Limits of Learning with Missing Data »
Brian Bullins · Elad Hazan · Tomer Koren -
2016 Poster: Multi-armed Bandits: Competing with Optimal Sequences »
Zohar Karnin · Oren Anava -
2016 Poster: Verification Based Solution for Structured MAB Problems »
Zohar Karnin -
2015 Poster: Copeland Dueling Bandits »
Masrour Zoghi · Zohar Karnin · Shimon Whiteson · Maarten de Rijke -
2015 Poster: Fast Rates for Exp-concave Empirical Risk Minimization »
Tomer Koren · Kfir Y. Levy -
2015 Poster: Bandit Smooth Convex Optimization: Improving the Bias-Variance Tradeoff »
Ofer Dekel · Ronen Eldan · Tomer Koren -
2015 Spotlight: Bandit Smooth Convex Optimization: Improving the Bias-Variance Tradeoff »
Ofer Dekel · Ronen Eldan · Tomer Koren -
2014 Poster: The Blinded Bandit: Learning with Adaptive Feedback »
Ofer Dekel · Elad Hazan · Tomer Koren -
2013 Poster: Near-Optimal Entrywise Sampling for Data Matrices »
Dimitris Achlioptas · Zohar Karnin · Edo Liberty -
2011 Poster: Beating SGD: Learning SVMs in Sublinear Time »
Elad Hazan · Tomer Koren · Nati Srebro