Timezone: »
Poster
Dueling Bandits: Beyond Condorcet Winners to General Tournament Solutions
Siddartha Ramamohan · Arun Rajkumar · Shivani Agarwal · Shivani Agarwal
Recent work on deriving $O(\log T)$ anytime regret bounds for stochastic dueling bandit problems has considered mostly Condorcet winners, which do not always exist, and more recently, winners defined by the Copeland set, which do always exist. In this work, we consider a broad notion of winners defined by tournament solutions in social choice theory, which include the Copeland set as a special case but also include several other notions of winners such as the top cycle, uncovered set, and Banks set, and which, like the Copeland set, always exist. We develop a family of UCB-style dueling bandit algorithms for such general tournament solutions, and show $O(\log T)$ anytime regret bounds for them. Experiments confirm the ability of our algorithms to achieve low regret relative to the target winning set of interest.
Author Information
Siddartha Ramamohan (Indian Institute of Science)
Arun Rajkumar (Indian Institute of Technology Madras)
Shivani Agarwal (Radcliffe Institute)
Shivani Agarwal (University of Pennsylvania)
More from the Same Authors
-
2020 Poster: Bayes Consistency vs. H-Consistency: The Interplay between Surrogate Loss Functions and the Scoring Function Class »
Mingyuan Zhang · Shivani Agarwal -
2020 Spotlight: Bayes Consistency vs. H-Consistency: The Interplay between Surrogate Loss Functions and the Scoring Function Class »
Mingyuan Zhang · Shivani Agarwal -
2020 Poster: Choice Bandits »
Arpit Agarwal · Nicholas Johnson · Shivani Agarwal -
2014 Workshop: Analysis of Rank Data: Confluence of Social Choice, Operations Research, and Machine Learning »
Shivani Agarwal · Hossein Azari Soufiani · Guy Bresler · Sewoong Oh · David Parkes · Arun Rajkumar · Devavrat Shah -
2014 Poster: On the Statistical Consistency of Plug-in Classifiers for Non-decomposable Performance Measures »
Harikrishna Narasimhan · Rohit Vaish · Shivani Agarwal -
2014 Poster: Online Decision-Making in General Combinatorial Spaces »
Arun Rajkumar · Shivani Agarwal -
2013 Poster: Convex Calibrated Surrogates for Low-Rank Loss Matrices with Applications to Subset Ranking Losses »
Harish G Ramaswamy · Shivani Agarwal · Ambuj Tewari -
2013 Poster: On the Relationship Between Binary Classification, Bipartite Ranking, and Binary Class Probability Estimation »
Harikrishna Narasimhan · Shivani Agarwal -
2013 Spotlight: On the Relationship Between Binary Classification, Bipartite Ranking, and Binary Class Probability Estimation »
Harikrishna Narasimhan · Shivani Agarwal -
2013 Spotlight: Convex Calibrated Surrogates for Low-Rank Loss Matrices with Applications to Subset Ranking Losses »
Harish G Ramaswamy · Shivani Agarwal · Ambuj Tewari -
2012 Poster: Classification Calibration Dimension for General Multiclass Losses »
Harish G Ramaswamy · Shivani Agarwal -
2012 Spotlight: Classification Calibration Dimension for General Multiclass Losses »
Harish G Ramaswamy · Shivani Agarwal -
2009 Workshop: Advances in Ranking »
Shivani Agarwal · Chris J Burges · Yacov Crammer