Poster
|
Wed 16:30
|
An Equivalence Between Static and Dynamic Regret Minimization
Andrew Jacobsen · Francesco Orabona
|
|
Poster
|
Wed 16:30
|
Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
Victor Boone · Zihan Zhang
|
|
Poster
|
Fri 11:00
|
Federated Online Prediction from Experts with Differential Privacy: Separations and Regret Speed-ups
Fengyu Gao · Ruiquan Huang · Jing Yang
|
|
Workshop
|
|
Robust Offline Learning via Adversarial World Models
Uljad Berdica · Kelvin Li · Michael Beukman · Alexander D. Goldie · Perla Maiolino · Jakob Foerster
|
|
Workshop
|
|
Order-Optimal Regret in Distributed Kernel Bandits using Uniform Sampling with Shared Randomness
Nikola Pavlovic · Sudeep Salgia · Qing Zhao
|
|
Workshop
|
|
Regulation of Algorithmic Collusion, Refined: Testing Worst-case Calibrated Regret
Jason Hartline · Chang Wang · Chenhao Zhang
|
|
Poster
|
Thu 11:00
|
Regret Minimization in Stackelberg Games with Side Information
Keegan Harris · Steven Wu · Maria-Florina Balcan
|
|
Workshop
|
|
An Information-Theoretic Analysis of Thompson Sampling for Logistic Bandits
Amaury Gouverneur · Borja Rodríguez Gálvez · Tobias Oechtering · Mikael Skoglund
|
|
Poster
|
Wed 16:30
|
Bayesian Optimisation with Unknown Hyperparameters: Regret Bounds Logarithmically Closer to Optimal
Juliusz Ziomek · Masaki Adachi · Michael A Osborne
|
|
Poster
|
Thu 16:30
|
Improved Regret for Bandit Convex Optimization with Delayed Feedback
Yuanyu Wan · Chang Yao · Mingli Song · Lijun Zhang
|
|
Poster
|
Wed 11:00
|
A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of Θ(T2/3) and its Application to Best-of-Both-Worlds
Taira Tsuchiya · Shinji Ito
|
|
Poster
|
Fri 11:00
|
Is O(log N) practical? Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RL
Enoch H. Kang · P. R. Kumar
|
|