Toggle Poster Visibility
Poster
Tue Dec 10 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #192
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
Poster
Tue Dec 10 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #193
Limiting Extrapolation in Linear Approximate Value Iteration
Poster
Tue Dec 10 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #194
Propagating Uncertainty in Reinforcement Learning via Wasserstein Barycenters
Poster
Tue Dec 10 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #195
Provably Efficient Q-Learning with Low Switching Cost
Poster
Tue Dec 10 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #196
Regret Bounds for Learning State Representations in Reinforcement Learning
Poster
Tue Dec 10 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #197
Safe Exploration for Interactive Machine Learning
Poster
Tue Dec 10 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #198
Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning
[
Paper]
[
3 min Video]
Poster
Tue Dec 10 05:30 PM -- 07:30 PM (PST) @ East Exhibition Hall B + C #178
Almost Horizon-Free Structure-Aware Best Policy Identification with a Generative Model
Poster
Tue Dec 10 05:30 PM -- 07:30 PM (PST) @ East Exhibition Hall B + C #179
Better Exploration with Optimistic Actor Critic
Poster
Tue Dec 10 05:30 PM -- 07:30 PM (PST) @ East Exhibition Hall B + C #180
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle
Poster
Tue Dec 10 05:30 PM -- 07:30 PM (PST) @ East Exhibition Hall B + C #181
Explicit Planning for Efficient Exploration in Reinforcement Learning
Poster
Tue Dec 10 05:30 PM -- 07:30 PM (PST) @ East Exhibition Hall B + C #182
Exploration Bonus for Regret Minimization in Discrete and Continuous Average Reward MDPs
Poster
Tue Dec 10 05:30 PM -- 07:30 PM (PST) @ East Exhibition Hall B + C #183
Information-Theoretic Confidence Bounds for Reinforcement Learning
Poster
Tue Dec 10 05:30 PM -- 07:30 PM (PST) @ East Exhibition Hall B + C #184
Worst-Case Regret Bounds for Exploration via Randomized Value Functions